Skip to content
  • P
    Projects
  • G
    Groups
  • S
    Snippets
  • Help

SeetaResearch / Dragon

  • This project
    • Loading...
  • Sign in
Go to a project
  • Project
  • Repository
  • Issues 0
  • Merge Requests 0
  • Pipelines
  • Wiki
  • Snippets
  • Settings
  • Activity
  • Graph
  • Charts
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
  • Files
  • Commits
  • Branches
  • Tags
  • Contributors
  • Graph
  • Compare
  • Charts
Switch branch/tag
  • dragon
  • dali
  • _api
  • ops
  • __init__.py
  • Ting PAN's avatar
    Implement softmax kernels via warp reduce · 654febe3
    Summary:
    This commit adds extra CUDA softmax kernels using warp reduce.
    Warp reduce leads to better performance when dimension <= 256,
    which is preferred for the recent vision transformers.
    Ting PAN committed Jun 26, 2021
    654febe3
__init__.py 2.05 KB
BlameHistoryPermalink