Use local workspace for Context
Summary: This commit gives Context a local (thread or stream) workspace, which provides a more elegant way to dispatch kernels that require scratch memory. In addition, the TF32 math type is provided as a cuDNN option for Ampere devices.
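
The idea of a context-local workspace can be sketched roughly as follows. This is a minimal host-only illustration, not the code from this commit: the class names `Workspace` and `Context`, the method names, and the helper `SumOfSquares` are all hypothetical, a `std::vector` stands in for device memory, and keying the workspace by (thread, stream id) is an assumption about what "local" means here.

```cpp
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical scratch arena: grows on demand and is reused across calls.
// A std::vector stands in for device memory in this host-only sketch.
class Workspace {
 public:
  // Return a buffer of at least `nbytes` bytes; contents are unspecified.
  void* data(std::size_t nbytes) {
    if (buffer_.size() < nbytes) buffer_.resize(nbytes);
    return buffer_.data();
  }
  std::size_t capacity() const { return buffer_.size(); }

 private:
  std::vector<std::uint8_t> buffer_;
};

// Hypothetical context bound to one stream of the calling thread.
class Context {
 public:
  explicit Context(int stream_id = 0) : stream_id_(stream_id) {}

  // One workspace per (thread, stream) pair: the map is thread_local
  // and the key is the stream id (an assumption of this sketch).
  Workspace* workspace() const {
    thread_local std::unordered_map<int, Workspace> workspaces;
    return &workspaces[stream_id_];
  }

 private:
  int stream_id_;
};

// A routine that needs scratch asks its context instead of a global pool.
inline float SumOfSquares(const float* x, int n, Context* ctx) {
  float* scratch =
      static_cast<float*>(ctx->workspace()->data(n * sizeof(float)));
  for (int i = 0; i < n; ++i) scratch[i] = x[i] * x[i];  // temporary results
  float sum = 0.f;
  for (int i = 0; i < n; ++i) sum += scratch[i];
  return sum;
}
```

With this layout, two contexts on the same thread and stream share one buffer, while contexts on different streams or threads never contend for scratch, so no locking is needed around kernel dispatch.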
1813 additions and 1654 deletions