Pull requests: NVIDIA/TransformerEngine
[WIP] [PyTorch] Support dtype casting in fused adam
#977
opened Jul 1, 2024 by Wong4j
6 of 13 tasks
Add test for building without support for any DL frameworks
testing
Improvements to tests or testing infrastructure
#974
opened Jun 27, 2024 by timmoon10
6 of 14 tasks
[PyTorch] Runtime lookup for CUDA Driver API calls in Userbuffers
1.8.0
bug
Something isn't working
#970
opened Jun 26, 2024 by denera
8 of 13 tasks
[Paddle] Fix forward and backward logic of te.Linear(parallel_mode='column') to adapt DiT of PaddleMIX
#963
opened Jun 25, 2024 by yumin066
8 of 13 tasks
[C/PyTorch] Add support for bottom-right-diagonal causal mask
#960
opened Jun 25, 2024 by cyanguwa
5 tasks
[Paddle] Add deterministic option in DotProductAttention
#956
opened Jun 23, 2024 by Wong4j
8 of 13 tasks
Lower memory usage during AttnFuncWithCP.forward
#951
opened Jun 21, 2024 by i4never
8 of 13 tasks
[TE/JAX] Prototype for New XLA Custom Calls with FFI
enhancement
New feature or request
jax
#946
opened Jun 19, 2024 by phu0ngng
3 of 13 tasks
[PyTorch] Add option to pass kwargs to CUDA graph module
enhancement
New feature or request
#945
opened Jun 19, 2024 by timmoon10
9 of 13 tasks
Expose rotary_base as an arg instead of hardcoding
#944
opened Jun 18, 2024 by sudhakarsingh27
1 of 6 tasks
[MoE][Common/PyTorch] Add permutation
enhancement
New feature or request
#936
opened Jun 17, 2024 by StudyingShao
13 tasks
[Pytorch] Implement fp32 accumulation for attention with context parallel in both forward and backward pass.
#821
opened Apr 28, 2024 by Yuxin-CV
[PyTorch] Fix minor bug in computing num_gqa_groups_per_partition
bug
Something isn't working
#777
opened Apr 13, 2024 by knowlsie
[C/PyTorch] Refactor and move userbuffers into TE/common
#760
opened Apr 8, 2024 by denera
12 of 13 tasks
[PyTorch] Prototype for operation-based API
enhancement
New feature or request
#707
opened Mar 9, 2024 by timmoon10
2 of 6 tasks