-
Notifications
You must be signed in to change notification settings - Fork 578
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
optimization of perKVhead quantization
cla signed
fb-exported
#4161
opened May 20, 2025 by
Aya-ZIbra
Loading…
support filling partial rows from backend
cla signed
fb-exported
#4158
opened May 20, 2025 by
duduyi2013
Loading…
Leverage fuse kernel in inference workload (#1237)
cla signed
fb-exported
#4157
opened May 20, 2025 by
ycui1984
Loading…
Jemalloc Mempool and Adaptation for CPU HASHTABLE
cla signed
#4154
opened May 20, 2025 by
ArronHZG
Loading…
Add more parameter specializations for autovec TBE kernels
cla signed
fb-exported
#4153
opened May 20, 2025 by
excelle08
Loading…
improve read/write performance by 100%
cla signed
fb-exported
#4150
opened May 19, 2025 by
steven1327
Loading…
Make iter persistent for AdagradW
cla signed
fb-exported
#4147
opened May 17, 2025 by
minhua-chen
Loading…
support get state dict and apply state dict
cla signed
fb-exported
#4145
opened May 17, 2025 by
emlin
Loading…
Simplify grouped gemm output allocations
cla signed
fb-exported
#4134
opened May 16, 2025 by
jwfromm
Loading…
Update the rowwise adagrad optimizer to leverage optimizer state offloading, v3
cla signed
fb-exported
#4133
opened May 15, 2025 by
q10
Loading…
Add TBE data configuration reporter to TBE forward"
cla signed
fb-exported
#4130
opened May 15, 2025 by
gchalump
Loading…
Trim constexpr from isA to improve Windows clang-cl support.
cla signed
#4119
opened May 13, 2025 by
ScottTodd
Loading…
Replace
C10_CUDA_KERNEL_LAUNCH_CHECK()
in the KernelLauncher
cla signed
fb-exported
#4097
opened May 8, 2025 by
q10
Loading…
Do FP8 rowwise bias addition in higher precision
cla signed
fb-exported
#4095
opened May 8, 2025 by
jwfromm
Loading…
Use bounds_check_indices v2 on ROCm
ciflow/rocm
cla signed
fb-exported
module: rocm
#4085
opened May 6, 2025 by
sryap
Loading…
Change GenAI OSS runner to fix OOM
cla signed
fb-exported
#4082
opened May 6, 2025 by
spcyppt
Loading…
Migrate TBE backward kernels to
FBGEMM_LAUNCH_KERNEL
cla signed
fb-exported
#4076
opened May 5, 2025 by
q10
Loading…
Back out "Simplify weight row cache load and evict routines"
cla signed
fb-exported
#4064
opened May 1, 2025 by
q10
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-18.