Pull requests: pytorch/FBGEMM
Pull requests list
Replace .packed_accessor32() with PTA_B() in intraining embedding pruning ops
cla signed
fb-exported
#4718
opened Aug 16, 2025 by
q10
fix typo: quantize_packed_fp8_symmetric -> dequantize_packed_fp8_symmetric (#1740)
cla signed
fb-exported
#4717
opened Aug 15, 2025 by
ColinPeppler
Extract the res backend to a separate class and export to python side
cla signed
fb-exported
#4714
opened Aug 15, 2025 by
chouxi
Fix quantize kernels on rocm 6.4
ciflow/rocm
cla signed
fb-exported
module: rocm
#4708
opened Aug 15, 2025 by
jwfromm
[ROCm] remove hipify work-around that is no longer needed
cla signed
module: rocm
#4705
opened Aug 14, 2025 by
jeffdaily
Support optimizer state offloading for the full Adam optimizer
cla signed
fb-exported
#4696
opened Aug 13, 2025 by
q10
Boost performance of MXFP4 quantization with inline PTX
cla signed
fb-exported
#4694
opened Aug 13, 2025 by
jiawenliu64
Fold unaligned vec4 load and store into function
cla signed
fb-exported
#4684
opened Aug 12, 2025 by
q10
Feature score eviction frontend support
cla signed
fb-exported
#4682
opened Aug 12, 2025 by
EddyLXJ
Adding trace to inference benchmark
cla signed
fb-exported
#4674
opened Aug 11, 2025 by
gchalump
Add TBE data configuration reporter to TBE forward (v3) (#4455)
cla signed
fb-exported
#4672
opened Aug 11, 2025 by
gchalump
Allowing small tiles to work on 2k*2k shapes
cla signed
fb-exported
#4669
opened Aug 11, 2025 by
RandySheriff
Dummy commit to test for errors
cla signed
fb-exported
#4668
opened Aug 10, 2025 by
ionuthristodorescu
Migrate sparse ops kernels to FBGEMM_LAUNCH_KERNEL, pt 7
cla signed
fb-exported
#4653
opened Aug 7, 2025 by
q10
implement jagged_unique_indices_cpu
cla signed
fb-exported
#4651
opened Aug 7, 2025 by
wangyh
Adding performance logger calls
cla signed
fb-exported
#4650
opened Aug 7, 2025 by
ionuthristodorescu
Add 'device-with-speclist' bench
cla signed
fb-exported
#4648
opened Aug 6, 2025 by
YanXiong-Meta
Add exhaustive autotune results for 500x IGCTR
cla signed
fb-exported
#4644
opened Aug 6, 2025 by
JChunX
Adding optional D multiplication to index shuffling
cla signed
fb-exported
#4631
opened Aug 1, 2025 by
sunfish2010