Pull requests: pytorch/FBGEMM
Add SVE-FP16 version of EmbeddingSpMDM8Bit
cla signed
#5702
opened Apr 27, 2026 by
ShuyangLiu
Add SVE-FP16 version of EmbeddingSpMDMNbit
cla signed
#5701
opened Apr 27, 2026 by
ShuyangLiu
Remove GPU sync stalls in _prefetch zero-row invalidation
cla signed
#5699
opened Apr 27, 2026 by
EddyLXJ
Contributor
Gate enrichment_policy by per-TBE embedding_cache_mode
cla signed
#5698
opened Apr 27, 2026 by
EddyLXJ
Contributor
[ROCm] support warpSize 32 and 64 in the same build
ciflow/rocm
cla signed
module: rocm
#5696
opened Apr 25, 2026 by
jeffdaily
Add per-feature pooling factors support (#5690)
cla signed
fb-exported
meta-exported
#5690
opened Apr 24, 2026 by
gregmacnamara
Remove unnecessary if __name__ == "__main__": unittest.main() boilerplate in deeplearning/fbgemm/fbgemm_gpu/test (#5689)
cla signed
fb-exported
meta-exported
#5689
opened Apr 24, 2026 by
meta-codesync
Bot
Add diagnostic output to debug OSS CI torch import failure
cla signed
fb-exported
meta-exported
#5686
opened Apr 23, 2026 by
gchalump
Contributor
Investigate OSS CI nightly failure: revert Python 3.10+ typing changes
ci-no-td
cla signed
fb-exported
meta-exported
#5685
opened Apr 23, 2026 by
gchalump
Contributor
Fix OSS CI nightly failures: setuptools downgrade + cu128 deprecation
cla signed
fb-exported
meta-exported
#5684
opened Apr 23, 2026 by
gchalump
Contributor
Refactor bounds_check_indices offset checks to condition-first (Phase 1)
cla signed
fb-exported
meta-exported
#5682
opened Apr 23, 2026 by
gchalump
Contributor
fbcode/deeplearning/fbgemm/fbgemm_gpu/test/tbe/utils/split_embeddings_utils_test.py
cla signed
fb-exported
meta-exported
#5680
opened Apr 23, 2026 by
meta-codesync
Bot
Fix OOM (exit code 137) in CI builds for CUDA 13.2+ (#5679)
cla signed
fb-exported
meta-exported
#5679
opened Apr 23, 2026 by
gchalump
Contributor
fbcode/deeplearning/fbgemm/fbgemm_gpu/test/tbe/dram_kv/dram_kv_test.py (#2620)
cla signed
fb-exported
meta-exported
#5678
opened Apr 23, 2026 by
meta-codesync
Bot
Exclude transient RES streaming buffers from checkpoints by setting persistent=False (#5674)
cla signed
fb-exported
meta-exported
#5674
opened Apr 22, 2026 by
FriedCosey
Add FP8 rowwise padding to quantized AllToAll pooled embeddings (#5673)
cla signed
fb-exported
meta-exported
#5673
opened Apr 22, 2026 by
RohanVardhan
fbcode/deeplearning/fbgemm/fbgemm_gpu/test/tbe/training/merge_vbe_test.py
cla signed
fb-exported
meta-exported
#5672
opened Apr 22, 2026 by
meta-codesync
Bot
Fix AMD build incompatibility and incorrect main_module paths
cla signed
fb-exported
meta-exported
#5668
opened Apr 21, 2026 by
q10
Contributor
log query empty count vs total count
cla signed
fb-exported
meta-exported
#5657
opened Apr 17, 2026 by
xywang9334
Fix VBE batch sizes not passed to request builder (#5653)
cla signed
fb-exported
meta-exported
#5653
opened Apr 17, 2026 by
gregmacnamara
Port merge_embeddings benchmark to tritonbench
cla signed
fb-exported
meta-exported
#5650
opened Apr 16, 2026 by
q10
Contributor
Validate total_num_blocks divisibility by my_size in block_bucketize (#5646)
cla signed
fb-exported
meta-exported
#5649
opened Apr 16, 2026 by
q10
Contributor
Fix bf16 rounding to IEEE 754 ties-to-even
cla signed
#5648
opened Apr 16, 2026 by
cyyever
Contributor
Add CPU support in fbgemm for FloatToFP8RowwiseQuantized and FP8RowwiseQuantizedToFloat (#5644)
cla signed
fb-exported
meta-exported
#5644
opened Apr 15, 2026 by
djjatmeta