Skip to content

Pull requests: alibaba/rtp-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat(deps): unify pip deps via PEP 503 indexes + thin requirements
#962 opened Apr 30, 2026 by LLLLKKKK Collaborator Loading…
1 of 2 tasks
feat - add flashinfer fp8 gemm
#961 opened Apr 30, 2026 by zerozw Collaborator Loading…
feat: implement CpuTpBroadcaster for CPU-only tensor broadcasting
#960 opened Apr 30, 2026 by Vinkle-hzt Collaborator Loading…
add input embedding for pg
#956 opened Apr 30, 2026 by parkerpang Loading…
fix - make prepare_cg_spec_decode_kernel easy use and understand
#954 opened Apr 29, 2026 by zerozw Collaborator Loading…
fix - fix mtp target layer_to_groups size error
#953 opened Apr 29, 2026 by zerozw Collaborator Loading…
Qwen35 chunkgdn amd1
#950 opened Apr 29, 2026 by hxy0118 Collaborator Loading…
feat(p2p): 实现PD分离模式下的P2P KV Cache传输
#948 opened Apr 28, 2026 by ZhihanYan Collaborator Loading…
Develop/bailian
#946 opened Apr 28, 2026 by jianglan89 Collaborator Loading…
feat: suport hybrid pool kvcache allocator
#943 opened Apr 28, 2026 by SJTUGavinLiu Collaborator Loading…
feat: support kimi k2.6
#942 opened Apr 27, 2026 by Bruce-Lee-LY Collaborator Loading…
mooncake support p2p connector
#941 opened Apr 27, 2026 by Vincent-Bo-ali Collaborator Loading…
async schedule [2/N]: support async prepare
#936 opened Apr 26, 2026 by Vinkle-hzt Collaborator Loading…
fix: fix rocm greedy sampling to avoid crash
#932 opened Apr 24, 2026 by liaocz Collaborator Loading…
feat(rocm): MoRI EP (Expert Parallelism) support for MI355X
#931 opened Apr 24, 2026 by jacobwin-ai Collaborator Loading…
[fix] Handle enqueue failures in RPC and API paths
#929 opened Apr 23, 2026 by ZhihanYan Collaborator Loading…
Develop/fix int64
#927 opened Apr 23, 2026 by xinfei-shi Collaborator Loading…
feat: refactor py model device
#917 opened Apr 21, 2026 by JackTan25 Collaborator Loading…
Defer engine and RPC loop start until after full server init
#916 opened Apr 21, 2026 by xinfei-shi Collaborator Loading…
Feat/hybrid cp gdn
#906 opened Apr 17, 2026 by yang1556 Collaborator Loading…
feat: support input_embeddings in inference pipeline
#905 opened Apr 17, 2026 by KrisCheng9 Collaborator Loading…
optimize beam search
#903 opened Apr 16, 2026 by parkerpang Loading…
feat: support xgrammer
#902 opened Apr 16, 2026 by wanglining97 Collaborator Loading…
ProTip! What’s not been updated in a month: updated:<2026-04-01.