Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix dequant mixed
#4675 opened Jun 11, 2026 by irexyc Collaborator Loading…
Pading one more block for fa3 prefill
#4674 opened Jun 11, 2026 by RunningLeon Collaborator Draft
Batch invariant support PART1
#4666 opened Jun 10, 2026 by grimoire Collaborator Loading…
refactor: unify interleaved MRoPE rotary embedding
#4644 opened Jun 3, 2026 by CUHKSZzxy Collaborator Draft
Add multimodal preprocessing metrics
#4640 opened Jun 1, 2026 by CUHKSZzxy Collaborator Loading…
support disaggregated weight update planned feature
#4638 opened May 29, 2026 by irexyc Collaborator Loading…
TEST: Improve tool test
#4632 opened May 28, 2026 by littlegy Contributor Loading…
[WIP] Interleave long-context prefill chunks with decode
#4631 opened May 28, 2026 by grimoire Collaborator Draft
1 task
modify save model in lite module improvement
#4624 opened May 26, 2026 by 43758726 Contributor Loading…
Refactor prefix caching improvement
#4618 opened May 24, 2026 by grimoire Collaborator Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Contributor Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy Contributor Loading…
docs(advance): add Add a New Speculative Decoding Method guide documentation Improvements or additions to documentation
#4589 opened May 17, 2026 by SuperMarioYL Loading…
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen Collaborator Draft
[security] fix(proxy): require auth for node management
#4579 opened May 11, 2026 by Hinotoi-agent Loading…
5 of 9 tasks
feat: configure cudagraph capture batch sizes improvement
#4573 opened May 8, 2026 by CUHKSZzxy Collaborator Loading…
Fix health latency under concurrent VL request preparation Bug:P0
#4570 opened May 7, 2026 by CUHKSZzxy Collaborator Loading…
LLM evaluation skill on text datasets
#4566 opened Apr 30, 2026 by lvhan028 Collaborator Loading…
[Feature] Add guided decoding support for speculative decoding enhancement New feature or request
#4559 opened Apr 28, 2026 by windreamer Collaborator Loading…
4 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.