Pull requests: InternLM/lmdeploy

miss rdkit for intern-s models
#4587 opened May 14, 2026 by lvhan028 (Collaborator)
tool calling alignment with openai's spec (label: improvement)
#4585 opened May 13, 2026 by lvhan028 (Collaborator)
Add OpenAI Responses-compatible endpoint (label: enhancement)
#4582 opened May 13, 2026 by CUHKSZzxy (Collaborator)
[security] fix(proxy): require auth for node management
#4579 opened May 11, 2026 by Hinotoi-agent
5 of 9 tasks
[Improve]: Drain queues when sleep engine (label: improvement)
#4577 opened May 9, 2026 by RunningLeon (Collaborator)
feat: configure cudagraph capture batch sizes
#4573 opened May 8, 2026 by CUHKSZzxy (Collaborator), Draft
Fix health latency under concurrent VL request preparation (label: Bug:P0)
#4570 opened May 7, 2026 by CUHKSZzxy (Collaborator)
LLM evaluation skill on text datasets
#4566 opened Apr 30, 2026 by lvhan028 (Collaborator)
FP8 kv cache quantization (label: enhancement)
#4563 opened Apr 29, 2026 by CUHKSZzxy (Collaborator)
Add Qwen3.5 Moe lite awq (label: improvement)
#4561 opened Apr 28, 2026 by 43758726 (Collaborator)
[Feature] Add guided decoding support for speculative decoding (label: enhancement)
#4559 opened Apr 28, 2026 by windreamer (Collaborator), Draft
4 tasks done
[WIP]DeepSeek V4 support
#4554 opened Apr 24, 2026 by grimoire (Collaborator), Draft
Test: update sleep/wakeup and abort scenarios
#4528 opened Apr 15, 2026 by littlegy (Contributor)
style: add autopep8 pre-commit hook and apply PEP 8 formatting fixes
#4524 opened Apr 14, 2026 by windreamer (Collaborator)
make fp8 model quantized by llm-compressor can be inferenced in turbomind (label: enhancement)
#4509 opened Apr 8, 2026 by 43758726 (Collaborator)
Integrate deep-ep nccl backend (label: enhancement)
#4477 opened Mar 27, 2026 by irexyc (Collaborator)
feat: Turbomind linear gdn prefix caching (label: enhancement)
#4465 opened Mar 25, 2026 by lapy (Contributor)
refactor get_ppl (label: improvement)
#4461 opened Mar 25, 2026 by lvhan028 (Collaborator)
Support multi stop words (label: improvement)
#4454 opened Mar 24, 2026 by lvhan028 (Collaborator)
Support Qwen3 Omni (label: enhancement)
#4411 opened Mar 13, 2026 by CUHKSZzxy (Collaborator)
Add model deployment best practice section in user guide (label: documentation)
#4399 opened Mar 9, 2026 by lvhan028 (Collaborator), Draft
Improve proxy server (label: improvement)
#4354 opened Feb 12, 2026 by lvhan028 (Collaborator)
Support MiniMax-M2 in TurboMind engine (label: enhancement)
#4343 opened Feb 10, 2026 by zh-nj