Pull requests: InternLM/lmdeploy
Pull requests list
Update dockerfile by removing cu11 and changing cu12.4 to cu12.6
#4398
opened Mar 6, 2026 by
lvhan028
[Fix][Feat] Fix worker sorting with external pg bundles & Support persistent buffer for update_params
#4397
opened Mar 6, 2026 by
CyCle1024
Use pyupgrade and ruff to modernize LMDeploy Python Code
#4392
opened Mar 3, 2026 by
windreamer
[Feature] Add TurboMind support for Qwen3.5 models (dense + MoE)
enhancement
New feature or request
#4389
opened Mar 2, 2026 by
lapy
2 of 4 tasks
Support MiniMax-M2 in TurboMind engine
enhancement
New feature or request
#4343
opened Feb 10, 2026 by
zh-nj
add preliminary support for EP(single-node) of turbomind backend
enhancement
New feature or request
#4332
opened Feb 6, 2026 by
irexyc
Change Ascend paged attention from BSH format to TND format for better performance
#4295
opened Jan 27, 2026 by
jinminxi104
Draft
support repetition ngram logits processor
enhancement
New feature or request
#4288
opened Jan 23, 2026 by
grimoire
Add step_map to track token decoding order in DLLM
#4057
opened Oct 21, 2025 by
Auraithm
4 tasks done