Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support MCore auto_quantize calibration updates
#1639 opened Jun 5, 2026 by realAsma Contributor Draft
Add NVFP4 fakequant for attention BMMs
#1635 opened Jun 5, 2026 by kaix-nv Contributor Draft
fix(export): correct unified_export_megatron at EP > 1 and DP > 1
#1631 opened Jun 4, 2026 by yueshen2016 Contributor Loading…
3 of 4 tasks
Skip-Softmax calibration in vLLM
#1622 opened Jun 3, 2026 by kaix-nv Contributor Draft
Add W4A16 NVFP4-MSE Qwen3.5 dense/MoE PTQ recipes
#1620 opened Jun 3, 2026 by cjluo-nv Collaborator Loading…
[Feat]: Specdec Multinode Streaming
#1611 opened Jun 2, 2026 by h-guo18 Contributor Loading…
[OMNIML-3994] Add SharedQuantState
#1605 opened Jun 2, 2026 by sychen52 Contributor Loading…
Skip softmax calibration via Triton kernel
#1597 opened Jun 2, 2026 by rohansjoshi Contributor Loading…
Add day0-release orchestration skill with enforced gates
#1596 opened Jun 2, 2026 by Edwardf0t1 Contributor Loading…
Add Alpamayo-1 example
#1594 opened Jun 1, 2026 by rohansjoshi Contributor Loading…
ProTip! Filter pull requests by the default branch with base:main.