-
Notifications
You must be signed in to change notification settings - Fork 428
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
docs(skills): list .agents/clusters.yaml as canonical in deployment + env-setup hints
#1643
opened Jun 6, 2026 by
Edwardf0t1
Contributor
Loading…
[OMNIML-4962] specdec_bench cell t0_d3 — Qwen/Qwen3.5-4B / DFlash / vLLM
#1638
opened Jun 5, 2026 by
ChenhanYu
Collaborator
Loading…
docs(skills): fix VLM PTQ ViT-quantization + AA eval judge/tool-call gaps
#1632
opened Jun 5, 2026 by
Edwardf0t1
Contributor
Loading…
fix(export): correct unified_export_megatron at EP > 1 and DP > 1
#1631
opened Jun 4, 2026 by
yueshen2016
Contributor
Loading…
3 of 4 tasks
[6058907] Fix ShapeInferenceError in ONNX int8+fp16 quantization of weakly-typed models
#1627
opened Jun 4, 2026 by
ajrasane
Contributor
Loading…
DFlash speculative decoding for MiniMax-M2.7 (FSDP2): auto mask-token, FSDP2 resume fixes, per-checkpoint draft export
#1621
opened Jun 3, 2026 by
yeyu-nvidia
Contributor
Loading…
Add W4A16 NVFP4-MSE Qwen3.5 dense/MoE PTQ recipes
#1620
opened Jun 3, 2026 by
cjluo-nv
Collaborator
Loading…
Fix torch import error to remove circular dependency & move Nemotron configs
#1606
opened Jun 2, 2026 by
jenchen13
Contributor
Loading…
Add NVFP4 + QAD to the Nemotron-3-Nano-30B-A3B tutorial
#1601
opened Jun 2, 2026 by
kevalmorabia97
Collaborator
•
Draft
Skip softmax calibration via Triton kernel
#1597
opened Jun 2, 2026 by
rohansjoshi
Contributor
Loading…
Add day0-release orchestration skill with enforced gates
#1596
opened Jun 2, 2026 by
Edwardf0t1
Contributor
Loading…
Refactor local_hessian onto shared MSE flow + fused-MoE expert support
#1578
opened Jun 1, 2026 by
Fridah-nv
Contributor
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.