Pull requests: NVIDIA-NeMo/Automodel
cp: Trigger Testing CICD
fix(qwen3_5): make dense VLM pipeline-parallel safe (2524) into r0.5.0
cherry-pick
Run CICD
#2554
opened Jun 13, 2026 by
svcnvidia-nemo-ci
Contributor
ci: flag RoPE/precision-buffer dtype hazards in automated PR review
docs-only
With great power comes great responsibility.
#2552
opened Jun 13, 2026 by
akoumpa
Contributor
cp: Trigger Testing CICD
feat(examples): add Nemotron-3-Ultra-550B benchmark and full-SFT recipes (2539) into r0.5.0
cherry-pick
Run CICD
#2550
opened Jun 13, 2026 by
svcnvidia-nemo-ci
Contributor
fix(models): keep RoPE frequency buffers fp32 under bf16 model cast
#2549
opened Jun 13, 2026 by
akoumpa
Contributor
feat(moe): mxfp4-resident MoE experts for DeepSeek-V4-Flash LoRA
community-request
#2548
opened Jun 12, 2026 by
excepshenal
Draft
1 of 3 tasks
ci(fern): run docs check on every /ok to test (drop paths filter)
docs-only
With great power comes great responsibility.
#2547
opened Jun 12, 2026 by
akoumpa
Contributor
ci: schedule ep-parallel finetune recipes at documented node counts
#2546
opened Jun 12, 2026 by
akoumpa
Contributor
1 of 3 tasks
fix(retrieval): gate dummy vision forward
#2545
opened Jun 12, 2026 by
yuhezhang-ai
Contributor
Draft
1 of 3 tasks
docs: Update diffusiongemma.mdx
docs-only
With great power comes great responsibility.
#2542
opened Jun 12, 2026 by
zyzhou5
Contributor
3 tasks
fix(diffusion): resolve flux nightly CI failures
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2529
opened Jun 11, 2026 by
pthombre
Contributor
2 of 3 tasks
fix(wandb): log different val datasets separately in wandb
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2526
opened Jun 11, 2026 by
grgkovac
Contributor
3 tasks done
ci: Update transformers to latest version 5.11.0
#2518
opened Jun 11, 2026 by
svcnvidia-nemo-ci
Contributor
docs: MSC cloud checkpointing + expose multi-storage-client under [s3]
community-request
docs-only
With great power comes great responsibility.
waiting-on-customer
Waiting on the original author to respond
#2517
opened Jun 11, 2026 by
edjson
Contributor
3 tasks done
feat(mimo_v25): support MiMo-V2.5-Pro
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2514
opened Jun 10, 2026 by
Simar-malhotra09
1 of 3 tasks
fix(checkpoint): preserve tied lm_head on resume
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2511
opened Jun 10, 2026 by
yuhezhang-ai
Contributor
3 tasks done
feat(mtp): enable MTP to accept pre-fused input embeddings for multimodal models
community-request
waiting-on-customer
Waiting on the original author to respond
#2510
opened Jun 10, 2026 by
Slyne
3 tasks done
feat(moe): MTP FLOPs accounting, inline shared experts, checkpoint warning fix
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2486
opened Jun 10, 2026 by
adil-a
Collaborator
3 tasks done
fix(moe): preserve fp32 A_log in Qwen3.5-MoE and Qwen3-Next GatedDeltaNet
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2484
opened Jun 10, 2026 by
yuhezhang-ai
Contributor
3 tasks done
fix: resolve nightly CI failures (FP8, custom-arch TP, ckpt, gemma3n, benchmark)
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2482
opened Jun 9, 2026 by
akoumpa
Contributor
perf(distributed): add retrieval tuning knobs
#2452
opened Jun 8, 2026 by
yuhezhang-ai
Contributor
2 of 3 tasks
perf(vlm): batch Nemotron VL image processing
#2451
opened Jun 8, 2026 by
yuhezhang-ai
Contributor
3 tasks done
feat(speculative): add SGLang target backend for EAGLE-3 training
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2449
opened Jun 8, 2026 by
khazic
Contributor
feat: support DeepSeek V4 context parallel training
#2445
opened Jun 7, 2026 by
HuiyingLi
Contributor