-
Notifications
You must be signed in to change notification settings - Fork 189
Pull requests: NVIDIA-NeMo/Automodel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
cp: Trigger Testing CICD
fix(distributed): use flattened CP FSDP mesh (2768) into r0.5.0
cherry-pick
Run CICD
#2769
opened Jun 25, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
fix(benchmark): load nested AutoConfig via compatibility path
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2767
opened Jun 25, 2026 by
akoumpa
Contributor
Loading…
fix(distributed): control frozen multimodal FSDP sharding
#2763
opened Jun 25, 2026 by
yuhezhang-ai
Contributor
•
Draft
build(deps): move ffmpeg/opencv deps to opt-in media extra
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
fix(qwen3-moe): export grouped HF expert weights
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2741
opened Jun 23, 2026 by
akoumpa
Contributor
Loading…
ci: update claude review guidelines
#2739
opened Jun 23, 2026 by
akoumpa
Contributor
Loading…
3 tasks
fix(checkpoint): resolve tie_word_embeddings top-level-first to match HF tying
community-request
#2732
opened Jun 23, 2026 by
Achyuthan-S
Contributor
Loading…
feat(datasets): add DP-aware stateful dataloader
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2730
opened Jun 23, 2026 by
huahuajhu
Loading…
2 of 3 tasks
fix(security): Potential Path Traversal in Dataset Loading
community-request
#2713
opened Jun 22, 2026 by
tomaioo
Contributor
Loading…
test: add per-test timeout via pytest-timeout
#2710
opened Jun 22, 2026 by
akoumpa
Contributor
Loading…
2 tasks
fix(loss): avoid pkg_resources in linear CE
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2700
opened Jun 22, 2026 by
akoumpa
Contributor
Loading…
3 tasks done
feat(nemotron_v3): support dense Nemotron-H (Nano 4B)
community-request
#2670
opened Jun 20, 2026 by
stanley1208
Contributor
Loading…
3 tasks done
fix(vlm): guard validation forward against cuDNN fused-MHA SDPA backend
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
fix(qwen3_5): route dense MTP through SDPA + block-causal mask for pack
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2656
opened Jun 20, 2026 by
akoumpa
Contributor
Loading…
ci: Update transformers to latest version 5.12.1
#2632
opened Jun 18, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
fix(checkpoint): write consolidated safetensors without append
community-request
#2627
opened Jun 18, 2026 by
huahuajhu
Loading…
3 tasks done
feat(magi): honor AttnMaskSpec on the HF attention backend
#2622
opened Jun 17, 2026 by
HuiyingLi
Contributor
Loading…
fix(loss): support THD/packed layout in FusedLinearCrossEntropy
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2615
opened Jun 17, 2026 by
akoumpa
Contributor
Loading…
feat(retrieval): vl retrieval normalized/resolved dataset
#2596
opened Jun 16, 2026 by
yuhezhang-ai
Contributor
Loading…
feat(dflash): add dpace loss
community-request
waiting-on-customer
Waiting on the original author to respond
#2572
opened Jun 15, 2026 by
kashif
Contributor
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.