Skip to content

Pull requests: NVIDIA-NeMo/Automodel

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(benchmark): load nested AutoConfig via compatibility path r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2767 opened Jun 25, 2026 by akoumpa Contributor Loading…
ci: runtime-install media extras in VLM/diffusion launchers
#2760 opened Jun 24, 2026 by thomasdhc Contributor Draft
3 tasks
build(deps): move ffmpeg/opencv deps to opt-in media extra r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2743 opened Jun 23, 2026 by thomasdhc Contributor Draft
3 tasks
fix(qwen3-moe): export grouped HF expert weights r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2741 opened Jun 23, 2026 by akoumpa Contributor Loading…
ci: update claude review guidelines
#2739 opened Jun 23, 2026 by akoumpa Contributor Loading…
3 tasks
feat(datasets): add DP-aware stateful dataloader community-request waiting-on-maintainers Waiting on maintainers to respond
#2730 opened Jun 23, 2026 by huahuajhu Loading…
2 of 3 tasks
test: add per-test timeout via pytest-timeout
#2710 opened Jun 22, 2026 by akoumpa Contributor Loading…
2 tasks
fix(loss): avoid pkg_resources in linear CE r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2700 opened Jun 22, 2026 by akoumpa Contributor Loading…
3 tasks done
fix(deepseek_v4): support packed THD document bounds
#2696 opened Jun 21, 2026 by akoumpa Contributor Draft
feat(nemotron_v3): support dense Nemotron-H (Nano 4B) community-request
#2670 opened Jun 20, 2026 by stanley1208 Contributor Loading…
3 tasks done
fix(vlm): guard validation forward against cuDNN fused-MHA SDPA backend r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2659 opened Jun 20, 2026 by akoumpa Contributor Draft
fix(qwen3_5): route dense MTP through SDPA + block-causal mask for pack r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2656 opened Jun 20, 2026 by akoumpa Contributor Loading…
ci: Update transformers to latest version 5.12.1
#2632 opened Jun 18, 2026 by svcnvidia-nemo-ci Contributor Loading…
feat(magi): honor AttnMaskSpec on the HF attention backend
#2622 opened Jun 17, 2026 by HuiyingLi Contributor Loading…
fix(loss): support THD/packed layout in FusedLinearCrossEntropy r0.5.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#2615 opened Jun 17, 2026 by akoumpa Contributor Loading…
feat(retrieval): vl retrieval normalized/resolved dataset
#2596 opened Jun 16, 2026 by yuhezhang-ai Contributor Loading…
feat(dflash): add dpace loss community-request waiting-on-customer Waiting on the original author to respond
#2572 opened Jun 15, 2026 by kashif Contributor Loading…
feat(engine): Engine training API
#2556 opened Jun 14, 2026 by HuiyingLi Contributor Draft
ProTip! Filter pull requests by the default branch with base:main.