Pull requests: NVIDIA-NeMo/RL

Pull requests list

ci: update dapo test metrics following PR 2478
#2903 opened Jun 23, 2026 by ashors1 Contributor
4 tasks
fix: Reconcile #2315 and #2612
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2902 opened Jun 23, 2026 by tdene Contributor
4 tasks
[draft] docs: add Qwen3.5 model guide and model-family hub
Labels: Documentation (Improvements or additions to documentation)
#2900 opened Jun 23, 2026 by sharonyu-115 Contributor
4 tasks
fix: fix sglang env
Labels: CI:L1 (Run doctests, unit tests, and functional tests); CI (Relating to CI)
#2898 opened Jun 23, 2026 by yuki-97 Contributor Draft
ci: Bump Megatron-Bridge to af3124a
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2897 opened Jun 23, 2026 by svcnvidia-nemo-ci Contributor
feat: Update Megatron Inference API interface
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2891 opened Jun 22, 2026 by tdene Contributor
4 tasks
feat: vllm worker env shutdown
#2887 opened Jun 22, 2026 by arnavk-nvidia Contributor
1 of 4 tasks
feat(loss): support TIS lower bound
Labels: CI:Lfast (Runs a fast test suite and reuses the nightly `main` container, but syncs dependencies to the PR's version); Documentation (Improvements or additions to documentation)
#2886 opened Jun 22, 2026 by macandro96 Contributor
4 tasks
feat: add multiple penalties for model behaviour
Labels: CI:Lfast (Runs a fast test suite and reuses the nightly `main` container, but syncs dependencies to the PR's version)
#2885 opened Jun 22, 2026 by macandro96 Contributor
4 tasks
feat: async colocated GRPO with Megatron inference
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2884 opened Jun 22, 2026 by tdene Contributor
4 tasks
fix: topk fp32 chunk memory
Labels: community-request
#2883 opened Jun 22, 2026 by odedovadia Contributor
2 of 4 tasks
feat: add Mistral Medium 3.5 (128B) text-only DAPO support
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2875 opened Jun 19, 2026 by sharonyu-115 Contributor
3 of 4 tasks
DRAFT: fix: run Nemotron Nano v2 workplace assistant recipe
#2868 opened Jun 18, 2026 by snowmanwwg Contributor
4 tasks
DRAFT fix: prefer real NeMo-Gym package in actor
#2867 opened Jun 18, 2026 by snowmanwwg Contributor
4 tasks
feat(megatron): add large-scale MoE tuning knobs and longer PG timeout
Labels: community-request; waiting-on-maintainers (Waiting on maintainers to respond)
#2866 opened Jun 18, 2026 by dafu-wu
1 of 4 tasks
fix(data): stabilize multi-turn chat chunking and tokenization
Labels: CI:Lfast (Runs a fast test suite and reuses the nightly `main` container, but syncs dependencies to the PR's version)
#2856 opened Jun 17, 2026 by jinglinglingling Contributor
ci: Add super nightly tests
Labels: Documentation (Improvements or additions to documentation)
#2855 opened Jun 16, 2026 by ashors1 Contributor Draft
4 tasks
docs(xtoken): X-Token distillation guide and README updates
Labels: Documentation (Improvements or additions to documentation)
#2854 opened Jun 16, 2026 by avenkateshha Contributor
test: add vLLM HTTP logprobs contract test for NeMo-Gym capture
Labels: CI:Lfast (Runs a fast test suite and reuses the nightly `main` container, but syncs dependencies to the PR's version)
#2845 opened Jun 16, 2026 by ananthsub Contributor
feat: add vLLM prefix cache and preemption metrics
Labels: community-request; waiting-on-maintainers (Waiting on maintainers to respond)
#2843 opened Jun 16, 2026 by puneeshkhanna
1 of 4 tasks
feat(ppo): Megatron value-model sequence packing + context parallelism
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#2839 opened Jun 16, 2026 by bg51717 Contributor
3 of 4 tasks
test(data_plane): session-scope mooncake fixtures
Labels: CI:Lfast (Runs a fast test suite and reuses the nightly `main` container, but syncs dependencies to the PR's version)
#2838 opened Jun 16, 2026 by ZhiyuLi-Nvidia Contributor