Pull requests: NVIDIA-NeMo/RL

Pull requests list

feat(xtoken): cross-tokenizer off-policy distillation
Labels: community-request, Documentation
#2508 opened May 16, 2026 by avenkateshha
3 tasks
fix: KL backward in GRPO
#2506 opened May 15, 2026 by smahdavi4
4 tasks done
feat(muon): add Muon optimizer support to the DTensor backend
Labels: community-request, Documentation
#2505 opened May 15, 2026 by bzantium
4 tasks done
feat: Adds a Comet logger
Labels: community-request, Documentation
#2503 opened May 15, 2026 by louisfaury
4 tasks done
feat: Discard weight when finish generation in the main loop
Labels: CI:Lfast
#2495 opened May 14, 2026 by guyueh1 Contributor
4 tasks
feat(data-plane): add TQ fault tolerance APIs
#2492 opened May 14, 2026 by pthombre
ci: use registry cache for container builds
Labels: CI
#2491 opened May 13, 2026 by kajalj22 Contributor Draft
3 of 4 tasks
feat: Add support for GLM 5.1 GRPO
Labels: Documentation
#2489 opened May 13, 2026 by slikhite-1 Contributor
4 tasks
fix: token mult prob error plot masking
#2485 opened May 13, 2026 by 1ytic
3 of 4 tasks
chore: upgrade sglang from v0.5.10 to v0.5.11
Labels: CI:L1
#2481 opened May 13, 2026 by kajalj22 Contributor
2 tasks
feat: Add Tau bench environment
#2479 opened May 12, 2026 by ashors1 Contributor Draft
4 tasks
fix: run dynamic sampling on unshaped rewards
#2478 opened May 12, 2026 by ashors1 Contributor Draft
4 tasks
[draft] feat: add flextron support to Hybrid models
#2474 opened May 12, 2026 by rohitrango Contributor Draft
2 of 4 tasks
perf: Performance script tuning
#2473 opened May 12, 2026 by guyueh1 Contributor
4 tasks
feat: add AIME-2026 benchmark.
Labels: CI:L1, community-request, Documentation
#2469 opened May 12, 2026 by xxman-google Contributor
4 tasks done
feat: add HMMT eval benchmark.
Labels: community-request, Documentation, waiting-on-customer
#2468 opened May 12, 2026 by xxman-google Contributor
2 of 4 tasks