Pull requests: THUDM/slime

feat(gemma4): add Gemma4 dense and MoE support
#2135 opened Jun 26, 2026 by EazyReal Contributor
fix: handle empty colocated weight buckets
#2134 opened Jun 26, 2026 by EazyReal Contributor
docs(examples): list coding_agent_rl in examples/README
#2133 opened Jun 26, 2026 by aoshen02 Contributor
Skip entropy gradient computation when entropy_coef == 0
#2130 opened Jun 25, 2026 by CSUN1997
Support partial rollout resume in Search-R1 example
#2128 opened Jun 23, 2026 by OLIVER-XYP
Reduce entropy logging memory when entropy coef is zero
#2127 opened Jun 23, 2026 by none0663 Contributor
Add test for megatron server run-ci-changed
#2123 opened Jun 23, 2026 by zhuzilin Contributor
fix(partial-rollout): cap max_new_tokens by prior response length
#2122 opened Jun 23, 2026 by none0663 Contributor
fix(ppo): preserve raw KL so rollout/kl logging is correct
#2114 opened Jun 21, 2026 by EazyReal Contributor
Fix(rollout): Fail closed on unknown SGLang model names
#2112 opened Jun 21, 2026 by Baiyu-Su Contributor
fix(train): support eval-only mode (--num-rollout 0)
#2109 opened Jun 20, 2026 by EazyReal Contributor
feat(examples/strands_sglang): update to strands-sglang 0.4.2
#2106 opened Jun 20, 2026 by Lawhy Contributor
fix(dist): preserve new_group options across reloadable group reload
#2095 opened Jun 17, 2026 by EazyReal Contributor
fix(scripts): correct model config source path in FP8 low_precision scripts
#2094 opened Jun 17, 2026 by aoshen02 Contributor
2 tasks done
feat(loss): add pg_loss aggregation modes
#2090 opened Jun 16, 2026 by EazyReal Contributor
Disk-level delta weight sync
#2089 opened Jun 16, 2026 by nanjiangwill Collaborator