Pull requests: THUDM/slime

Pull requests list

chore: tidy up ci config
#2171 opened Jul 3, 2026 by udjevdbaj
support multimodal qwen36 sft
#2164 opened Jul 1, 2026 by samaritan1998
Support Qwen3.5 MoE INT4-QAT
#2156 opened Jun 30, 2026 by ShuZihan
9 tasks done
[docker] Upgrade to sglang v0.5.14 run-ci-image
#2149 opened Jun 29, 2026 by zhuzilin Contributor
docs: fix dead examples/README link to low_precision
#2142 opened Jun 29, 2026 by aoshen02 Contributor
fix(examples): preserve geo3k response budget
#2140 opened Jun 27, 2026 by zhangdw156
fix(examples): correct geo3k VLM default env
#2139 opened Jun 27, 2026 by zhangdw156
docs(readme): add Dressage to Chinese ecosystem
#2138 opened Jun 27, 2026 by zhangdw156
docs(examples): fix broken markdown links in rollout_buffer and examples
#2137 opened Jun 27, 2026 by CalvinXKY Contributor
docs(examples): list coding_agent_rl in examples/README
#2133 opened Jun 26, 2026 by aoshen02 Contributor
Skip entropy gradient computation when entropy_coef == 0
#2130 opened Jun 25, 2026 by CSUN1997
Support partial rollout resume in Search-R1 example
#2128 opened Jun 23, 2026 by OLIVER-XYP
Reduce entropy logging memory when entropy coef is zero
#2127 opened Jun 23, 2026 by none0663 Contributor
fix(partial-rollout): cap max_new_tokens by prior response length
#2122 opened Jun 23, 2026 by none0663 Contributor