-
Notifications
You must be signed in to change notification settings - Fork 274
Pull requests: NovaSky-AI/SkyRL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix paths for instruction comments to match current location
#1294
opened Mar 8, 2026 by
linde
Loading…
[train][FullyAsync] Implement customizable weight sync frequency
#1293
opened Mar 8, 2026 by
tamoghnokandar
Loading…
feat(train): prefix-aware merge for step-wise trajectories (#1277)
#1289
opened Mar 6, 2026 by
deepsheth3
Loading…
[train] Fix cross-sample padding inflation in batch tensor construction
#1285
opened Mar 5, 2026 by
CharlieFRuan
•
Draft
1 of 4 tasks
[train] Add validation for step-wise GeneratorOutput
#1281
opened Mar 5, 2026 by
CharlieFRuan
•
Draft
3 tasks done
WIP: return_dict=False fixes + H200 validation scripts
#1280
opened Mar 5, 2026 by
tyler-griggs
•
Draft
[train] Add importance weight diagnostics and fix IS loss overflow
#1261
opened Mar 3, 2026 by
tyler-griggs
•
Draft
2 tasks
[train] Add DRO (Direct Reward Optimization) policy loss
#1259
opened Mar 3, 2026 by
tyler-griggs
•
Draft
2 tasks
Add llm_as_a_judge_local example with frozen vLLM reward model
#1208
opened Feb 25, 2026 by
ghShu
Loading…
Use the last LoRA path in the vLLM inference engine instead of "dummy_lora_path"
#1188
opened Feb 20, 2026 by
ebronstein
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.