Skip to content

Pull requests: areal-project/AReaL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat(distillation): Support cross-tokenizer on-policy distillation
#1452 opened Jun 30, 2026 by zahrayousefijamarani Contributor Loading…
4 of 13 tasks
feat(megatron): add MTP-augmented SFT/RL training
#1445 opened Jun 27, 2026 by HT-Yuan Collaborator Draft
2 of 15 tasks
feat(vlm): add Qwen3.6 LoRA GRPO training support for 27B and 35B-A3B
#1444 opened Jun 26, 2026 by Lei00764 Loading…
5 of 15 tasks
feat(ppo): support actor loss aggregation modes
#1443 opened Jun 26, 2026 by EazyReal Contributor Loading…
8 of 15 tasks
feat(infra): add HTTP-based Ray Scheduler
#1441 opened Jun 26, 2026 by HwVanICI Collaborator Loading…
7 of 15 tasks
fix(io_struct): support multi-EOS models in stop-token handling
#1433 opened Jun 22, 2026 by PheelaV Loading…
8 of 15 tasks
fix(stats): reset single-key export metadata correctly
#1432 opened Jun 22, 2026 by EazyReal Contributor Loading…
4 tasks done
docs: mirgate and clean the documents
#1431 opened Jun 22, 2026 by mingcheng Contributor Loading…
8 of 15 tasks
feat(logging): add W&B worker GPU system metrics
#1428 opened Jun 21, 2026 by EazyReal Contributor Loading…
3 tasks done
fix(dataset): correct GSM8K SFT loss-mask boundary for merged tokens
#1427 opened Jun 21, 2026 by EazyReal Contributor Loading…
8 of 9 tasks
fix(reward): bound MathVerifyWorker.verify wall-clock on a hung verification
#1426 opened Jun 20, 2026 by EazyReal Contributor Loading…
1 task done
fix(api): normalize tokenizer-derived stop token ids
#1425 opened Jun 20, 2026 by EazyReal Contributor Loading…
7 of 9 tasks
refactor(workflow): extract grouped rollout wrapper stale
#1418 opened Jun 16, 2026 by RanranranQAQ Loading…
5 of 15 tasks
feat(ppo): add actor loss aggregation modes
#1417 opened Jun 16, 2026 by EazyReal Contributor Loading…
8 of 9 tasks
fix(ppo): group-normalize by actual group sizes for partial groups
#1415 opened Jun 16, 2026 by EazyReal Contributor Loading…
fix(ppo): derive group norm size from n_samples
#1413 opened Jun 16, 2026 by EazyReal Contributor Loading…
3 tasks done
fix(openai): render tool-call arguments as a mapping for HF chat templates
#1411 opened Jun 16, 2026 by EazyReal Contributor Loading…
7 of 9 tasks
feat: trajectory dump/replay for offline training-loop debugging
#1407 opened Jun 12, 2026 by Fyrgo8 Loading…
5 of 9 tasks
Support Megatron FP8 weight transfer in AWEX colocate mode
#1406 opened Jun 11, 2026 by equation314 Loading…
8 of 14 tasks
ci: add PyPI publish workflow and fix Megatron deps 🚀
#1404 opened Jun 10, 2026 by mingcheng Contributor Loading…
7 of 15 tasks
feat(distillation): Multi-Teacher On-Policy Distillation Support
#1400 opened Jun 8, 2026 by zahrayousefijamarani Contributor Loading…
6 of 15 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.