F-TIS: Harnessing Diverse Models in Collaborative GRPO Paper • 2605.22537 • Published about 1 month ago • 13
F-TIS: Harnessing Diverse Models in Collaborative GRPO Paper • 2605.22537 • Published about 1 month ago • 13
NoLoCo: No-all-reduce Low Communication Training Method for Large Models Paper • 2506.10911 • Published Jun 12, 2025 • 9
Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO Paper • 2511.09780 • Published Nov 12, 2025 • 29
Faustify/Qwen3-0.6B-Gensyn-Swarm-whistling_pawing_okapi Text Generation • 0.6B • Updated Aug 21, 2025 • 1
Faustify/Qwen3-0.6B-Gensyn-Swarm-whistling_pawing_okapi Text Generation • 0.6B • Updated Aug 21, 2025 • 1
All is Not Lost: LLM Recovery without Checkpoints Paper • 2506.15461 • Published Jun 18, 2025 • 41
All is Not Lost: LLM Recovery without Checkpoints Paper • 2506.15461 • Published Jun 18, 2025 • 41
SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous Networks Paper • 2502.19913 • Published Feb 27, 2025 • 6
SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous Networks Paper • 2502.19913 • Published Feb 27, 2025 • 6