iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning Paper • 2605.31096 • Published 15 days ago • 7
stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-8B_strategy_trust_t1.1_g6_run1 Viewer • Updated 12 days ago • 164 • 46 • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 17 days ago • 423
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 24 days ago • 204
Mem-π: Adaptive Memory through Learning When and What to Generate Paper • 2605.21463 • Published 24 days ago • 8
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 11 days ago • 224M • • 4.94k
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published May 7 • 112
GenLCA: 3D Diffusion for Full-Body Avatars from In-the-Wild Videos Paper • 2604.07273 • Published Apr 8 • 4