From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning Paper • 2605.22074 • Published 1 day ago • 2
TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks Paper • 2605.22535 • Published 1 day ago • 2
Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning Paper • 2605.22642 • Published 1 day ago • 22
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published 1 day ago • 8
DrawMotion: Generating 3D Human Motions by Freehand Drawing Paper • 2605.20955 • Published 2 days ago • 3
PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models Paper • 2605.20873 • Published 2 days ago • 3
SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents Paper • 2605.21384 • Published 2 days ago • 4
Mem-π: Adaptive Memory through Learning When and What to Generate Paper • 2605.21463 • Published 2 days ago • 4
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning Paper • 2605.21487 • Published 2 days ago • 20
SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects Paper • 2605.19587 • Published 3 days ago
OpenComputer: Verifiable Software Worlds for Computer-Use Agents Paper • 2605.19769 • Published 3 days ago • 55
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 3 days ago • 92
Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding Paper • 2605.20104 • Published 3 days ago • 6
Incantation: Natural Language as the Action Interface for Multi-Entity Video World Models Paper • 2605.18601 • Published 4 days ago • 4
AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents Paper • 2605.16819 • Published 6 days ago • 3
AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents Paper • 2605.17933 • Published 4 days ago • 6