LISA: Likelihood Score Alignment for Visual-condition Controllable Generation Paper • 2606.27192 • Published 3 days ago • 13
Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments Paper • 2606.14397 • Published 3 days ago • 15
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It Paper • 2606.26027 • Published 4 days ago • 15
GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents Paper • 2606.24551 • Published 6 days ago • 25
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting Paper • 2606.18394 • Published 3 days ago • 26
The Verification Horizon: No Silver Bullet for Coding Agent Rewards Paper • 2606.26300 • Published 4 days ago • 37
Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence Paper • 2606.15932 • Published 12 days ago • 36
MemGUI-Agent: An End-to-End Long-Horizon Mobile GUI Agent with Proactive Context Management Paper • 2606.19926 • Published 10 days ago • 42
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 5 days ago • 132
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning Paper • 2606.26790 • Published 3 days ago • 40
V-Zero: Answer-Label-Free On-Policy Distillation with Contrastive Evidence Gating for Fine-Grained Visual Reasoning Paper • 2606.25319 • Published 4 days ago • 25
CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents Paper • 2606.22883 • Published 6 days ago • 37