Speculative Decoding for Autoregressive Video Generation Paper • 2604.17397 • Published 4 days ago • 8
Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs Paper • 2604.18203 • Published 3 days ago • 3
Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration Paper • 2604.18131 • Published 3 days ago • 7
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 3 days ago • 39
Learning Adaptive Reasoning Paths for Efficient Visual Reasoning Paper • 2604.14568 • Published 7 days ago • 7
Towards Autonomous Mechanistic Reasoning in Virtual Cells Paper • 2604.11661 • Published 9 days ago • 4
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation Paper • 2604.15309 • Published 7 days ago • 6
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 9 days ago • 23
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 8 days ago • 108
Zero-shot World Models Are Developmentally Efficient Learners Paper • 2604.10333 • Published 12 days ago • 7
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 10 days ago • 13
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 10 days ago • 28
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs Paper • 2604.10480 • Published 11 days ago • 20