RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting Paper • 2605.18263 • Published 4 days ago • 7
OpenComputer: Verifiable Software Worlds for Computer-Use Agents Paper • 2605.19769 • Published 3 days ago • 54
PanoWorld: A Generative Spatial World Model for Consistent Whole-House Panorama Synthesis Paper • 2605.17916 • Published 3 days ago • 4
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 3 days ago • 58
AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents Paper • 2605.17933 • Published 4 days ago • 5
StableVLA: Towards Robust Vision-Language-Action Models without Extra Data Paper • 2605.18287 • Published 4 days ago • 13
Code-as-Room: Generating 3D Rooms from Top-Down View Images via Agentic Code Synthesis Paper • 2605.18451 • Published 4 days ago • 40
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 8 days ago • 116
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 8 days ago • 80
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation Paper • 2605.10912 • Published 11 days ago • 45
TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking Paper • 2605.12587 • Published 10 days ago • 37
PanoWorld: Towards Spatial Supersensing in 360^circ Panorama World Paper • 2605.13169 • Published 9 days ago • 21
VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction Paper • 2605.15186 • Published 8 days ago • 26
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory Paper • 2605.15128 • Published 8 days ago • 60
Nexus : An Agentic Framework for Time Series Forecasting Paper • 2605.14389 • Published 8 days ago • 3
IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation Paper • 2605.14712 • Published 8 days ago • 16