ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop Paper • 2605.18746 • Published 8 days ago • 5
siddharthmb/2026.TA.gemma2_2b_sparse_probe_l1_0p008_tc8192_decb_l1w0.008_tarbb_lb2.0_ln1_dr10000_sl15512629 4B • Updated 5 days ago • 45 • 1
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 13 days ago • 268
Position: LLM Inference Should Be Evaluated as Energy-to-Token Production Paper • 2605.11733 • Published 14 days ago • 3
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published 26 days ago • 71
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published Apr 13 • 29
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 326
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504
Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models Paper • 2604.01618 • Published Apr 2 • 15