Xiao Yutian's picture

Xiao Yutian

riley-scott55

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

upvoted a paper 2 days ago

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

liked a model 5 days ago

openbmb/MiniCPM-V-4.6

View all activity

Organizations

None yet

upvoted a paper about 14 hours ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 3 days ago • 303

upvoted a paper 2 days ago

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

Paper • 2605.22109 • Published 9 days ago • 169

upvoted a paper 8 days ago

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

Paper • 2605.18746 • Published 12 days ago • 5

upvoted a paper 13 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 17 days ago • 269

upvoted a paper 16 days ago

Position: LLM Inference Should Be Evaluated as Energy-to-Token Production

Paper • 2605.11733 • Published 18 days ago • 3

upvoted a paper 23 days ago

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

Paper • 2604.28196 • Published about 1 month ago • 72

upvoted 9 papers about 2 months ago

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published Apr 13 • 29

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

AI Generalisation Gap In Comorbid Sleep Disorder Staging

Paper • 2603.23582 • Published Mar 24 • 3

Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models

Paper • 2604.01618 • Published Apr 2 • 15

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Paper • 2603.27304 • Published Mar 28 • 47

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

upvoted 4 papers 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 150

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248