CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows? Paper • 2605.16679 • Published 15 days ago • 53
Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published Apr 6 • 36
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • 22 days ago • 38
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • 19 days ago • 23
view article Article Introducing the agentic robotics appstore for 10,000 Reachy Minis clem • 24 days ago • 35
MediaTech Collection Collection of public datasets from the French administration, chunked, vectorized and ready to use in AI projects. • 9 items • Updated Feb 4 • 10
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published about 1 month ago • 42
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 47
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published Apr 13 • 72
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 902
view article Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents ibm-granite • Mar 31 • 34
view article Article Liberate your OpenClaw +6 clem, burtenshaw, pcuenq, jeffboudier, merve, nielsr, victor, mishig • Mar 27 • 46