SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Paper • 2605.30993 • Published 13 days ago • 57
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 15 days ago • 423
VidSplat: Gaussian Splatting Reconstruction with Geometry-Guided Video Diffusion Priors Paper • 2605.11424 • Published about 1 month ago • 4
GestaltLabs/Qwen3.6-35B-A3B-NSC-ACE-SABER-GGUF Image-Text-to-Text • 35B • Updated 28 days ago • 2.35k • 5
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 243