UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors • Paper 2605.00658 • Published May 1 • 84
UniAudio 2.0: A Unified Audio Language Model with Text-Aligned Factorized Audio Tokenization • Paper 2602.04683 • Published Feb 4 • 3
SoulX-Singer 🎤: Generate singing voice from lyrics and convert vocals • Running on Zero • Agents • Featured • 160
TeleStyle 🚀: Transfer style between images while preserving content • Running on Zero • Agents • Featured • 77
Qwen Image Multiple Angles 3D Camera 🎥: Transform image viewpoint with adjustable camera angles • Running on Zero • Agents • Featured • 2.52k