bezzam's Collections
VibeVoice
Neural codecs
Omnilingual ASR (1,600+ Languages)
Multimodel audio
Speech recognition datasets
Text-to-speech datasets
DigiCam (CelebA)
DiffuserCam Mirflickr
VibeVoice • Updated Mar 2
bezzam/VibeVoice-1.5B • Text-to-Speech • 3B • Updated Feb 16 • 15 • 1
bezzam/VibeVoice-7B • Text-to-Speech • 9B • Updated 22 days ago • 41
bezzam/VibeVoice-AcousticTokenizer • Feature Extraction • 0.7B • Updated Feb 5 • 20
bezzam/VibeVoice-SemanticTokenizer • Feature Extraction • 0.3B • Updated Dec 3, 2025 • 12
bezzam/vibevoice_samples • Viewer • Updated Feb 2 • 2 • 12.2k
VibeVoice Technical Report • Paper • 2508.19205 • Published Aug 26, 2025 • 171