video/image - a dbest111 Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

dbest111 's Collections

video/image

updated Jul 24, 2025

google/vit-base-patch16-224

Image Classification • 86.6M • Updated Sep 5, 2023 • 4.85M • • 965
OpenGVLab/internimage_g_jointto22k_384

Image Classification • 3B • Updated Mar 25, 2025 • 49 • 1
chancharikm/qwen2.5-vl-72b-cam-motion

Video-Text-to-Text • 73B • Updated Sep 19, 2025 • 14 • 1
lmms-lab/Aero-1-Audio

Text Generation • 2B • Updated Jun 7, 2025 • 2.56k • 90
mipal/AVATAR

Updated Nov 3, 2025 • 74 • 1
FAVOR-Bench/FAVOR

Viewer • Updated 17 days ago • 27.1k • 4.85k • 3
lmms-lab/VideoMMMU

Viewer • Updated May 5, 2025 • 900 • 2.61k • 14
moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Jan 30 • 11.8k • 360
lmms-lab/llava-critic-113k

Viewer • Updated Oct 5, 2024 • 113k • 951 • 28
lmms-lab/M4-Instruct-Data

Updated Jul 21, 2024 • 1.39k • 78
lmms-lab/llava-next-interleave-qwen-7b

Text Generation • 8B • Updated Jul 24, 2024 • 138 • 27
lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24, 2025 • 3.94M • 14.8k • 236
avalab/syndicom

Viewer • Updated May 10, 2024 • 19.2k • 57
avalab/iTBLS

Viewer • Updated Jan 17, 2025 • 12.5k • 29
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Paper • 2312.14378 • Published Dec 22, 2023
avalab/cTBLS_knowledge_retriever

Updated Jan 12, 2024
avalab/cTBLS_encoder

Updated Apr 27, 2023
CraftJarvis/minecraft-vla-sft

Viewer • Updated Mar 21, 2025 • 3.78M • 2.07k • 10

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs