Official Tempo-6B collection: a query-aware framework that resolves the mismatch between massive video streams and bounded LLM context windows.
models 20
Vision-CAIR/Tempo-6B
Video-Text-to-Text • Updated • 242 • 2
Vision-CAIR/Tempo-6B-Stage2
Video-Text-to-Text • Updated • 49
Vision-CAIR/Tempo-6B-Stage1
Video-Text-to-Text • Updated • 39
Vision-CAIR/Tempo-6B-Stage0
Video-Text-to-Text • Updated • 40
Vision-CAIR/BFPO-Mistral-7b-v0.1
Text Generation • 7B • Updated • 13 • 1
Vision-CAIR/LongVU_Llama3_2_1B
Video-Text-to-Text • Updated • 23 • 12
Vision-CAIR/LongVU_Llama3_2_3B_img
Updated • 5 • 6
Vision-CAIR/LongVU_Qwen2_7B_img
Updated • 5 • 5
Vision-CAIR/LongVU_Llama3_2_3B
Video-Text-to-Text • Updated • 67 • 8
Vision-CAIR/LongVU_Qwen2_7B
Video-Text-to-Text • 8B • Updated • 216 • 76