T2I-R1 https://github.com/CaraJ7/T2I-R1 CaraJ/T2I-R1 Text-to-Image • 7B • Updated Jul 3, 2025 • 26 • 5 CaraJ/ORM-T2I-R1 Image-Text-to-Text • 8B • Updated Jul 2, 2025 • 13 • 2 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
MMSearch Webpage of MMSearch: https://mmsearch.github.io/ MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper • 2409.12959 • Published Sep 19, 2024 • 38 CaraJ/MMSearch Viewer • Updated Apr 5 • 900 • 587 • 25
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper • 2409.12959 • Published Sep 19, 2024 • 38
MME-CoT Project Page: https://mmecot.github.io/ MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13, 2025 • 28 CaraJ/MME-CoT Viewer • Updated Mar 19, 2025 • 1.13k • 1.01k • 22 CaraJ/MME-CoT_VLMEvalKit Viewer • Updated Mar 5, 2025 • 1.13k • 12 • 2
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13, 2025 • 28
MathVerse MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Paper • 2403.14624 • Published Mar 21, 2024 • 53 AI4Math/MathVerse Viewer • Updated May 15, 2025 • 4.73k • 3.24k • 71 CaraJ/MathVerse-lmmseval Viewer • Updated Apr 19, 2024 • 8.67k • 1.91k • 2 CaraJ/Mathverse_VLMEvalKit Viewer • Updated Sep 2, 2024 • 8.67k • 63 • 1
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Paper • 2403.14624 • Published Mar 21, 2024 • 53
T2I-R1 https://github.com/CaraJ7/T2I-R1 CaraJ/T2I-R1 Text-to-Image • 7B • Updated Jul 3, 2025 • 26 • 5 CaraJ/ORM-T2I-R1 Image-Text-to-Text • 8B • Updated Jul 2, 2025 • 13 • 2 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
MME-CoT Project Page: https://mmecot.github.io/ MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13, 2025 • 28 CaraJ/MME-CoT Viewer • Updated Mar 19, 2025 • 1.13k • 1.01k • 22 CaraJ/MME-CoT_VLMEvalKit Viewer • Updated Mar 5, 2025 • 1.13k • 12 • 2
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13, 2025 • 28
MMSearch Webpage of MMSearch: https://mmsearch.github.io/ MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper • 2409.12959 • Published Sep 19, 2024 • 38 CaraJ/MMSearch Viewer • Updated Apr 5 • 900 • 587 • 25
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper • 2409.12959 • Published Sep 19, 2024 • 38
MathVerse MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Paper • 2403.14624 • Published Mar 21, 2024 • 53 AI4Math/MathVerse Viewer • Updated May 15, 2025 • 4.73k • 3.24k • 71 CaraJ/MathVerse-lmmseval Viewer • Updated Apr 19, 2024 • 8.67k • 1.91k • 2 CaraJ/Mathverse_VLMEvalKit Viewer • Updated Sep 2, 2024 • 8.67k • 63 • 1
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Paper • 2403.14624 • Published Mar 21, 2024 • 53