LoMo: Local Modality Substitution for Deeper Vision-Language Fusion
Paper • 2605.30265 • Published 9 days ago