Fast and Effective On-policy Distillation from Reasoning Prefixes Paper • 2602.15260 • Published Feb 16 • 1
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published Apr 14 • 95
ColGemma4 — Gemma-4 Visual Retrieval Collection ColBERT-style late-interaction visual document retrieval adapters built on Google Gemma-4 (E2B and E4B variants). • 2 items • Updated about 1 month ago • 1
ColQwen3.5 — Qwen3.5 Visual Retrieval Collection Visual document retrieval models on Qwen3.5 backbone. ViDoRe v3 leaderboard competitors, 128-dim multi-vector. • 2 items • Updated Apr 13 • 1
Hydra — Dual-Head Retrieval and Generation Collection Dual-head VLM: ColBERT retrieval + autoregressive generation by toggling one LoRA. Canonical 4B + 0.8B, omni proof-of-concept, baselines. • 4 items • Updated 29 days ago • 1
athrael-soju/DualHead-GritLM-Qwen3.5-4B Visual Document Retrieval • Updated about 1 month ago • 10 • 1
TomoroAI/tomoro-ai-colqwen3-embed-4b-awq Visual Document Retrieval • 1B • Updated Dec 16, 2025 • 2.08k • 8
view article Article Adding Benchmaxxer Repellant to the Open ASR Leaderboard +9 bezzam, Steveeeeeeen, eustlb, SBruccoleriAppen, jmss-appen, c-e-ford-appen, wgb14, YukaiHuang, like2026, logicbean, ally-lxl • 13 days ago • 16
In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation Paper • 2408.00397 • Published Aug 1, 2024 • 12
Tree of Problems: Improving structured problem solving with compositionality Paper • 2410.06634 • Published Oct 9, 2024 • 9
BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity? Paper • 2503.15242 • Published Mar 19, 2025 • 11