Retrieve, Don't Retrain: Extending Vision Language Action Models to New Tasks at Test Time Paper • 2606.15631 • Published 5 days ago • 15
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward Text Generation • 8B • Updated Apr 17 • 7
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward Text Generation • 8B • Updated Apr 7 • 2