SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published 20 days ago • 13
LINGESH-7/tinyllama-bnb-4bit-FT-on-yahma-alpaca-cleaned Text Generation • Updated 4 days ago • 29 • 1
OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Paper • 2605.17757 • Published 10 days ago • 62
EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents Paper • 2605.13941 • Published 15 days ago • 24
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems Paper • 2605.04018 • Published 23 days ago • 40
Stargazer: A Scalable Model-Fitting Benchmark Environment for AI Agents under Astrophysical Constraints Paper • 2604.15664 • Published Apr 17 • 5
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 242
Crowded in B-Space: Calibrating Shared Directions for LoRA Merging Paper • 2604.16826 • Published Apr 18 • 18
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102
Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling? Paper • 2604.03619 • Published Apr 4 • 9
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published Feb 12 • 38