Large Language Models Explore by Latent Distilling Paper ⢠2604.24927 ⢠Published 5 days ago ⢠59
AgentSearchBench: A Benchmark for AI Agent Search in the Wild Paper ⢠2604.22436 ⢠Published 8 days ago ⢠12
Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval Paper ⢠2604.23734 ⢠Published 6 days ago ⢠3
view article Article DeepSeek-V4: a million-token context that agents can actually use 8 days ago ⢠40
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 16 days ago ⢠66
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper ⢠2604.14683 ⢠Published 16 days ago ⢠35
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents Paper ⢠2604.18543 ⢠Published 12 days ago ⢠27
GLiNER-PII Collection PII detection models developed in collaboration with Wordcab ⢠5 items ⢠Updated Jan 29 ⢠23
Toward Autonomous Long-Horizon Engineering for ML Research Paper ⢠2604.13018 ⢠Published 18 days ago ⢠34
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper ⢠2604.13016 ⢠Published 18 days ago ⢠87
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper ⢠2604.12627 ⢠Published 18 days ago ⢠99
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper ⢠2603.27490 ⢠Published Mar 29 ⢠18
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images Paper ⢠2604.09531 ⢠Published 22 days ago ⢠8
ECHO: Efficient Chest X-ray Report Generation with One-step Block Diffusion Paper ⢠2604.09450 ⢠Published 22 days ago ⢠22
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper ⢠2604.08995 ⢠Published 22 days ago ⢠48
Small Vision-Language Models are Smart Compressors for Long Video Understanding Paper ⢠2604.08120 ⢠Published 23 days ago ⢠20