Tanjf's picture

6 1

Tanjf

Sober-Clever

·

AI & ML interests

None yet

Recent Activity

published a model 18 days ago

Sober-Clever/SFT-Industrial-Qwen2.5-1.5B

updated a model 18 days ago

Sober-Clever/SFT-Industrial-Qwen2.5-1.5B

upvoted a paper 27 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

View all activity

Organizations

None yet

upvoted 2 papers 27 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 112

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published about 1 month ago • 41

upvoted a paper 7 months ago

Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation

Paper • 2510.21003 • Published Oct 23, 2025 • 8

upvoted a paper 8 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

upvoted an article about 1 year ago

Article

Mixture of Experts Explained

+4

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.14k

upvoted a paper over 1 year ago

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published Dec 22, 2024 • 39