- Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning
  Paper • 2602.11149 • Published • 18
- dakopi/distilled_numina__qwen3-0.6b
  Viewer • Updated • 39.7k • 9
- dakopi/distilled_numina__qwen3-8b
  Viewer • Updated • 39.7k • 12
- dakopi/distilled_numina__qwen3-0.6b__train_12800
  Viewer • Updated • 12.8k • 13
Dawid
dakopi
AI & ML interests
None yet
Recent Activity
- liked a model about 22 hours ago: Qwen/Qwen3.6-27B
- liked a dataset 17 days ago: nyu-visionx/CV-Bench
- liked a dataset 19 days ago: HuggingFaceFW/fineweb-edu