Serendipity
Yuhan
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 7 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards upvoted a paper 10 days ago
The Many Faces of On-Policy Distillation: Pitfalls, Mechanisms, and Fixes liked a dataset 27 days ago
wybb/Laser-ScanPath