YUYI YANG
yyuyi
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling
upvoted a paper about 1 month ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
upvoted a paper about 2 months ago
Process Rewards with Learned Reliability
Organizations
None yet