Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
authored a paper 2 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
upvoted a paper 7 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
updated a model 21 days ago
meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval