AI & ML interests
None yet
Recent Activity
Organizations
None yet
view article A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond
karina-zadorozhny
• • 26
view article Forge: Scalable Agent RL Framework and Algorithm
MiniMax-AI
• • 155
view article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
reacted to jimzhiwei's post with ❤️ about 2 years ago