jackpan
jackpang
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
ImagineBench: Evaluating Reinforcement Learning with Large Language
Model Rollouts upvoted a paper about 1 month ago
RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution upvoted a paper about 1 month ago
Natural Language-conditioned Reinforcement Learning with Inside-out Task
Language Development and TranslationOrganizations
None yet