W.Jimmy's picture

W.Jimmy

WJimmy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Self-Distilled Agentic Reinforcement Learning

upvoted a paper 28 days ago

Pause or Fabricate? Training Language Models for Grounded Reasoning

upvoted a paper about 1 month ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

View all activity

Organizations

None yet

WJimmy 's datasets

None public yet