arxiv:2509.22638
Tianyu Pang
P2333
AI & ML interests
Machine Learning
Recent Activity
upvoted a paper about 17 hours ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models submitted a paper about 17 hours ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models upvoted a paper about 18 hours ago
Rethinking the Divergence Regularization in LLM RL