Datasets and trained checkpoints of Composition-RL: https://github.com/XinXU-USTC/Composition-RL
xuxin
xx18
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation upvoted a paper 3 days ago
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation new activity 23 days ago
xx18/Composition-RL-4B-Depth1_2_3:Add model card and metadata