
Qihan Ren

jasonrqh

https://nebularaid2000.github.io/

AI & ML interests

explainable AI, LLM

Recent Activity

upvoted a paper 5 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

new activity 5 days ago

Jackrong/Gemopus-4-31B-it:Awesome work

liked a model 5 days ago

Jackrong/Gemopus-4-31B-it-GGUF



upvoted a paper 5 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 6 days ago • 147

New activity in Jackrong/Gemopus-4-31B-it 5 days ago

Awesome work

#1 opened 5 days ago by jasonrqh

liked 2 models 5 days ago

Jackrong/Gemopus-4-31B-it-GGUF

Text Generation • 31B • Updated 5 days ago • 3.93k • 9

Jackrong/Gemopus-4-31B-it

Text Generation • 33B • Updated 5 days ago • 442 • 7

upvoted a paper 5 days ago

Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization

Paper • 2604.09574 • Published Feb 24 • 30

upvoted a paper 6 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 7 days ago • 82

liked 2 datasets 7 days ago

jasonrqh/Math-CoT-20k

Viewer • Updated 10 days ago • 20.5k • 187 • 5

jasonrqh/Math-CoT-44k-Qwen3-32b-n32-16384-with-logprob-and-entropy

Viewer • Updated 9 days ago • 44.4k • 2.09k • 1

authored 3 papers 8 days ago

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Paper • 2603.03202 • Published Mar 3 • 17

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Paper • 2604.02022 • Published 19 days ago • 15

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 13 days ago • 317

commented a paper 8 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 13 days ago • 317

liked a model 8 days ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated about 19 hours ago • 314k • 1k

updated a collection 9 days ago

Rethink_SFT_generalization

Collection

Repo for paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability. • 40 items • Updated 9 days ago • 16

upvoted an article 9 days ago

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

Article • Jan 1

updated a dataset 9 days ago

jasonrqh/Math-CoT-44k-Qwen3-32b-n32-16384-with-logprob-and-entropy

Viewer • Updated 9 days ago • 44.4k • 2.09k • 1

published a dataset 9 days ago

jasonrqh/Math-CoT-44k-Qwen3-32b-n32-16384-with-logprob-and-entropy

Viewer • Updated 9 days ago • 44.4k • 2.09k • 1

updated 3 datasets 10 days ago
