Junyao Yang's picture

5 8

Junyao Yang

TberiusJunyao

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

upvoted a paper about 2 months ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

View all activity

Organizations

None yet

upvoted a paper 4 days ago

TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL

Paper • 2606.01599 • Published 6 days ago • 17

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated May 6 • 5.51M • • 4.68k

upvoted 2 papers about 2 months ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published Apr 21 • 35

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 327

liked a dataset 4 months ago

AI45Research/ATBench

Viewer • Updated 25 days ago • 1.5k • 2.01k • 39

liked 6 models 4 months ago

AI45Research/AgentDoG-FG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 9 • 9

AI45Research/AgentDoG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 33 • 11

AI45Research/AgentDoG-FG-Qwen2.5-7B

Text Classification • 8B • Updated Feb 6 • 8 • 8

AI45Research/AgentDoG-Qwen2.5-7B

Text Classification • 8B • Updated Apr 9 • 61 • 10

AI45Research/AgentDoG-FG-Qwen3-4B

Text Classification • 4B • Updated Apr 9 • 47 • 9

AI45Research/AgentDoG-Qwen3-4B

Text Classification • 4B • Updated Apr 9 • 251 • 23

upvoted a collection 4 months ago

AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 12 items • Updated 25 days ago • 112

upvoted a paper 7 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

published 2 models about 1 year ago

TberiusJunyao/Qwen2.5-7B-Instruct-Math-GRPO

Updated Mar 27, 2025

TberiusJunyao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Mar 8, 2025

published a model over 1 year ago

TberiusJunyao/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 6, 2025