11 5

TIANYI

BIMU233

http://bimu.site

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 10 days ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 11 days ago • 76

upvoted a paper 11 days ago

Anchored Policy Optimization: Mitigating Exploration Collapse Via Support-Constrained Rectification

Paper • 2602.05717 • Published Feb 5 • 1

liked a Space 14 days ago

Croissant Checker - Dev

🔎

Validate Croissant dataset files for NeurIPS submissions

published a model 21 days ago

BIMU233/GPT-2_agd

Updated 21 days ago

updated a model 21 days ago

BIMU233/GPT-2_agd

Updated 21 days ago

liked a model 29 days ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated Mar 2 • 8.04M • 1.45k

upvoted a paper 29 days ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published Apr 13 • 143

upvoted 6 papers about 1 month ago

Reinforcement Learning via Value Gradient Flow

Paper • 2604.14265 • Published Apr 15 • 7

RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework

Paper • 2604.15308 • Published Apr 16 • 29

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published Apr 15 • 119

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

Paper • 2604.14142 • Published Apr 15 • 29

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 161

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published Apr 14 • 100

authored a paper about 1 month ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Paper • 2604.08865 • Published Apr 10 • 29

upvoted 2 papers about 1 month ago

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Paper • 2512.18832 • Published Dec 21, 2025 • 15