何子轩

linyifan81

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 6 days ago • 201

liked a model 4 days ago

parkjo/Qwen2.5-Math-1.5B_grpo_ppl_adv_entropy_rollout_8_KL_0.001_ent_0.001_USE_KL__step580

2B • Updated 4 days ago • 32 • 1

liked a model 5 days ago

iamseungpil/metacot-h200-rod-pt-R10v2-0512

Updated 4 days ago • 1

upvoted a paper 8 days ago

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

Paper • 2605.15301 • Published 12 days ago • 22

upvoted a paper 12 days ago

PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents

Paper • 2605.10341 • Published 15 days ago • 34

upvoted 2 papers 15 days ago

Mean Mode Screaming: Mean-Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 19 days ago • 229

RLDX-1 Technical Report

Paper • 2605.03269 • Published 21 days ago • 124

upvoted a paper 19 days ago

Perceptual Flow Network for Visually Grounded Reasoning

Paper • 2605.02730 • Published 22 days ago • 7

liked a dataset 25 days ago

tars-robotics/OmniVitac_Samples

Updated 18 days ago • 574 • 1

upvoted 2 papers about 1 month ago

The Continuity Layer: Why Intelligence Needs an Architecture for What It Carries Forward

Paper • 2604.17273 • Published Apr 19 • 3

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 242

liked a dataset about 1 month ago

open-index/hacker-news

Updated 1 minute ago • 33.5k • 319

liked a model about 1 month ago

willardj/msgpack-numpy-rce-poc

Updated Apr 12 • 1

liked a dataset about 2 months ago

princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18, 2025 • 500 • 823k • 344

upvoted a paper about 2 months ago

Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment

Paper • 2604.00913 • Published Apr 1 • 4

liked a model about 2 months ago

amazon/chronos-2

Time Series Forecasting • 0.1B • Updated Jan 6 • 14.6M • 292

upvoted 2 papers about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

liked a model about 2 months ago

inference-optimization/gpt-oss-120b-from-qwen235b-then-self-ckpt4-speculator.eagle3

0.9B • Updated Apr 1 • 4 • 1

upvoted a paper 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109
