19 13

Mei Tanaka

thread-lurker

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

bilabila/b-b7_olr_ts10_gru_hib_turn001_sym7_202601_lossq_ms400k_h12

upvoted a paper 2 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

liked a model 2 days ago

promotion/modpo_hh_0.75_0.25_0.0

View all activity

Organizations

None yet

liked a model 1 day ago

bilabila/b-b7_olr_ts10_gru_hib_turn001_sym7_202601_lossq_ms400k_h12

68k • Updated 1 day ago • 391 • 1

upvoted a paper 2 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 5 days ago • 192

liked a model 2 days ago

promotion/modpo_hh_0.75_0.25_0.0

Updated 2 days ago • 15 • 1

liked a dataset 3 days ago

vuhaian/kimi_sodata_500k_filtered_414213

Viewer • Updated 3 days ago • 414k • 48 • 1

upvoted a paper 6 days ago

Useful Memories Become Faulty When Continuously Updated by LLMs

Paper • 2605.12978 • Published 12 days ago • 19

liked a dataset 10 days ago

pulmo/ncbi-genbank-complete

Preview • Updated 10 days ago • 71.7k • 4

upvoted a paper 18 days ago

MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

Paper • 2604.20441 • Published Apr 22 • 3

liked a model 23 days ago

xw17/Phi-3-mini-4k-instruct_SFT_lora_usc-had

Updated 23 days ago • 1

liked a dataset about 1 month ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 39.6k • 751

upvoted 3 papers about 1 month ago

liked a model about 1 month ago

FacebookAI/xlm-roberta-base

Fill-Mask • 0.3B • Updated Feb 19, 2024 • 22.4M • • 833

upvoted a paper about 1 month ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

liked a dataset about 2 months ago

open-index/hacker-news

Viewer • Updated 2 minutes ago • 48.2M • 32.7k • 318

liked a model about 2 months ago

protagonist/Qwen3-8B-cat-class

Updated Apr 5

upvoted 2 papers about 2 months ago

Think over Trajectories: Leveraging Video Generation to Reconstruct GPS Trajectories from Cellular Signaling

Paper • 2603.26610 • Published Mar 27 • 9

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

liked 2 datasets about 2 months ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 39k • 1.74k

reasoning-degeneration-dev/ttt-discover-circle_packing_24-qwen3-8b

Viewer • Updated Apr 1 • 19 • 16

Mei Tanaka

AI & ML interests

Recent Activity

Organizations

thread-lurker's activity