arxiv:2601.23143
Sangwoo Park
Sangsang
AI & ML interests
I do LLM post-training research (KAIST AI)
Recent Activity
updated a model about 4 hours ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p75_fw0p25_ema0p999_ep30 published a model about 4 hours ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p75_fw0p25_ema0p999_ep30 updated a model about 4 hours ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p25_fw0p75_ema0p999_ep30Organizations
None yet