Submitted by
Haoran Zhang
Papers
ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows