AI & ML interests
None defined yet.
Recent Activity
Papers
Externalizing Research Synthesis and Validation in AI Scientists through a Research Harness
AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
None defined yet.
Externalizing Research Synthesis and Validation in AI Scientists through a Research Harness
AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation