UKPLab/agentcibench
The Ubiquitous Knowledge Processing Lab researches natural language processing, text mining, eLearning, and digital humanities.
Capable but Careless: Do Computer-Use Agents Follow Contextual Integrity?
SciCoQA: Quality Assurance for Scientific Paper–Code Alignment