Collection of LLM Evaluation Frameworks
-
iioos/llm-evaluation-model
Updated -
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 80 -
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
Paper • 2605.03042 • Published • 124 -
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence
Paper • 2605.12882 • Published • 269