Our paper MultEval: Supporting Collaborative Alignment for LLM-as-a-Judge Evaluation Criteria will appear at CHIWORK 2026.
This work studies how teams of stakeholders can collaboratively create, negotiate, and refine criteria for LLM-as-a-judge systems.