lechmazur/debate
Adversarial multi-turn benchmark for LLM debate quality, using side-swapped matchups and multi-model judging to rank models by judged debate performance.
0Active
On the radar — signal detected
Stars
22
Forks
1
Contributors
1
Score updated Jun 26, 2026
// SUBSCRIBE
The repos that moved this week, why they matter, and what to watch next. One email. No noise.