lechmazur/debate
Adversarial multi-turn benchmark for LLM debate quality, using side-swapped matchups and multi-model judging to rank models by judged debate performance.
0 · Active
On the radar — signal detected
Stars: 8
Forks: 0
Contributors: 1
Score updated Apr 7, 2026