sierra-research/tau2-bench
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
0Active
On the radar — signal detected
Stars
964
Forks
241
Contributors
4
Language
Python
Score updated Apr 7, 2026
// SUBSCRIBE
The repos that moved this week, why they matter, and what to watch next. One email. No noise.