GIT_FEED

beyhangl/evalcraft

The pytest for AI agents — capture, replay, mock, and evaluate agent behavior. Agent Reliability Engineering.

View on GitHub
0Active

On the radar — signal detected

Stars
4
Forks
2
Contributors
2
Language
Python

Score updated Jun 26, 2026

// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.