❯GITFIND.AI
HOMEAI/MLDEV TOOLSAI CODE INDEXINSIGHTS
SYS_OK
LIVE
[SUBMIT]
AI/MLDEV TOOLSSECURITYINFRASTRUCTUREAI CODE INDEXINSIGHTSSUBMIT

GIT_FEED
DeusData/codebase-memory-mcp+7.6k ★+93%DietrichGebert/ponytail+20k ★+52%Panniantong/Agent-Reach+7.0k ★+20%ZhuLinsen/daily_stock_analysis+6.7k ★+15%tw93/Pake+7.1k ★+14%apple/container+4.8k ★+12%jamiepine/voicebox+3.7k ★+12%Leonxlnx/taste-skill+4.4k ★+9%Kilo-Org/kilocode+2.1k ★+9%heygen-com/hyperframes+2.6k ★+9%hugohe3/ppt-master+2.7k ★+9%mattpocock/skills+11k ★+8%penpot/penpot+3.5k ★+7%Egonex-AI/Understand-Anything+4.4k ★+7%asgeirtj/system_prompts_leaks+2.8k ★+6%DeusData/codebase-memory-mcp+7.6k ★+93%DietrichGebert/ponytail+20k ★+52%Panniantong/Agent-Reach+7.0k ★+20%ZhuLinsen/daily_stock_analysis+6.7k ★+15%tw93/Pake+7.1k ★+14%apple/container+4.8k ★+12%jamiepine/voicebox+3.7k ★+12%Leonxlnx/taste-skill+4.4k ★+9%Kilo-Org/kilocode+2.1k ★+9%heygen-com/hyperframes+2.6k ★+9%hugohe3/ppt-master+2.7k ★+9%mattpocock/skills+11k ★+8%penpot/penpot+3.5k ★+7%Egonex-AI/Understand-Anything+4.4k ★+7%asgeirtj/system_prompts_leaks+2.8k ★+6%

aimvik07/agent-eval

CLI toolkit for probing LLM agent failures, comparing models on cost vs accuracy, and catching regressions. Tested across classification, sentiment, and RAG agents.

View on GitHub

0Active

On the radar — signal detected

Stars

Forks

Contributors

Language

Python

Score updated Jun 26, 2026

// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.