
benmeryem-tech/llm-eval-kit

A lightweight, modular toolkit for evaluating and benchmarking large language models, with a focus on reasoning quality, consistency, and error detection.


0 Active

On the radar — signal detected

Language: Python

Score updated May 12, 2026
