GIT_FEED

Giovan321/Reward-Guard

Plug-and-play reward monitoring for RL training loops. Catch reward hacking, component imbalance, and starvation before they tank your run. Drop in one .step() call — get balance reports, auto weight correction, alignment scores, and WandB/TensorBoard/SB3 integrations out of the box. → rewardguard.dev

View on GitHub
0Active

On the radar — signal detected

Stars
4
Forks
0
Contributors
0
Language
Python

Score updated Apr 29, 2026

// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.