Giovan321/Reward-Guard
Plug-and-play reward monitoring for RL training loops. Catch reward hacking, component imbalance, and starvation before they tank your run. Drop in one .step() call — get balance reports, auto weight correction, alignment scores, and WandB/TensorBoard/SB3 integrations out of the box. → rewardguard.dev
0Active
On the radar — signal detected
Stars
4
Forks
0
Contributors
0
Language
Python
Score updated Apr 29, 2026
// SUBSCRIBE
The repos that moved this week, why they matter, and what to watch next. One email. No noise.