GIT_FEED

withqwerty/reep

The football entity register. Maps player, team, and coach identities across Transfermarkt, FBref, UEFA, Sofascore, and 25+ data providers.

View on GitHub

What it does

Reep is a master reference list that assigns every football player, coach, team, and competition a single stable ID, then maps that ID to the equivalent IDs used by over 25 different sports data providers like Transfermarkt, FBref, Opta, and Wyscout. Think of it as a universal translator for football identity — if you know a player's ID on one platform, Reep tells you their ID on every other platform.

Why it matters

Any product built on football data — scouting tools, fantasy apps, betting platforms, media dashboards — wastes enormous time reconciling the same player appearing under different IDs across different data vendors, and Reep eliminates that problem as a free, open resource. For founders and PMs, this is the kind of foundational infrastructure that can shave months off data pipeline work and lower the barrier to building serious football analytics products.

27Active

On the radar — signal detected

Stars
127
Forks
6
Contributors
1
Language
Python

Score updated Apr 8, 2026

Related projects

Apache Spark is a powerful open-source platform that lets companies process and analyze massive amounts of data — think billions of rows — incredibly fast, using familiar programming languages like Python or SQL. It handles everything from running complex data queries to training machine learning models, all within a single unified system.

// why it matters With over 43,000 stars and nearly 3,400 contributors, Spark is effectively the industry standard for large-scale data processing, meaning any data-heavy product — from analytics dashboards to AI features — is likely built on or competing with tools in this ecosystem. Founders and PMs evaluating data infrastructure will almost inevitably encounter Spark as a core option, making familiarity with its capabilities a strategic advantage when deciding how to scale a data pipeline.

Scala43.1k stars29.1k forks3403 contrib

Foxglove SDK is a toolkit that lets robotics and engineering teams record, stream, and visually explore complex sensor data — think camera feeds, GPS tracks, and sensor readings — all in one place. It connects to the popular Foxglove visualization platform, allowing teams to replay and analyze what their robots or autonomous systems are doing in real time or from saved recordings.

// why it matters As robotics, autonomous vehicles, and industrial automation become major investment areas, teams need better tools to understand and debug what their machines are actually doing — and Foxglove is positioning itself as the standard observability platform for that space. With 43 contributors, support for multiple programming languages, and integration with the widely-used ROS robotics framework, this SDK signals a maturing ecosystem that could become a critical dependency for any company building physical AI products.

Rust214 stars82 forks45 contrib

DuckDB is a fast database system designed specifically for analyzing large amounts of data, running directly on your laptop or server without needing a separate database service to manage. It lets analysts and developers ask complex questions about data using SQL (a standard data query language) and works seamlessly with popular data tools like Python and Excel-style file formats.

// why it matters With over 36,000 stars and nearly 340 contributors, DuckDB has become a go-to solution for companies that want powerful data analysis without the cost and complexity of cloud data warehouses like Snowflake or BigQuery — making it a real competitive threat to expensive enterprise analytics platforms. For PMs and founders, this signals a growing market trend toward lightweight, embedded analytics that can be shipped directly inside products, reducing infrastructure costs and speeding up time-to-insight for end users.

C++37.3k stars3.1k forks708 contrib

AFNI is a comprehensive software toolkit used by neuroscientists to process, analyze, and visualize brain scan images, including the functional MRI scans (brain imaging that shows activity over time) used in research studies. It handles every step of the brain imaging workflow, from initial data collection through final statistical analysis and visual reporting.

// why it matters Brain imaging research underpins a massive and growing market spanning clinical neurology, mental health diagnostics, and neurotechnology, and AFNI is a foundational open-source tool trusted by academic and medical research institutions worldwide. For founders or investors in brain health, medical imaging, or research software, understanding that AFNI represents the established standard workflow gives important context for where new AI-driven or cloud-based neuroimaging products can integrate or compete.

C185 stars117 forks81 contrib
// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.