GIT_FEED

SemiAnalysisAI/InferenceX

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

View on GitHub

What it does

InferenceX is a live benchmarking platform that continuously measures and compares how fast different AI chips (from Nvidia, AMD, and others) can run popular open-source AI models, updating results in near real-time as the underlying software improves. Think of it as a constantly refreshed leaderboard showing which AI hardware and software combinations deliver the best performance for running large language models like DeepSeek or Qwen.
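
To make the measurement concrete, here is a minimal throughput probe in Python against a generic OpenAI-compatible serving endpoint. This is an illustrative sketch, not InferenceX's actual harness; the URL, model name, and prompt are placeholders.

    import time
    import requests

    # Time one completion against an OpenAI-compatible endpoint and derive tokens/second.
    # The endpoint, model, and prompt below are placeholders, not InferenceX's configuration.
    URL = "http://localhost:8000/v1/chat/completions"
    payload = {
        "model": "placeholder-model",
        "messages": [{"role": "user", "content": "Summarize the history of GPUs."}],
        "max_tokens": 256,
    }

    start = time.time()
    body = requests.post(URL, json=payload, timeout=120).json()
    elapsed = time.time() - start

    tokens = body["usage"]["completion_tokens"]
    print(f"{tokens} tokens in {elapsed:.1f}s -> {tokens / elapsed:.1f} tok/s")

A continuous benchmark re-runs this kind of measurement across hardware, serving stacks, and software versions as new releases land, which is what keeps the leaderboard moving.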

Why it matters

For PMs and founders choosing AI infrastructure, this provides independent, up-to-date data to make cost and performance trade-offs when deciding which chips or cloud providers to build on — a decision that can dramatically affect product economics. It also signals that AI software is improving so rapidly that last month's benchmarks may already be outdated, meaning infrastructure decisions need to be revisited far more frequently than traditional software cycles.

38 · Active

On the radar — signal detected

Stars: 721
Forks: 108
Contributors: 38
Language: Python

Score updated Mar 6, 2026

Related projects

Project N.O.M.A.D. is a portable, self-contained computer system that works entirely without an internet connection, bundling survival tools, reference knowledge, and AI capabilities so users can access critical information anywhere — even in remote or disaster-struck areas. It's built with a strict no-tracking policy and only needs the internet once during setup, after which it runs completely independently.

// why it matters With over 16,000 stars, this project signals massive market appetite for offline-first, privacy-respecting tools — a sentiment that builders across emergency tech, defense, and resilience-focused consumer products should pay attention to. For founders, it's a proof point that 'works without the cloud' is becoming a genuine product differentiator, not just a niche feature.

TypeScript · 16.9k stars · 1.6k forks · 8 contrib

This is Google's official collection of tutorials, code examples, and ready-to-run notebooks showing builders how to create AI-powered applications using Google's Gemini models on its cloud platform. It covers everything from basic AI conversations to complex multi-step AI agents that can reason and take actions autonomously.
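
For a flavor of the "basic AI conversations" end of that spectrum, a minimal single-turn call in Python might look like the sketch below. The SDK, model name, and API key handling are assumptions for illustration; the repository's own notebooks target Gemini on Google Cloud and may use a different client.

    import google.generativeai as genai

    # Minimal single-turn Gemini request; the model name and API key are placeholders.
    genai.configure(api_key="YOUR_API_KEY")
    model = genai.GenerativeModel("gemini-1.5-flash")
    response = model.generate_content("Explain retrieval-augmented generation in two sentences.")
    print(response.text)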

// why it matters With over 15,000 stars and nearly 300 contributors, this repository signals where serious enterprise AI development is heading — Google's cloud ecosystem is positioning itself as a primary destination for teams building production AI products. For founders and PMs evaluating AI infrastructure, this gives a clear picture of Google's capabilities and provides a fast track to building on the same models powering consumer Google products.

Jupyter Notebook · 16.5k stars · 4.1k forks · 292 contrib

OpenClaw Zero Token is a tool that lets you use major AI services — including ChatGPT, Claude, Gemini, and others — without paying for API access, by hijacking your existing logged-in browser sessions to bypass normal billing. Essentially, it tricks these platforms into thinking requests are coming from a regular user browsing the web rather than from a developer using paid programmatic access.

// why it matters This project signals real market demand for affordable AI access, but it operates in a legal and ethical gray zone — these techniques violate the terms of service of every platform it targets, creating serious risk for any product built on top of it. For builders and investors, it's a reminder that API cost is a genuine pain point worth solving, but products relying on this approach could be shut down overnight.

TypeScript · 3.0k stars · 688 forks · 1214 contrib

ROCm Libraries is a centralized collection of software building blocks that power AI and machine learning workloads on AMD graphics cards, consolidated into a single repository for easier development. It serves as the foundational layer that tools like PyTorch rely on to run efficiently on AMD hardware.
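
One illustrative way to see that dependency from the PyTorch side (an assumption-level sketch, not code from the repository): on a ROCm build of PyTorch, torch.version.hip reports a version string and the familiar torch.cuda device API drives AMD GPUs through HIP.

    import torch

    # On a ROCm build of PyTorch, the CUDA-style device API is backed by HIP,
    # so the same code path runs on AMD GPUs.
    print("HIP version:", torch.version.hip)          # None on non-ROCm builds
    print("GPU visible:", torch.cuda.is_available())
    if torch.cuda.is_available():
        x = torch.randn(1024, 1024, device="cuda")
        print("Matmul checksum:", (x @ x).sum().item())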

// why it matters As AI infrastructure spending diversifies beyond Nvidia, having a mature, well-organized AMD software ecosystem lowers the barrier for companies to build on lower-cost or more accessible GPU alternatives. Builders and investors evaluating AMD-based AI infrastructure should watch this project as a signal of AMD's software readiness to compete seriously in the AI hardware market.

Assembly · 292 stars · 243 forks · 1044 contrib
// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.