karpathy/nanochat

The best ChatGPT that $100 can buy.

View on GitHub

What it does

Nanochat is a streamlined toolkit that lets anyone train their own AI chatbot — similar to ChatGPT — from scratch for as little as $15 to $48, compared to the tens of thousands of dollars it originally cost. It handles the entire process from raw data to a finished chat interface, and is designed to be simple enough that builders can understand and customize every part of it.

Why it matters

The collapsing cost of training capable AI models means startups and indie builders can now own their AI stack outright rather than renting it indefinitely from OpenAI or Anthropic — a major shift in product leverage and margin structure. With 45,000+ stars, this project signals strong market demand for AI ownership, and builders ignoring it may find themselves locked into expensive API dependencies while competitors build proprietary models.

Why it's trending

Training your own ChatGPT-style model used to cost tens of thousands of dollars — now it costs less than a nice dinner out, and builders are clearly paying attention. Nanochat is pulling in roughly 540 new stars this week against a base of 55,000, driven almost entirely by community discovery rather than active development (just one commit in the last 30 days), which suggests word-of-mouth momentum rather than a fresh release cycle. With over 7,500 forks, people aren't just starring this out of curiosity — they're actively forking it to build on it, making this a strong candidate to watch for anyone evaluating low-cost AI infrastructure options.

34Active

On the radar — signal detected

Stars
55.2k
Forks
7.6k
Contributors
51
Language
Python
Downloads (7d)
29

pypi/nanochat

Score updated May 14, 2026

Related projects

AITER is AMD's open-source library of high-performance building blocks that make AI models run faster on AMD hardware, supporting everything from basic AI operations to complex training and multi-GPU coordination. Think of it as a toolbox that lets AI software teams tap into AMD's chip capabilities without having to write low-level hardware code themselves.

// why it matters As AI infrastructure costs soar, builders are actively exploring alternatives to Nvidia's dominant GPU ecosystem, and AMD is positioning AITER as the key compatibility layer that makes switching or diversifying hardware more practical. For founders and PMs building AI products, this means AMD GPUs become a more credible option for cost reduction or supply chain diversification — especially relevant as demand for AI compute continues to outpace supply.

Python463 stars354 forks200 contrib

TorchBench is a standardized testing suite that measures how fast and efficiently PyTorch — Meta's popular AI training software — runs across different models and hardware configurations. It gives AI developers a consistent way to compare performance improvements or regressions when making changes to their AI infrastructure.

// why it matters For teams building AI-powered products, performance benchmarking directly impacts infrastructure costs and the speed at which models can be trained and deployed — slower AI means higher cloud bills and longer time-to-market. With over 1,000 stars and 250+ contributors, this tool signals that performance measurement is a serious, collaborative concern in the AI ecosystem, making it relevant for any founder evaluating the true cost and efficiency of their AI stack.

Python1.0k stars341 forks253 contrib

ROCm Libraries is a centralized collection of software building blocks that power AI and machine learning workloads on AMD graphics cards, consolidated into a single repository for easier development. It serves as the foundational layer that tools like PyTorch rely on to run efficiently on AMD hardware.

// why it matters As AI infrastructure spending diversifies beyond Nvidia, having a mature, well-organized AMD software ecosystem lowers the barrier for companies to build on lower-cost or more accessible GPU alternatives. Builders and investors evaluating AMD-based AI infrastructure should watch this project as a signal of AMD's software readiness to compete seriously in the AI hardware market.

Assembly364 stars320 forks1059 contrib

Neuro SAN Studio is a sandbox environment for building networks of AI agents that work together to solve complex tasks — think of it like assembling a team of specialized AI workers that coordinate with each other, rather than relying on a single AI to do everything. Builders configure these agent teams using simple text-based files, meaning you can design sophisticated AI workflows without writing much code.

// why it matters As AI products move beyond single-chatbot experiences toward systems where multiple AI agents handle different parts of a workflow, having an open-source framework to prototype and test these systems dramatically lowers the cost and time to build them. For founders and product teams, this means faster experimentation with complex AI-powered features that could otherwise require significant engineering investment.

Python526 stars182 forks23 contrib
// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.