#python
nanochat lets you train your own GPT-2 level LLM—like a basic ChatGPT—for just $73 in 3 hours on an 8xH100 GPU node using the simple speedrun.sh script. It handles tokenization, pretraining, finetuning, evaluation, inference, and a web chat UI in minimal, hackable code. You benefit by easily building, customizing, and chatting with your personal AI on a tiny budget, learning LLM mechanics hands-on without complex setups.
https://github.com/karpathy/nanochat
nanochat lets you train your own GPT-2 level LLM—like a basic ChatGPT—for just $73 in 3 hours on an 8xH100 GPU node using the simple speedrun.sh script. It handles tokenization, pretraining, finetuning, evaluation, inference, and a web chat UI in minimal, hackable code. You benefit by easily building, customizing, and chatting with your personal AI on a tiny budget, learning LLM mechanics hands-on without complex setups.
https://github.com/karpathy/nanochat
GitHub
GitHub - karpathy/nanochat: The best ChatGPT that $100 can buy.
The best ChatGPT that $100 can buy. Contribute to karpathy/nanochat development by creating an account on GitHub.
❤1