Spaces:
Running
Running
| title: README | |
| emoji: 🌖 | |
| colorFrom: gray | |
| colorTo: yellow | |
| sdk: static | |
| pinned: false | |
| <img src="https://huggingface.co/datasets/nanochat-students/images/resolve/main/students.png" alt="nanochat students banner" style="width: 100%; height: 500px; object-fit: cover; object-position: center;"> | |
| <div style="background: lightblue; border-radius: 12px; padding: 30px; color: white;; box-shadow: 0 4px 6px rgba(0, 0, 0, 0.1); max-width: 800px; margin: 10px auto;"> | |
| <a href="https://huggingface.co/spaces/nanochat-students/README/discussions"><h2 style="margin: 0 0 10px 0; font-size: 28px;">💬 Check out the community here!</h2></a> | |
| </div> | |
| Welcome to the **nanochat students** organization\! This is a community organization for students following Andrej Karpathy's [nanochat course](https://github.com/karpathy/nanochat). We are learning to build a full-stack LLM implementation from tokenization to web serving, all for under $100. | |
| ## What is nanochat? | |
| nanochat is a complete implementation of an LLM like ChatGPT in a minimal, hackable codebase. It's designed as the capstone project for the LLM101n course by Eureka Labs, teaching you to build and train your own ChatGPT clone end-to-end. | |
| ## What You'll Find Here | |
| This organization hosts community-contributed resources to help you learn and succeed with nanochat. You'll find: | |
| - notebooks that break down the implementation. | |
| - spaces that demo or illustrate the concepts we’re learning. | |
| - trained models and checkpoints from the community | |
| - relevant curated datasets. | |
| ## Getting Help and Sharing Ideas | |
| The [Discussions](https://huggingface.co/spaces/nanochat-students/README/discussions) section is where you can ask questions, share your training results and report cards, discuss optimization techniques, and collaborate on experiments. | |
| <!-- ## **Getting Started** | |
| 1. **Start with the main repo**: Clone and explore [karpathy/nanochat](https://github.com/karpathy/nanochat) | |
| 2. **Run the speedrun**: Follow the quick start guide to train your first $100 model | |
| 3. **Explore resources here**: Check out community notebooks and spaces | |
| 4. **Join discussions**: Share your results and learn from others | |
| 5. **Contribute back**: Share your own notebooks, models, or datasets\! --> | |
| ## Contributing | |
| We welcome contributions from all students or experts. Here's how you can help: notebooks, demos, models, and articles: | |
| - Join the org, we'll give you write access. | |
| - If you make anything, share it in this discussion [thread](https://huggingface.co/spaces/nanochat-students/README/discussions/1) | |
| - If you can, help answer questions in [discussions](https://huggingface.co/spaces/nanochat-students/README/discussions) | |
| Let's make this a fun, supportive, and efficient community of learners. | |
| ## **Resources** | |
| - nanochat repo - [karpathy/nanochat](https://github.com/karpathy/nanochat) | |
| - Introduction post: ["Introducing nanochat: The best ChatGPT that $100 can buy"](https://github.com/karpathy/nanochat/discussions/1) | |
| --- | |
| ## Journal! | |
| Check out these steps to join in or get help: | |
| ### Day 1 | |
| <div style="background: white; border: 1px solid #ddd; border-radius: 4px; padding: 16px; margin-bottom: 12px;"> | |
| <div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/6" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">1. Environment Setup →</a></div> | |
| <div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;"> | |
| Support on your Python environment using uv, create a virtual environment, and install all necessary dependencies for the nanochat project. | |
| </div> | |
| <div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/3" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">2. Tokenizer Training→</a> | |
| </div> | |
| <div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;"> | |
| Train a custom BPE tokenizer using Rust bindings. | |
| </div> | |
| <div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/2" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">3. Pre-training →</a></div> | |
| <div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;"> | |
| Base training across 8 GPUs using torchrun, with metrics tracked in a shared trackio space below. | |
| </div> | |
| </div> | |
|  | |