Spaces:
Running
Running
File size: 4,720 Bytes
15b8bb5 e163f7d c4c0416 e163f7d b0029d0 ee8de2b 3723a68 71d9ae6 3723a68 71d9ae6 3723a68 71d9ae6 3723a68 71d9ae6 8d33272 71d9ae6 8d33272 71d9ae6 8d33272 71d9ae6 d2af4fe 8a7443d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 |
---
title: README
emoji: 🌖
colorFrom: gray
colorTo: yellow
sdk: static
pinned: false
---
<img src="https://huggingface.co/datasets/nanochat-students/images/resolve/main/students.png" alt="nanochat students banner" style="width: 100%; height: 500px; object-fit: cover; object-position: center;">
<div style="background: lightblue; border-radius: 12px; padding: 30px; color: white;; box-shadow: 0 4px 6px rgba(0, 0, 0, 0.1); max-width: 800px; margin: 10px auto;">
<a href="https://huggingface.co/spaces/nanochat-students/README/discussions"><h2 style="margin: 0 0 10px 0; font-size: 28px;">💬 Check out the community here!</h2></a>
</div>
Welcome to the **nanochat students** organization\! This is a community organization for students following Andrej Karpathy's [nanochat course](https://github.com/karpathy/nanochat). We are learning to build a full-stack LLM implementation from tokenization to web serving, all for under $100.
## What is nanochat?
nanochat is a complete implementation of an LLM like ChatGPT in a minimal, hackable codebase. It's designed as the capstone project for the LLM101n course by Eureka Labs, teaching you to build and train your own ChatGPT clone end-to-end.
## What You'll Find Here
This organization hosts community-contributed resources to help you learn and succeed with nanochat. You'll find:
- notebooks that break down the implementation.
- spaces that demo or illustrate the concepts we’re learning.
- trained models and checkpoints from the community
- relevant curated datasets.
## Getting Help and Sharing Ideas
The [Discussions](https://huggingface.co/spaces/nanochat-students/README/discussions) section is where you can ask questions, share your training results and report cards, discuss optimization techniques, and collaborate on experiments.
<!-- ## **Getting Started**
1. **Start with the main repo**: Clone and explore [karpathy/nanochat](https://github.com/karpathy/nanochat)
2. **Run the speedrun**: Follow the quick start guide to train your first $100 model
3. **Explore resources here**: Check out community notebooks and spaces
4. **Join discussions**: Share your results and learn from others
5. **Contribute back**: Share your own notebooks, models, or datasets\! -->
## Contributing
We welcome contributions from all students or experts. Here's how you can help: notebooks, demos, models, and articles:
- Join the org, we'll give you write access.
- If you make anything, share it in this discussion [thread](https://huggingface.co/spaces/nanochat-students/README/discussions/1)
- If you can, help answer questions in [discussions](https://huggingface.co/spaces/nanochat-students/README/discussions)
Let's make this a fun, supportive, and efficient community of learners.
## **Resources**
- nanochat repo - [karpathy/nanochat](https://github.com/karpathy/nanochat)
- Introduction post: ["Introducing nanochat: The best ChatGPT that $100 can buy"](https://github.com/karpathy/nanochat/discussions/1)
---
## Journal!
Check out these steps to join in or get help:
### Day 1
<div style="background: white; border: 1px solid #ddd; border-radius: 4px; padding: 16px; margin-bottom: 12px;">
<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/6" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">1. Environment Setup →</a></div>
<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
Support on your Python environment using uv, create a virtual environment, and install all necessary dependencies for the nanochat project.
</div>
<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/3" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">2. Tokenizer Training→</a>
</div>
<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
Train a custom BPE tokenizer using Rust bindings.
</div>
<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/2" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">3. Pre-training →</a></div>
<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
Base training across 8 GPUs using torchrun, with metrics tracked in a shared trackio space below.
</div>
</div>

|