File size: 4,720 Bytes
15b8bb5
 
 
 
 
 
 
 
 
e163f7d
c4c0416
 
 
e163f7d
b0029d0
ee8de2b
3723a68
71d9ae6
 
 
3723a68
71d9ae6
 
3723a68
 
 
 
71d9ae6
 
 
 
 
 
 
 
 
 
 
 
 
3723a68
71d9ae6
8d33272
71d9ae6
8d33272
 
 
71d9ae6
8d33272
71d9ae6
 
 
 
d2af4fe
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8a7443d
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
---
title: README
emoji: 🌖
colorFrom: gray
colorTo: yellow
sdk: static
pinned: false
---

<img src="https://huggingface.co/datasets/nanochat-students/images/resolve/main/students.png" alt="nanochat students banner" style="width: 100%; height: 500px; object-fit: cover; object-position: center;">
<div style="background: lightblue; border-radius: 12px; padding: 30px; color: white;; box-shadow: 0 4px 6px rgba(0, 0, 0, 0.1); max-width: 800px; margin: 10px auto;">
      <a href="https://huggingface.co/spaces/nanochat-students/README/discussions"><h2 style="margin: 0 0 10px 0; font-size: 28px;">💬 Check out the community here!</h2></a>
</div>

Welcome to the **nanochat students** organization\! This is a community organization for students following Andrej Karpathy's [nanochat course](https://github.com/karpathy/nanochat). We are learning to build a full-stack LLM implementation from tokenization to web serving, all for under $100.

## What is nanochat?

nanochat is a complete implementation of an LLM like ChatGPT in a minimal, hackable codebase. It's designed as the capstone project for the LLM101n course by Eureka Labs, teaching you to build and train your own ChatGPT clone end-to-end.

## What You'll Find Here

This organization hosts community-contributed resources to help you learn and succeed with nanochat. You'll find:  
- notebooks that break down the implementation.  
- spaces that demo or illustrate the concepts we’re learning.   
- trained models and checkpoints from the community  
- relevant curated datasets.

## Getting Help and Sharing Ideas

The [Discussions](https://huggingface.co/spaces/nanochat-students/README/discussions) section is where you can ask questions, share your training results and report cards, discuss optimization techniques, and collaborate on experiments.

<!-- ## **Getting Started**

1. **Start with the main repo**: Clone and explore [karpathy/nanochat](https://github.com/karpathy/nanochat)  
2. **Run the speedrun**: Follow the quick start guide to train your first $100 model  
3. **Explore resources here**: Check out community notebooks and spaces  
4. **Join discussions**: Share your results and learn from others  
5. **Contribute back**: Share your own notebooks, models, or datasets\! -->

## Contributing

We welcome contributions from all students or experts. Here's how you can help: notebooks, demos, models, and articles:

- Join the org, we'll give you write access.
- If you make anything, share it in this discussion [thread](https://huggingface.co/spaces/nanochat-students/README/discussions/1)
- If you can, help answer questions in [discussions](https://huggingface.co/spaces/nanochat-students/README/discussions)  

Let's make this a fun, supportive, and efficient community of learners. 

## **Resources**

- nanochat repo - [karpathy/nanochat](https://github.com/karpathy/nanochat)  
- Introduction post: ["Introducing nanochat: The best ChatGPT that $100 can buy"](https://github.com/karpathy/nanochat/discussions/1)

---

## Journal!

Check out these steps to join in or get help:

### Day 1

<div style="background: white; border: 1px solid #ddd; border-radius: 4px; padding: 16px; margin-bottom: 12px;">
<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/6" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">1. Environment Setup →</a></div>
<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
    Support on your Python environment using uv, create a virtual environment, and install all necessary dependencies for the nanochat project.
</div>
    
<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/3" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">2. Tokenizer Training→</a>
</div>
<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
    Train a custom BPE tokenizer using Rust bindings.
</div>

<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/2" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">3. Pre-training →</a></div>
<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
    Base training across 8 GPUs using torchrun, with metrics tracked in a shared trackio space below.
</div>

</div>


![image](https://cdn-uploads.huggingface.co/production/uploads/62d648291fa3e4e7ae3fa6e8/k9l3ECubDiU1LkWWzY5UU.png)