Spaces:

nanochat-students
/

README

Running

App Files Files Community

README / README.md

burtenshaw HF Staff

Update README.md

c4c0416 verified about 2 months ago

preview code

raw

history blame contribute delete

4.72 kB

	---
	title: README
	emoji: 🌖
	colorFrom: gray
	colorTo: yellow
	sdk: static
	pinned: false
	---

	<img src="https://huggingface.co/datasets/nanochat-students/images/resolve/main/students.png" alt="nanochat students banner" style="width: 100%; height: 500px; object-fit: cover; object-position: center;">
	<div style="background: lightblue; border-radius: 12px; padding: 30px; color: white;; box-shadow: 0 4px 6px rgba(0, 0, 0, 0.1); max-width: 800px; margin: 10px auto;">
	<a href="https://huggingface.co/spaces/nanochat-students/README/discussions"><h2 style="margin: 0 0 10px 0; font-size: 28px;">💬 Check out the community here!</h2></a>
	</div>

	Welcome to the nanochat students organization\! This is a community organization for students following Andrej Karpathy's [nanochat course](https://github.com/karpathy/nanochat). We are learning to build a full-stack LLM implementation from tokenization to web serving, all for under $100.

	## What is nanochat?

	nanochat is a complete implementation of an LLM like ChatGPT in a minimal, hackable codebase. It's designed as the capstone project for the LLM101n course by Eureka Labs, teaching you to build and train your own ChatGPT clone end-to-end.

	## What You'll Find Here

	This organization hosts community-contributed resources to help you learn and succeed with nanochat. You'll find:
	- notebooks that break down the implementation.
	- spaces that demo or illustrate the concepts we’re learning.
	- trained models and checkpoints from the community
	- relevant curated datasets.

	## Getting Help and Sharing Ideas

	The [Discussions](https://huggingface.co/spaces/nanochat-students/README/discussions) section is where you can ask questions, share your training results and report cards, discuss optimization techniques, and collaborate on experiments.

	<!-- ## Getting Started

	1. Start with the main repo: Clone and explore [karpathy/nanochat](https://github.com/karpathy/nanochat)
	2. Run the speedrun: Follow the quick start guide to train your first $100 model
	3. Explore resources here: Check out community notebooks and spaces
	4. Join discussions: Share your results and learn from others
	5. Contribute back: Share your own notebooks, models, or datasets\! -->

	## Contributing

	We welcome contributions from all students or experts. Here's how you can help: notebooks, demos, models, and articles:

	- Join the org, we'll give you write access.
	- If you make anything, share it in this discussion [thread](https://huggingface.co/spaces/nanochat-students/README/discussions/1)
	- If you can, help answer questions in [discussions](https://huggingface.co/spaces/nanochat-students/README/discussions)

	Let's make this a fun, supportive, and efficient community of learners.

	## Resources

	- nanochat repo - [karpathy/nanochat](https://github.com/karpathy/nanochat)
	- Introduction post: ["Introducing nanochat: The best ChatGPT that $100 can buy"](https://github.com/karpathy/nanochat/discussions/1)

	---

	## Journal!

	Check out these steps to join in or get help:

	### Day 1

	<div style="background: white; border: 1px solid #ddd; border-radius: 4px; padding: 16px; margin-bottom: 12px;">
	<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/6" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">1. Environment Setup →</a></div>
	<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
	Support on your Python environment using uv, create a virtual environment, and install all necessary dependencies for the nanochat project.
	</div>

	<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/3" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">2. Tokenizer Training→</a>
	</div>
	<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
	Train a custom BPE tokenizer using Rust bindings.
	</div>

	<div style="font-weight: 600; font-size: 16px; margin-bottom: 8px; color: #333;"><a href="https://huggingface.co/spaces/nanochat-students/README/discussions/2" style="color: #0066cc; text-decoration: none; font-size: 14px;" target="_blank">3. Pre-training →</a></div>
	<div style="color: #666; font-size: 14px; line-height: 1.5; margin-bottom: 8px;">
	Base training across 8 GPUs using torchrun, with metrics tracked in a shared trackio space below.
	</div>

	</div>


	![image](https://cdn-uploads.huggingface.co/production/uploads/62d648291fa3e4e7ae3fa6e8/k9l3ECubDiU1LkWWzY5UU.png)