Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
codelion 's Collections
Dhara Foundational Models
Sutra Pedagogical Datasets
Nano Language Models
Pre-training Dataset Samples
Ellora
Pivotal Token Search
Internal Coherence Maximization
Securade.ai

Nano Language Models

updated 1 day ago

A collection of really small language models pre-trained from scratch with open-data. Ideal for use in experimentation and comparisions.

Upvote
1

  • codelion/SmolLM2-70M

    Text Generation • 69.2M • Updated 1 day ago • 26 • 3

  • codelion/malm-165m

    Text Generation • Updated Jan 23 • 28 • 4

  • codelion/dhara-70m

    Text Generation • 71.3M • Updated Dec 30, 2025 • 452 • 47

  • codelion/gpt-2-70m

    Text Generation • 64.1M • Updated Nov 2, 2025 • 203 • 20
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs