Polygl0t's Collections

LilTii

updated 2 days ago

A 0.6B Bengali Language Model that Outperforms Qwen.

  • Polygl0t/LilTii-v0.1

    Text Generation • 0.7B • Updated 2 days ago • 14

    Note 🧱 Base model pretrained only on Bengali text.


  • Polygl0t/LilTii-v0.2

    Text Generation • 0.7B • Updated 2 days ago • 23

    Note 🧱 Base model pretrained on a Bengali + English mixture.


  • Polygl0t/gigakriya-v1

    Viewer • Updated 2 days ago • 41.6M • 42

    Note 📚 Pretraining dataset.


  • Polygl0t/bengali-edu-qwen-annotations

    Viewer • Updated 2 days ago • 320k • 4

    Note 📚 Annotations to train classifiers/filters (Educational).


  • Polygl0t/bengali-toxicity-qwen-annotations

    Viewer • Updated 2 days ago • 320k • 5

    Note 📚 Annotations to train classifiers/filters (Toxicity).


  • Polygl0t/bengali-banglabert-edu-classifier

    Text Classification • 34.7M • Updated 2 days ago • 12

    Note 🎯 Quality Filter (Educational)


  • Polygl0t/bengali-banglabert-toxicity-classifier

    Text Classification • 34.7M • Updated 2 days ago • 14

    Note 🎯 Quality Filter (Toxicity)


  • Polygl0t/tokenizers

    Viewer • Updated 2 days ago • 8.98M • 14

    Note 📚 Data used to train the LilTii tokenizer.
