| --- |
| license: mit |
| datasets: |
| - VDC-team/DialoguesEN-4k |
| language: |
| - en |
| tags: |
| - English |
| - LM |
| - 60M |
| - small_talk |
| - smalltalk |
| - VDC |
| - Text |
| --- |
|  |
| <div align="center"> |
|
|
| # π€ SmallDront-60M |
|
|
| *A lightweight conversational AI model designed for natural small talk* |
|
|
| [](https://opensource.org/licenses/MIT) |
| [](https://huggingface.co/) |
| []() |
| []() |
|
|
| </div> |
|
|
| --- |
|
|
| ## π Overview |
|
|
| **SmallDront-60M** is a compact language model specifically fine-tuned for engaging small talk conversations. With 60 million parameters in F32 precision, it strikes a balance between performance and efficiency β delivering coherent, natural dialogue without the overhead of larger models. |
|
|
| > *Compared to its predecessor (SmallDront-20M), this model properly ends its responses and hallucinates significantly less.* |
|
|
| --- |
|
|
| ## β¨ Features |
|
|
| | Feature | Detail | |
| |---------|--------| |
| | π§ **Architecture** | Transformer-based | |
| | π’ **Parameters** | +-60,000,000 | |
| | π **Precision** | F32 (Full float32) | |
| | π€ **Tokenizer** | **GPT-2 Tokenizer** | |
| | π¦ **Format** | Hugging Face | |
| | π·οΈ **Special Tokens** | `<\|user\|>` `<\|assistant\|>` | |
|
|
| --- |
|
|
| ## π Training |
|
|
| The model was trained on the **[VDC-team/DialoguesEN-4k](https://huggingface.co/datasets/VDC-team/DialoguesEN-4k)** dataset until reaching a loss of **0.9**. |
|
|
| --- |
|
|
| ## π¬ Example Conversations |
|
|
| *Temperature: 0.3* |
|
|
| ```text |
| You: Who are you? |
| Assistant: Just a friendly chat. You? |
| |
| You: Hello! |
| Assistant: Hey! What's a place you feel at? |
| |
| You: Where you? |
| Assistant: Hi, for a chat. You? |
| |
| ``` |
|
|
| **use.py** - use model example |