---
license: mit
datasets:
- VDC-team/DialoguesEN-2k
language:
- en
tags:
- VDC
- VDront
- LM
- LLM
- 20m
- english
---
# SmallDront-20m
![](preview.jpg)
**SmallDront-20m** is a lightweight 20M-parameter language model fine-tuned for small talk in English. It improves on our previous model, `VDrontV2-mini`, delivering more coherent and engaging conversations.
## 🧠 Model Details
- **Architecture:** GPT-style
- **Parameters:** 20,000,000
- **Tokenizer:** GPT2 tokenizer
- **Special Tokens:** `<|user|>` and `<|assistant|>`
- **Training Data:** [DialoguesEN-2k](https://huggingface.co/datasets/VDC-team/DialoguesEN-2k)
- **Focus Topic:** Small talk / casual conversation
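The two special tokens listed above suggest a turn-based prompt layout. The following is a minimal sketch of how such a prompt might be assembled. The exact layout (no spaces or newlines between tokens and text) and the helper name `build_prompt` are assumptions for illustration, not taken from this repository; check `use.py` for the format the model was actually trained on.

```python
# Special tokens named in the model details above.
USER_TOKEN = "<|user|>"
ASSISTANT_TOKEN = "<|assistant|>"


def build_prompt(history, user_message):
    """Assemble a prompt string from past turns plus a new user message.

    `history` is a list of (user_text, assistant_text) pairs.
    ASSUMPTION: turns are concatenated with no separator; the real
    training format may differ.
    """
    parts = []
    for user_text, assistant_text in history:
        parts.append(f"{USER_TOKEN}{user_text}{ASSISTANT_TOKEN}{assistant_text}")
    # End with the assistant token so the model continues as the assistant.
    parts.append(f"{USER_TOKEN}{user_message}{ASSISTANT_TOKEN}")
    return "".join(parts)


print(build_prompt([], "Hey"))
```

Ending the prompt with `<|assistant|>` leaves the model to generate the next assistant turn.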
## ✨ Key Improvements
- Better at handling greetings and opening lines
- More natural conversation flow
- Often asks follow-up questions (though these are sometimes absurd)
- Lower loss: successful example runs reached a loss of **0.5**
## 🗣️ Example Dialogues (loss 0.5)
> **You:** Hey
> **Assistant:** Hey there, what's the latest news in your world? What did you? ... *(followed by noisy output)*
> **You:** Hi
> **Assistant:** Hello, got any plans for this day.
> **You:** Hello!
> **Assistant:** Hey! What's a favorite memory of yours? You? How about you?
*Note: The model tends to generate noise (irrelevant or incoherent text) after its initial questions. This is a known tradeoff in this version.*
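One possible way to work around the trailing noise is to trim each reply after its first question. The sketch below is not part of this repository; `trim_reply` is a hypothetical helper, and cutting at the first `?` is a simple heuristic based only on the example dialogues above.

```python
def trim_reply(reply: str) -> str:
    """Keep a reply up to and including its first question mark.

    Replies without a question mark are returned unchanged,
    apart from stripped surrounding whitespace.
    """
    reply = reply.strip()
    cut = reply.find("?")
    if cut == -1:
        return reply
    return reply[: cut + 1]


print(trim_reply("Hey! What's a favorite memory of yours? You? How about you?"))
```

This heuristic discards everything after the first question, so it will also cut off any legitimate follow-up text.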
## 🚀 Usage
See `use.py` for a simple example of loading and interacting with the model.
### HuggingFace Format