VDC-team
/

SmallDront-20m

Model card Files Files and versions

SmallDront-20m / README.md

MishaGGG's picture

Update README.md

c5a6d5b verified 5 days ago

|

History Blame Contribute Delete

1.49 kB

	---
	license: mit
	datasets:
	- VDC-team/DialoguesEN-2k
	language:
	- en
	tags:
	- VDC
	- VDront
	- LM
	- LLM
	- 20m
	- english
	---

	# SmallDront-20m
	![](preview.jpg)

	SmallDront-20m is a lightweight 20M parameter LM model fine-tuned for small talk in English. It improves upon our previous model, `VDrontV2-mini`, delivering more coherent and engaging conversations.

	## 🧠 Model Details

	- Architecture: GPT-style
	- Parameters: 20,000,000
	- Tokenizer: GPT2 tokenizer
	- Special Tokens: `<\|user\|>` and `<\|assistant\|>`
	- Training Data: [DialoguesEN-2k](https://huggingface.co/datasets/VDC-team/DialoguesEN-2k)
	- Focus Topic: Small talk / casual conversation

	## ✨ Key Improvements

	- Better at handling greetings and opening lines
	- More natural conversation flow
	- Often asks follow-up questions (though sometimes absurd)
	- Lower loss — example successful runs at loss 0.5

	## 🗣️ Example Dialogues (loss 0.5)

	> You: Hey
	> Assistant: Hey there, what's the latest news in your world? What did you? ... (then follows info noise)

	> You: Hi
	> Assistant: Hello, got any plans for this day.

	> You: Hello!
	> Assistant: Hey! What's a favorite memory of yours? You? How about you?

	Note: The model tends to generate informative noise after initial questions — a known tradeoff in this version.

	## 🚀 Usage

	Use `use.py` as a simple example for loading and interacting with the model.

	### HuggingFace Format