metadata
license: mit
datasets:
  - VDC-team/DialoguesEN-2k
language:
  - en
tags:
  - VDC
  - VDront
  - LM
  - LLM
  - 20m
  - english

SmallDront-20m

SmallDront-20m is a lightweight 20M-parameter language model fine-tuned for English small talk. It improves on our previous model, VDrontV2-mini, with more coherent and engaging conversations.

🧠 Model Details

  • Architecture: GPT-style
  • Parameters: 20,000,000
  • Tokenizer: GPT2 tokenizer
  • Special Tokens: <|user|> and <|assistant|>
  • Training Data: DialoguesEN-2k
  • Focus Topic: Small talk / casual conversation
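The two special tokens listed above suggest a simple turn-based prompt layout. The following is a minimal sketch of how such a prompt might be assembled. The exact template (no whitespace or newlines between tokens and text) is an assumption, and `build_prompt` is a hypothetical helper, so check `use.py` for the layout the model was actually trained on.

```python
# Special tokens named in the model card.
USER_TOKEN = "<|user|>"
ASSISTANT_TOKEN = "<|assistant|>"


def build_prompt(history, user_message):
    """Assemble a prompt from past turns plus the new user message.

    history: list of (user_text, assistant_text) pairs from earlier turns.
    The concatenation format used here is an assumption, not a documented
    template for this model.
    """
    parts = []
    for user_text, assistant_text in history:
        parts.append(USER_TOKEN + user_text.strip())
        parts.append(ASSISTANT_TOKEN + assistant_text.strip())
    # End with the assistant token so the model continues as the assistant.
    parts.append(USER_TOKEN + user_message.strip())
    parts.append(ASSISTANT_TOKEN)
    return "".join(parts)
```

For a first message with no history, `build_prompt([], "Hey")` produces `<|user|>Hey<|assistant|>`.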

✨ Key Improvements

  • Better at handling greetings and opening lines
  • More natural conversation flow
  • Often asks follow-up questions (though these are sometimes absurd)
  • Lower loss: successful example runs reached a loss of 0.5

🗣️ Example Dialogues (loss 0.5)

You: Hey
Assistant: Hey there, what's the latest news in your world? What did you? ... (followed by informational noise)

You: Hi
Assistant: Hello, got any plans for this day.

You: Hello!
Assistant: Hey! What's a favorite memory of yours? You? How about you?

Note: The model tends to generate informational noise after its initial questions. This is a known tradeoff in this version.
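One way to work around the trailing noise is to post-process the generated text and keep only the opening sentence or two. The sketch below is not part of the model or of `use.py`. `trim_reply` is a hypothetical helper that cuts the text at the first special token and then keeps the first `max_sentences` sentences, using a simple punctuation-based split.

```python
import re

SPECIAL_TOKENS = ("<|user|>", "<|assistant|>")


def trim_reply(text, max_sentences=1):
    """Keep only the first few sentences of a generated reply."""
    # Drop anything from the first special token onward, since that
    # would be the start of a new (hallucinated) turn.
    for token in SPECIAL_TOKENS:
        idx = text.find(token)
        if idx != -1:
            text = text[:idx]
    # Naive sentence split: a run of non-terminators followed by
    # one or more of ., ! or ?.
    sentences = re.findall(r"[^.!?]+[.!?]+", text)
    if not sentences:
        return text.strip()
    return "".join(sentences[:max_sentences]).strip()
```

Applied to the first example above with the default `max_sentences=1`, this keeps only "Hey there, what's the latest news in your world?". The split is deliberately crude and will mis-handle abbreviations such as "e.g.", which seems acceptable for short small-talk replies.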

🚀 Usage

See use.py for a simple example of loading and interacting with the model.
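If the checkpoint is available in a Transformers-compatible format, loading and querying it might look roughly like the sketch below. This is not the contents of `use.py`. The repository id `your-namespace/SmallDront-20m` is a placeholder, the prompt layout is assumed, and the sampling settings (`top_p=0.9`, 40 new tokens) are illustrative choices rather than recommended values.

```python
# Placeholder repository id (an assumption): replace with the real location.
MODEL_ID = "your-namespace/SmallDront-20m"


def generate_reply(user_message, model_id=MODEL_ID, max_new_tokens=40):
    """Generate one assistant reply for a single user message."""
    # Imported lazily so that defining this function does not
    # require the transformers package to be installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    # Assumed prompt layout built from the model's special tokens.
    prompt = f"<|user|>{user_message}<|assistant|>"
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        top_p=0.9,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Decode only the newly generated tokens, not the prompt itself.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `generate_reply("Hey")` would then return a single sampled reply. Keeping `max_new_tokens` small limits how much trailing noise the model can produce.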

HuggingFace Format