metadata
license: mit
datasets:
  - VDC-team/DialoguesEN-4k
language:
  - en
tags:
  - English
  - LM
  - 60M
  - small_talk
  - smalltalk
  - VDC
  - Text

🤖 SmallDront-60M

A lightweight conversational AI model designed for natural small talk

πŸ“ Overview

SmallDront-60M is a compact language model fine-tuned for small talk. With about 60 million parameters stored in F32 precision, it balances output quality against efficiency, delivering coherent, natural dialogue without the overhead of larger models.

Compared to its predecessor (SmallDront-20M), this model properly ends its responses and hallucinates significantly less.
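
As a rough check on the efficiency claim, the weight memory follows directly from the parameter count and precision stated above. The sketch below assumes exactly 60,000,000 parameters (the real count is approximate) and counts weights only, ignoring activations and file overhead. `weight_bytes` is a hypothetical helper name, and the float16 figure is shown only for comparison.

```python
# Rough weight-memory estimate: parameters only, no activations or overhead.
BYTES_PER_PARAM = {"float32": 4, "float16": 2}


def weight_bytes(num_params: int, dtype: str = "float32") -> int:
    """Return the number of bytes needed to store the weights."""
    return num_params * BYTES_PER_PARAM[dtype]


if __name__ == "__main__":
    n = 60_000_000  # assumed round figure; the actual count is approximate
    print(f"float32: {weight_bytes(n) / 1e6:.0f} MB")             # float32: 240 MB
    print(f"float16: {weight_bytes(n, 'float16') / 1e6:.0f} MB")  # float16: 120 MB
```

At F32, the weights alone therefore take roughly 240 MB.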


✨ Features

| Feature | Detail |
|---|---|
| 🧠 Architecture | Transformer-based |
| 🔢 Parameters | ~60,000,000 |
| Precision | F32 (full float32) |
| 🔤 Tokenizer | GPT-2 tokenizer |
| 📦 Format | Hugging Face |
| 🏷️ Special Tokens | `<\|user\|>` `<\|assistant\|>` |
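
The two special tokens suggest a simple turn-based prompt layout. The sketch below shows one plausible way to assemble a prompt. The exact template (token order, spacing, newlines) is an assumption that this README does not confirm, and `build_prompt` is a hypothetical helper, so check use.py for the format the model actually expects.

```python
USER_TOKEN = "<|user|>"
ASSISTANT_TOKEN = "<|assistant|>"


def build_prompt(history: list[tuple[str, str]], user_message: str) -> str:
    """Join earlier (user, assistant) turns and a new message into one prompt.

    Assumed layout: each role token is followed directly by its text, and the
    prompt ends with the assistant token so the model writes the next reply.
    """
    parts = []
    for user_text, assistant_text in history:
        parts.append(f"{USER_TOKEN}{user_text}{ASSISTANT_TOKEN}{assistant_text}")
    parts.append(f"{USER_TOKEN}{user_message}{ASSISTANT_TOKEN}")
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt([], "Hello!"))
    # <|user|>Hello!<|assistant|>
```

Ending the prompt with the assistant token is a common convention for chat-style fine-tunes, but it may differ from how this model was trained.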

🎓 Training

The model was trained on the VDC-team/DialoguesEN-4k dataset until reaching a loss of 0.9.
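
Training to a target loss amounts to a threshold-based stopping rule. The following is a minimal sketch of such a rule, not the authors' training script. `train_step` is a hypothetical callable that runs one optimization step and returns its loss, and the `max_steps` cap is an added safeguard.

```python
from typing import Callable


def train_until(train_step: Callable[[], float],
                target_loss: float = 0.9,
                max_steps: int = 100_000) -> tuple[int, float]:
    """Call train_step until its loss is <= target_loss or max_steps is hit.

    Returns (steps_run, last_loss).
    """
    loss = float("inf")
    for step in range(1, max_steps + 1):
        loss = train_step()
        if loss <= target_loss:
            return step, loss
    return max_steps, loss


if __name__ == "__main__":
    # Stand-in for a real training step: a fixed, decreasing loss sequence.
    fake_losses = iter([2.5, 1.8, 1.2, 0.95, 0.88, 0.80])
    print(train_until(lambda: next(fake_losses)))  # (5, 0.88)
```

A single noisy batch can dip below the threshold early, so a real run might compare a running average of the loss instead.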


💬 Example Conversations

Temperature: 0.3

You: Who are you?
Assistant: Just a friendly chat. You?

You: Hello!
Assistant: Hey! What's a place you feel at?

You: Where you?
Assistant: Hi, for a chat. You?
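
A temperature of 0.3 makes sampling fairly conservative: logits are divided by the temperature before the softmax, which concentrates probability on the top-scoring tokens. The sketch below illustrates the effect with made-up logits. `softmax_with_temperature` is an illustrative helper, not part of this repository.

```python
import math


def softmax_with_temperature(logits: list[float], temperature: float) -> list[float]:
    """Convert logits to probabilities after scaling them by 1 / temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.0]  # hypothetical scores for three candidate tokens
    print(softmax_with_temperature(logits, 1.0))
    print(softmax_with_temperature(logits, 0.3))
```

With these example logits, the top token's probability rises from about 0.67 at temperature 1.0 to about 0.96 at temperature 0.3.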

use.py: an example script showing how to use the model.