raincandy-u
/

TinyChat-1776K

Text Generation

text-generation-inference

Model card Files Files and versions

raincandy-u commited on Jun 13, 2024

Commit

ec7676f

·

verified ·

1 Parent(s): bf714b1

Update README.md

Files changed (1) hide show

README.md +46 -3

README.md CHANGED Viewed

@@ -1,3 +1,46 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+# raincandy-u/TinyChat-1776K
+A tiny LM trained on TinyChat dataset from scratch.
+The aim is to try to achieve natural responses on the smallest possible model. Trained using a dataset of 3 year old children level English conversations.
+Note: It has no world knowledge, so you should not ask it any intellectual questions.
+## Model Spec
+```
+config = AutoConfig.for_model(
+    model_type="llama",
+    hidden_size=192,
+    intermediate_size=640,
+    num_attention_heads=16,
+    num_hidden_layers=3,
+    num_key_value_heads=4,
+    tie_word_embeddings=True,
+    vocab_size=2048,
+    max_position_embeddings=256
+)
+```
+## Template
+```
+<A>Hi, Tom. How are you? <end>
+<B>I'm fine, thank you. And you? <end>
+<A>Fine. What's your favorite color? <end>
+<B>My favorite color is black. <end>
+<A>Do you like cats? <end>
+<B>
+```
+## Generation Param
+```
+top_k=40,
+top_p=0.8,
+temperature=1
+```