david-ar commited on
Commit
d1cbf4e
·
verified ·
1 Parent(s): 00cfc63

Add training dataset link to model card

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -12,6 +12,8 @@ tags:
12
  - twenty-questions
13
  language:
14
  - en
 
 
15
  ---
16
 
17
  # TwentyQ — The World's Smallest Chat Model
@@ -76,6 +78,10 @@ while True:
76
  - **Regular questions**: `Yes`, `No`, `Probably`, `Doubtful`, `Maybe`, `Unknown`
77
  - **Guesses**: `Yes`, `No`, `Close`
78
 
 
 
 
 
79
  ## How It Works
80
 
81
  The model is a weight matrix mapping 156 features (questions) to 1,200 output classes (objects). Each weight is 2 bits encoding polarity and strength. Inference is a scored lookup — no matrix multiplication, no attention, no backprop. Just XOR and addition.
 
12
  - twenty-questions
13
  language:
14
  - en
15
+ datasets:
16
+ - david-ar/20q-dataset
17
  ---
18
 
19
  # TwentyQ — The World's Smallest Chat Model
 
78
  - **Regular questions**: `Yes`, `No`, `Probably`, `Doubtful`, `Maybe`, `Unknown`
79
  - **Guesses**: `Yes`, `No`, `Close`
80
 
81
+ ## Training Data
82
+
83
+ Trained on [`david-ar/20q-dataset`](https://huggingface.co/datasets/david-ar/20q-dataset), a corpus of 9,600 Twenty Questions conversations covering 1,200 objects across 156 features. Answers include graded confidence levels (Yes, No, Probably, Doubtful) rather than binary labels, giving the model finer-grained signal for learning association strengths.
84
+
85
  ## How It Works
86
 
87
  The model is a weight matrix mapping 156 features (questions) to 1,200 output classes (objects). Each weight is 2 bits encoding polarity and strength. Inference is a scored lookup — no matrix multiplication, no attention, no backprop. Just XOR and addition.