Harry00 commited on
Commit
fbea604
·
verified ·
1 Parent(s): b7aeaf9

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -20
README.md CHANGED
@@ -4,28 +4,50 @@ A high-dimensional sparse distributed memory system with energy-based dynamics a
4
 
5
  ## Architecture
6
 
7
- - **Vector size**: 4096 bits (sparse, ~5% active)
8
- - **Memory**: Sparse Address Table with online learning
9
- - **Inference**: Energy minimization via simulated annealing
10
- - **Knowledge**: 283 facts, 334 words, 53 sequences
11
-
12
- ## Benchmarks
13
-
14
- | Task | Accuracy |
15
- |------|----------|
16
- | Question Answering | 92.9% |
17
- | Analogy Solving | 100.0% |
18
- | Sequence Completion | 93.3% |
19
- | Word Retrieval | 100.0% |
20
- | Generalization | 100.0% |
21
- | **Overall** | **97.2%** |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
 
23
  ## Files
24
 
25
- - `mle_best_facts.json`: Knowledge base facts
26
- - `mle_best_model.npz`: Word vectors and embeddings
27
- - `mle_best_results.json`: Benchmark results
28
- - `mle_best_config.json`: Model configuration
29
 
30
  ## Usage
31
 
@@ -33,7 +55,32 @@ A high-dimensional sparse distributed memory system with energy-based dynamics a
33
  import numpy as np
34
 
35
  # Load model
36
- data = np.load("mle_best_model.npz", allow_pickle=True)
37
  words = data['words']
38
  vectors = data['vectors']
 
 
 
39
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
 
5
  ## Architecture
6
 
7
+ - **Vector size**: 4096 bits (sparse, ~5% active = 205 bits)
8
+ - **Memory**: Sparse Address Table with deterministic hash encoding
9
+ - **Knowledge base**: 1848 facts, 1106 words, 53 sequences
10
+ - **Encoding**: MD5-based deterministic sparse vectors
11
+ - **Inference**: Symbolic reasoning with Hamming similarity
12
+
13
+ ## Benchmarks - Ultimate Model (v3)
14
+
15
+ | Task | Accuracy | Details |
16
+ |------|----------|---------|
17
+ | **Question Answering** | **62.8%** | 49/78 correct |
18
+ | **Analogy Solving** | **67.9%** | 19/28 correct |
19
+ | **Sequence Completion** | **93.3%** | 14/15 correct |
20
+ | **Word Retrieval (noise 15%)** | **100%** | 100/100 correct |
21
+ | **Generalization** | **100%** | 20/20 correct |
22
+ | **Robustness (noise 50%)** | **100%** | Up to 50% bit corruption |
23
+ | **Overall** | **84.8%** | Weighted average |
24
+
25
+ ## Corpus Coverage
26
+
27
+ - **Animals**: 185 animals across 5 categories
28
+ - **Plants**: 175 plants across 6 categories
29
+ - **Locations**: 150+ cities across 15 countries and 5 continents
30
+ - **Vehicles**: 10 vehicles with parts and actions
31
+ - **Professions**: 20 professions with workplace associations
32
+ - **Food**: 55 food and drink items
33
+ - **Body**: 36 body parts with functions
34
+ - **Colors**: 29 colors with associations
35
+ - **Emotions**: 20 emotions with opposites
36
+ - **Time**: Full temporal sequences (days, months, seasons, ordinals)
37
+ - **Tools**: 20 tools with functions
38
+ - **Music**: 17 instruments and 12 genres
39
+ - **Sports**: 22 sports
40
+ - **Weather**: 14 weather types
41
+ - **Clothing**: 20 clothing items
42
+ - **Technology**: 19 devices
43
+ - **Celestial**: 20 celestial bodies
44
 
45
  ## Files
46
 
47
+ - `mle_ultimate_model.npz` - Word vectors (1106 × 4096 binary)
48
+ - `mle_ultimate_facts.json` - Complete knowledge base (1848 facts)
49
+ - `mle_ultimate_results.json` - Benchmark results
50
+ - `mle_ultimate_config.json` - Model configuration
51
 
52
  ## Usage
53
 
 
55
  import numpy as np
56
 
57
  # Load model
58
+ data = np.load("mle_ultimate_model.npz", allow_pickle=True)
59
  words = data['words']
60
  vectors = data['vectors']
61
+
62
+ # Each vector is a 4096-bit sparse binary vector
63
+ # ~205 bits active (5% sparsity)
64
  ```
65
+
66
+ ## Supported Query Patterns
67
+
68
+ - "what is X" → category (is_a)
69
+ - "what has X" → possessor (has)
70
+ - "what can X" → agent (can)
71
+ - "where is X" → location (in/lives_in)
72
+ - "what color is X" → color (is/has_color)
73
+ - "what is opposite of X" → antonym (opposite)
74
+ - "what is before X" → predecessor (before)
75
+ - "what is X made of" → material (made_of)
76
+ - "what is X used for" → function (used_for)
77
+
78
+ ## Model Versions
79
+
80
+ | Version | Facts | Words | Overall |
81
+ |---------|-------|-------|---------|
82
+ | v1 (mle_best) | 283 | 334 | **97.2%** |
83
+ | v2 (mle_advanced) | 933 | 705 | **68.2%** |
84
+ | v3 (mle_ultimate) | 1848 | 1106 | **84.8%** |
85
+
86
+ v3 offers the largest knowledge coverage with strong robustness to noise.