Upload README.md
Browse files
README.md
CHANGED
|
@@ -4,28 +4,50 @@ A high-dimensional sparse distributed memory system with energy-based dynamics a
|
|
| 4 |
|
| 5 |
## Architecture
|
| 6 |
|
| 7 |
-
- **Vector size**: 4096 bits (sparse, ~5% active)
|
| 8 |
-
- **Memory**: Sparse Address Table with
|
| 9 |
-
- **
|
| 10 |
-
- **
|
| 11 |
-
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
|
| 16 |
-
|
|
| 17 |
-
|
|
| 18 |
-
|
|
| 19 |
-
|
|
| 20 |
-
|
|
| 21 |
-
| **
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
|
| 23 |
## Files
|
| 24 |
|
| 25 |
-
- `
|
| 26 |
-
- `
|
| 27 |
-
- `
|
| 28 |
-
- `
|
| 29 |
|
| 30 |
## Usage
|
| 31 |
|
|
@@ -33,7 +55,32 @@ A high-dimensional sparse distributed memory system with energy-based dynamics a
|
|
| 33 |
import numpy as np
|
| 34 |
|
| 35 |
# Load model
|
| 36 |
-
data = np.load("
|
| 37 |
words = data['words']
|
| 38 |
vectors = data['vectors']
|
|
|
|
|
|
|
|
|
|
| 39 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
|
| 5 |
## Architecture
|
| 6 |
|
| 7 |
+
- **Vector size**: 4096 bits (sparse, ~5% active = 205 bits)
|
| 8 |
+
- **Memory**: Sparse Address Table with deterministic hash encoding
|
| 9 |
+
- **Knowledge base**: 1848 facts, 1106 words, 53 sequences
|
| 10 |
+
- **Encoding**: MD5-based deterministic sparse vectors
|
| 11 |
+
- **Inference**: Symbolic reasoning with Hamming similarity
|
| 12 |
+
|
| 13 |
+
## Benchmarks - Ultimate Model (v3)
|
| 14 |
+
|
| 15 |
+
| Task | Accuracy | Details |
|
| 16 |
+
|------|----------|---------|
|
| 17 |
+
| **Question Answering** | **62.8%** | 49/78 correct |
|
| 18 |
+
| **Analogy Solving** | **67.9%** | 19/28 correct |
|
| 19 |
+
| **Sequence Completion** | **93.3%** | 14/15 correct |
|
| 20 |
+
| **Word Retrieval (noise 15%)** | **100%** | 100/100 correct |
|
| 21 |
+
| **Generalization** | **100%** | 20/20 correct |
|
| 22 |
+
| **Robustness (noise 50%)** | **100%** | Up to 50% bit corruption |
|
| 23 |
+
| **Overall** | **84.8%** | Weighted average |
|
| 24 |
+
|
| 25 |
+
## Corpus Coverage
|
| 26 |
+
|
| 27 |
+
- **Animals**: 185 animals across 5 categories
|
| 28 |
+
- **Plants**: 175 plants across 6 categories
|
| 29 |
+
- **Locations**: 150+ cities across 15 countries and 5 continents
|
| 30 |
+
- **Vehicles**: 10 vehicles with parts and actions
|
| 31 |
+
- **Professions**: 20 professions with workplace associations
|
| 32 |
+
- **Food**: 55 food and drink items
|
| 33 |
+
- **Body**: 36 body parts with functions
|
| 34 |
+
- **Colors**: 29 colors with associations
|
| 35 |
+
- **Emotions**: 20 emotions with opposites
|
| 36 |
+
- **Time**: Full temporal sequences (days, months, seasons, ordinals)
|
| 37 |
+
- **Tools**: 20 tools with functions
|
| 38 |
+
- **Music**: 17 instruments and 12 genres
|
| 39 |
+
- **Sports**: 22 sports
|
| 40 |
+
- **Weather**: 14 weather types
|
| 41 |
+
- **Clothing**: 20 clothing items
|
| 42 |
+
- **Technology**: 19 devices
|
| 43 |
+
- **Celestial**: 20 celestial bodies
|
| 44 |
|
| 45 |
## Files
|
| 46 |
|
| 47 |
+
- `mle_ultimate_model.npz` - Word vectors (1106 × 4096 binary)
|
| 48 |
+
- `mle_ultimate_facts.json` - Complete knowledge base (1848 facts)
|
| 49 |
+
- `mle_ultimate_results.json` - Benchmark results
|
| 50 |
+
- `mle_ultimate_config.json` - Model configuration
|
| 51 |
|
| 52 |
## Usage
|
| 53 |
|
|
|
|
| 55 |
import numpy as np
|
| 56 |
|
| 57 |
# Load model
|
| 58 |
+
data = np.load("mle_ultimate_model.npz", allow_pickle=True)
|
| 59 |
words = data['words']
|
| 60 |
vectors = data['vectors']
|
| 61 |
+
|
| 62 |
+
# Each vector is a 4096-bit sparse binary vector
|
| 63 |
+
# ~205 bits active (5% sparsity)
|
| 64 |
```
|
| 65 |
+
|
| 66 |
+
## Supported Query Patterns
|
| 67 |
+
|
| 68 |
+
- "what is X" → category (is_a)
|
| 69 |
+
- "what has X" → possessor (has)
|
| 70 |
+
- "what can X" → agent (can)
|
| 71 |
+
- "where is X" → location (in/lives_in)
|
| 72 |
+
- "what color is X" → color (is/has_color)
|
| 73 |
+
- "what is opposite of X" → antonym (opposite)
|
| 74 |
+
- "what is before X" → predecessor (before)
|
| 75 |
+
- "what is X made of" → material (made_of)
|
| 76 |
+
- "what is X used for" → function (used_for)
|
| 77 |
+
|
| 78 |
+
## Model Versions
|
| 79 |
+
|
| 80 |
+
| Version | Facts | Words | Overall |
|
| 81 |
+
|---------|-------|-------|---------|
|
| 82 |
+
| v1 (mle_best) | 283 | 334 | **97.2%** |
|
| 83 |
+
| v2 (mle_advanced) | 933 | 705 | **68.2%** |
|
| 84 |
+
| v3 (mle_ultimate) | 1848 | 1106 | **84.8%** |
|
| 85 |
+
|
| 86 |
+
v3 offers the largest knowledge coverage with strong robustness to noise.
|