File size: 2,931 Bytes
8ec2e8a
 
 
 
 
 
fbea604
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8ec2e8a
 
 
fbea604
 
 
 
8ec2e8a
 
 
 
 
 
 
fbea604
8ec2e8a
 
fbea604
 
 
8ec2e8a
fbea604
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
# MLE Competitive Model

A high-dimensional sparse distributed memory system with energy-based dynamics and knowledge-based question-answering capabilities.

## Architecture

- **Vector size**: 4096 bits (sparse, ~5% active = 205 bits)
- **Memory**: Sparse Address Table with deterministic hash encoding
- **Knowledge base**: 1848 facts, 1106 words, 53 sequences
- **Encoding**: MD5-based deterministic sparse vectors
- **Inference**: Symbolic reasoning with Hamming similarity

## Benchmarks - Ultimate Model (v3)

| Task | Accuracy | Details |
|------|----------|---------|
| **Question Answering** | **62.8%** | 49/78 correct |
| **Analogy Solving** | **67.9%** | 19/28 correct |
| **Sequence Completion** | **93.3%** | 14/15 correct |
| **Word Retrieval (noise 15%)** | **100%** | 100/100 correct |
| **Generalization** | **100%** | 20/20 correct |
| **Robustness (noise 50%)** | **100%** | Up to 50% bit corruption |
| **Overall** | **84.8%** | Weighted average |

## Corpus Coverage

- **Animals**: 185 animals across 5 categories
- **Plants**: 175 plants across 6 categories
- **Locations**: 150+ cities across 15 countries and 5 continents
- **Vehicles**: 10 vehicles with parts and actions
- **Professions**: 20 professions with workplace associations
- **Food**: 55 food and drink items
- **Body**: 36 body parts with functions
- **Colors**: 29 colors with associations
- **Emotions**: 20 emotions with opposites
- **Time**: Full temporal sequences (days, months, seasons, ordinals)
- **Tools**: 20 tools with functions
- **Music**: 17 instruments and 12 genres
- **Sports**: 22 sports
- **Weather**: 14 weather types
- **Clothing**: 20 clothing items
- **Technology**: 19 devices
- **Celestial**: 20 celestial bodies

## Files

- `mle_ultimate_model.npz` - Word vectors (1106 Γ— 4096 binary)
- `mle_ultimate_facts.json` - Complete knowledge base (1848 facts)
- `mle_ultimate_results.json` - Benchmark results
- `mle_ultimate_config.json` - Model configuration

## Usage

```python
import numpy as np

# Load model
data = np.load("mle_ultimate_model.npz", allow_pickle=True)
words = data['words']
vectors = data['vectors']

# Each vector is a 4096-bit sparse binary vector
# ~205 bits active (5% sparsity)
```

## Supported Query Patterns

- "what is X" β†’ category (is_a)
- "what has X" β†’ possessor (has)
- "what can X" β†’ agent (can)
- "where is X" β†’ location (in/lives_in)
- "what color is X" β†’ color (is/has_color)
- "what is opposite of X" β†’ antonym (opposite)
- "what is before X" β†’ predecessor (before)
- "what is X made of" β†’ material (made_of)
- "what is X used for" β†’ function (used_for)

## Model Versions

| Version | Facts | Words | Overall |
|---------|-------|-------|---------|
| v1 (mle_best) | 283 | 334 | **97.2%** |
| v2 (mle_advanced) | 933 | 705 | **68.2%** |
| v3 (mle_ultimate) | 1848 | 1106 | **84.8%** |

v3 offers the largest knowledge coverage with strong robustness to noise.