File size: 5,249 Bytes
be361ae
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8bf526c
 
 
 
 
 
67da9bf
c6b4e86
 
67da9bf
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
be361ae
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
---
license: apache-2.0
tags:
  - geofractal-router
  - collective-intelligence
  - math-reasoning
  - multi-stream
  - emergence
  - routing
datasets:
  - openai/gsm8k
base_model:
  - tbs17/MathBERT
  - google-t5/t5-base
pipeline_tag: text-classification
---

# V2 first epoch ready
Looking much stronger. 

![image](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/n-I_F4q4xhj-5sPZrmohJ.png)


# V1 BAD END - V2 using qwen 2.5 math 1.5b
Will the Qwen 2.5 1.5b model cut through the noise?


```
======================================================================
GALAXY BRAIN COLLECTIVE - GSM8K FINAL RESULTS
======================================================================

Streams:
  FUZZY:        MathBERT (symbolic), T5 (linguistic)
  DETERMINISTIC: Eigenspectrum, Cayley-Menger, Symbolic Calc, Fractal

| Epoch | Collective | MathBERT | T5 | Eigen | Cayley | Symbolic | Fractal | ρ |
|-------|------------|----------|-----|-------|--------|----------|---------|-------|
| 1 | 8.9% | 10.4% | 13.0% | 5.4% | 5.4% | 5.7% | 6.7% | 0.686 |
| 2 | 9.6% | 8.9% | 13.0% | 6.5% | 6.5% | 4.6% | 6.1% | 0.738 |
| 3 | 9.9% | 9.6% | 13.6% | 5.8% | 5.8% | 4.5% | 5.8% | 0.728 |
| 4 | 8.3% | 10.1% | 13.3% | 6.1% | 4.5% | 4.8% | 4.5% | 0.619 |
| 5 | 9.0% | 11.1% | 14.6% | 4.5% | 4.6% | 5.0% | 5.9% | 0.620 |

βœ“ Best epoch: 3 with collective accuracy 9.9%
βœ“ All checkpoints available at: https://huggingface.co/AbstractPhil/math_collective
```

# Math Collective - Galaxy Brain Router

**6-stream collective intelligence system for mathematical reasoning.**

## Architecture
```
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                    GALAXY BRAIN COLLECTIVE                   β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚  FUZZY STREAMS (learned representations)                    β”‚
β”‚  β”œβ”€β”€ MathBERT (frozen) β†’ Head A β†’ "symbolic understanding"  β”‚
β”‚  └── T5-base (frozen)  β†’ Head B β†’ "linguistic reasoning"    β”‚
β”‚                                                             β”‚
β”‚  DETERMINISTIC STREAMS (pure computation)                   β”‚
β”‚  β”œβ”€β”€ Eigenspectrum    β†’ Head C β†’ "covariance geometry"      β”‚
β”‚  β”œβ”€β”€ Cayley-Menger    β†’ Head D β†’ "distance geometry"        β”‚
β”‚  β”œβ”€β”€ Symbolic Calc    β†’ Head E β†’ "actual arithmetic"        β”‚
β”‚  └── Fractal Dim      β†’ Head F β†’ "complexity measure"       β”‚
β”‚                                                             β”‚
β”‚  All 6 streams β†’ Fusion β†’ Classifier                        β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
```

## Key Innovation

**Fuzzy + Deterministic Routing**

The router learns WHEN to trust each stream:
- Simple arithmetic? Trust the Symbolic Calculator (deterministic)
- Complex word problem? Trust MathBERT/T5 (semantic)
- Ambiguous? Triangulate across all 6 perspectives

## Streams

| Stream | Type | Source | Purpose |
|--------|------|--------|---------|
| MathBERT | Fuzzy | tbs17/MathBERT (frozen) | Mathematical notation understanding |
| T5-base | Fuzzy | t5-base (frozen) | General language reasoning |
| Eigenspectrum | Deterministic | Covariance eigenvalues | Geometric structure of embeddings |
| Cayley-Menger | Deterministic | Distance matrix geometry | Simplex volume features |
| Symbolic | Deterministic | Regex + arithmetic | Actual number extraction & computation |
| Fractal | Deterministic | Correlation dimension | Problem complexity measure |

## Training

- **Dataset**: GSM8K (Grade School Math 8K)
- **Task**: Answer magnitude bucket prediction (20 buckets)
- **Frozen params**: ~330M (MathBERT + T5)
- **Trainable params**: ~15M (routing heads, fusion, projections)

## Emergence Metric (ρ)
```
ρ = collective_accuracy / max(individual_accuracies)

ρ > 1.0 = emergence (collective outperforms best individual)
```

## Usage
```python
from huggingface_hub import hf_hub_download
import torch

# Download checkpoint
checkpoint_path = hf_hub_download(
    repo_id="AbstractPhil/math_collective",
    filename="checkpoints/checkpoint_epoch_5.pt"
)

# Load and use (see geofractal-router for full implementation)
checkpoint = torch.load(checkpoint_path)
print(f"Epoch: {checkpoint['epoch']}")
print(f"Metrics: {checkpoint['metrics']}")
```

## Related

- **Framework**: [AbstractPhil/geofractal_router](https://huggingface.co/AbstractPhil/geofractal_router)
- **Paper**: Coming soon
- **Code**: [GitHub - geofractal](https://github.com/AbstractEyes/geofractal)

## Citation
```bibtex
@misc{abstractphil2025mathcollective,
  title={Math Collective: Galaxy Brain Routing for Mathematical Reasoning},
  author={AbstractPhil},
  year={2025},
  publisher={Hugging Face},
  url={https://huggingface.co/AbstractPhil/math_collective}
}
```

## License

Apache 2.0