File size: 4,081 Bytes
0cd28b8
 
 
 
 
 
 
 
 
 
 
 
 
 
fbd2b5c
 
0cd28b8
 
 
 
 
fbd2b5c
0cd28b8
fbd2b5c
0cd28b8
 
 
 
fbd2b5c
 
0cd28b8
fbd2b5c
0cd28b8
 
 
fbd2b5c
 
 
 
 
 
 
 
 
 
 
 
0cd28b8
 
 
fbd2b5c
 
 
 
 
 
 
 
 
0cd28b8
 
 
 
 
fbd2b5c
 
 
 
0cd28b8
 
 
 
 
 
 
 
 
fbd2b5c
0cd28b8
fbd2b5c
 
 
 
 
0cd28b8
fbd2b5c
0cd28b8
fbd2b5c
 
 
 
 
 
 
 
 
0cd28b8
 
fbd2b5c
0cd28b8
 
 
 
 
 
 
 
 
 
 
fbd2b5c
 
 
 
 
 
 
0cd28b8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
fbd2b5c
0cd28b8
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
---
license: apache-2.0
library_name: pytorch
tags:
  - neuroscience
  - neural-network
  - connectome
  - brain
  - h01
  - cortex
  - biology
  - human-brain
  - temporal-cortex
  - brain-inspired
  - spiking-neural-network
  - reservoir-computing
pipeline_tag: other
datasets:
  - google/h01-release
---

# Nauro — H01 Human Cortex Connectome (Full)

The **complete** neuron-to-neuron connectivity matrix extracted from a
nanometer-resolution reconstruction of **human temporal cortex**
([H01 dataset](https://h01-release.storage.googleapis.com/data.html),
Google/Harvard/Lichtman Lab).

Built from all 166 Avro synapse shards (~32 GB raw data), filtered at
≥0.50 confidence. This is the full connectome — no spatial cropping.

## Summary

| Property | Value |
|----------|-------|
| Neurons | 16,087 |
| Excitatory | 10,531 (65.4%) |
| Inhibitory | 4,688 (29.1%) |
| Non-zero connections | 76,903 |
| Raw edges (pre-aggregation) | 116,611 |
| Connectivity density | 0.030% |
| Mean in-degree | 4.8 |
| Max in-degree | 70 |
| External inputs (total) | 27,022,313 |
| Volume | Full 1 mm³ |
| Cortical layers | L1–L6 + white matter |
| Build | All 166 GCS shards, min_confidence=0.50 |

## Connectivity by cortical layer

| Layer | Neurons | Exc | Inh | Internal connections | Density |
|-------|---------|-----|-----|---------------------|---------|
| Layer 1 | 827 | 85 | 586 | 55 | 0.008% |
| Layer 2 | 4,656 | 2,952 | 1,594 | 21,845 | 0.101% |
| Layer 3 | 2,692 | 1,673 | 965 | 11,018 | 0.152% |
| Layer 4 | 3,440 | 2,622 | 688 | 8,748 | 0.074% |
| Layer 5 | 2,313 | 1,665 | 505 | 6,252 | 0.117% |
| Layer 6 | 1,077 | 906 | 128 | 4,419 | 0.381% |
| White matter | 648 | 395 | 111 | 732 | 0.174% |

## Degree distribution

| Metric | In-degree | Out-degree |
|--------|-----------|------------|
| Mean | 4.8 | 4.8 |
| Std | 6.3 | 7.0 |
| Median | 3.0 | 2.0 |
| Max | 70 | 124 |

## Quick start

```python
import json, numpy as np, torch
from safetensors.torch import load_file

# Load everything
config  = json.load(open("config.json"))
weights = load_file("connectome.safetensors")["weights"]  # (16087, 16087)
meta    = np.load("metadata.npz", allow_pickle=True)
edges   = np.load("edges.npz")["edges"]                   # (116611, 3)

print(f"{config['n_neurons']} neurons, {config['n_synapses']} connections")
print(f"Weight matrix: {weights.shape}, density: {config['density']:.4%}")
```

### Load via HuggingFace Hub

```python
from huggingface_hub import hf_hub_download
from safetensors.torch import load_file
import numpy as np

repo = "NathanRoll/h01-cortex-snn"
weights = load_file(hf_hub_download(repo, "connectome.safetensors"))["weights"]
meta = np.load(hf_hub_download(repo, "metadata.npz"), allow_pickle=True)
print(f"Loaded {weights.shape[0]} neurons")
```

### Reconstruct from edge list

```python
N = config["n_neurons"]
W = torch.zeros(N, N)
for pre, post, stype in edges:
    W[post, pre] += 1.0
# W[i, j] = number of synapses from neuron j → neuron i
```

## Files

| File | Description | Size |
|------|-------------|------|
| `connectome.safetensors` | Full 16,087×16,087 weight matrix | ~1 GB |
| `edges.npz` | Raw edge list `[pre, post, type]` | ~0.6 MB |
| `metadata.npz` | Positions, cell types, layers, segment IDs | ~0.3 MB |
| `somas_filtered.csv` | Neuron table (positions, types, layers) | ~1.1 MB |
| `config.json` | Build parameters + summary statistics | small |
| `layer_stats.json` | Per-layer connectivity statistics | small |

## Data source

The connectome data is from the
[H01 release](https://h01-release.storage.googleapis.com/data.html)
by Google Research and the Lichtman Laboratory at Harvard University.
The original 1.4 petabyte dataset was imaged via serial-section electron
microscopy at 4 nm × 4 nm × 33 nm resolution.

> Shapson-Coe, A. et al. "A petavoxel fragment of human cerebral cortex
> reconstructed at nanoscale resolution." *Science* 384, eadk4858 (2024).

## License

Apache 2.0. The underlying H01 data is subject to
[Google's release terms](https://h01-release.storage.googleapis.com/data.html).