etwk commited on
Commit
faa14cf
·
1 Parent(s): 3d2c226

Update tier-5 metrics to 0.74 (overall 0.483); refresh 64-bit compliance row

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -17,8 +17,8 @@ multiplication tables. Entry for the
17
  [Modular Arithmetic Challenge](https://github.com/SAIRcompetition/modular-arithmetic-challenge).
18
 
19
  - **Saturates tiers 1–4** (all primes `< 2³²`): tiers 1–3 = 100%, tier 4 = 99%
20
- - **Tier 5** (33–64-bit primes) = 0.64 on the public benchmark
21
- - **overall_accuracy 0.473**, `highest_tier_above_90 = 4`
22
  - Verifiably **generalises to primes never seen in training** (held-out-prime validation
23
  accuracy tracks training accuracy — no memorisation gap)
24
 
@@ -52,7 +52,7 @@ holds the prime:
52
  |---|---|---|---|---|---|---|
53
  | `weights16.pt` | 16-bit | `< 2¹⁶` | 1–3 | 4096 / 4 | ~50M | tiers 1–3 = 1.00 |
54
  | `weights32.pt` | 32-bit | `< 2³²` | 4 | 6144 / 4 | ~114M | tier 4 = 0.99 |
55
- | `weights64.pt` | 64-bit | `< 2⁶⁴` | 5 | 4096 / 7, residual | ~236M | tier 5 = 0.64 |
56
 
57
  The 64-bit cell needs **depth and residual connections** the narrower cells do not: a 64-bit
58
  modular Horner step hides two long carry chains (the `2t + bit·b` addition and the
@@ -106,7 +106,7 @@ cell is *at* the floor. The capability therefore resides in the trained paramete
106
  |---|---|---|---|---|---|---|
107
  | tier 3 (16-bit cell) | 1.00 | 1.00 | 0.98 | 0.74 | 0.06 | 0.00 |
108
  | tier 4 (32-bit cell) | 0.99 | 0.99 | 0.86 | 0.04 | 0.02 | 0.00 |
109
- | tier 5 (64-bit cell) | 0.64 | 0.57 | 0.41 | 0.01 | 0.01 | 0.00 |
110
 
111
  Generalisation against memorisation: 10% of primes at each bit-width were held out of
112
  training entirely; chain accuracy on them matches the training primes.
 
17
  [Modular Arithmetic Challenge](https://github.com/SAIRcompetition/modular-arithmetic-challenge).
18
 
19
  - **Saturates tiers 1–4** (all primes `< 2³²`): tiers 1–3 = 100%, tier 4 = 99%
20
+ - **Tier 5** (33–64-bit primes) = 0.74 on the public benchmark
21
+ - **overall_accuracy 0.483**, `highest_tier_above_90 = 4`
22
  - Verifiably **generalises to primes never seen in training** (held-out-prime validation
23
  accuracy tracks training accuracy — no memorisation gap)
24
 
 
52
  |---|---|---|---|---|---|---|
53
  | `weights16.pt` | 16-bit | `< 2¹⁶` | 1–3 | 4096 / 4 | ~50M | tiers 1–3 = 1.00 |
54
  | `weights32.pt` | 32-bit | `< 2³²` | 4 | 6144 / 4 | ~114M | tier 4 = 0.99 |
55
+ | `weights64.pt` | 64-bit | `< 2⁶⁴` | 5 | 4096 / 7, residual | ~236M | tier 5 = 0.74 |
56
 
57
  The 64-bit cell needs **depth and residual connections** the narrower cells do not: a 64-bit
58
  modular Horner step hides two long carry chains (the `2t + bit·b` addition and the
 
106
  |---|---|---|---|---|---|---|
107
  | tier 3 (16-bit cell) | 1.00 | 1.00 | 0.98 | 0.74 | 0.06 | 0.00 |
108
  | tier 4 (32-bit cell) | 0.99 | 0.99 | 0.86 | 0.04 | 0.02 | 0.00 |
109
+ | tier 5 (64-bit cell) | 0.74 | 0.71 | 0.46 | 0.01 | 0.01 | 0.00 |
110
 
111
  Generalisation against memorisation: 10% of primes at each bit-width were held out of
112
  training entirely; chain accuracy on them matches the training primes.