Update README.md
Browse files
README.md
CHANGED
|
@@ -1,6 +1,13 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
|
| 5 |
# Experimentation Head
|
| 6 |
Everything led to the RoPE enhancement. You will find the proof in a trainer attached to this with basic instructions on how to activate it in Colab.
|
|
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
+
# What is bert beatrix 200_000?
|
| 5 |
+
This is the first 200,000 sequence length bert. Trained specifically with cantor attention routing as a mechanism. The proof is attached.
|
| 6 |
+
|
| 7 |
+
This is meant to encapsulate the potential for infinite rope through cantor fractal structures. You likely won't even need to install the geofractal repo for the test case.
|
| 8 |
+
|
| 9 |
+
This showcases the direct causal response by testing how well the model can represent data along very long chains.
|
| 10 |
+
|
| 11 |
|
| 12 |
# Experimentation Head
|
| 13 |
Everything led to the RoPE enhancement. You will find the proof in a trainer attached to this with basic instructions on how to activate it in Colab.
|