Update README.md
Browse files
README.md
CHANGED
|
@@ -2,11 +2,11 @@
|
|
| 2 |
license: cc-by-nc-4.0
|
| 3 |
---
|
| 4 |
|
| 5 |
-
# Model Card for **
|
| 6 |
|
| 7 |
**_This is a scaled-up version of the Masked Language Model (MLM) checkpoint used in our preprint for the completions task._**
|
| 8 |
|
| 9 |
-
Trained with larger dataset of multiple initial conditions per system, with mixed periods as well.
|
| 10 |
|
| 11 |
*Panda*: Patched Attention for Nonlinear Dynamics.
|
| 12 |
|
|
|
|
| 2 |
license: cc-by-nc-4.0
|
| 3 |
---
|
| 4 |
|
| 5 |
+
# Model Card for **_Panda_MLM-66M_**
|
| 6 |
|
| 7 |
**_This is a scaled-up version of the Masked Language Model (MLM) checkpoint used in our preprint for the completions task._**
|
| 8 |
|
| 9 |
+
Trained with larger dataset of multiple initial conditions per system, with mixed periods as well. 12 layers with 12 attention heads each.
|
| 10 |
|
| 11 |
*Panda*: Patched Attention for Nonlinear Dynamics.
|
| 12 |
|