Update README.md
Browse files
README.md
CHANGED
|
@@ -42,4 +42,6 @@ All models use tied embeddings (input embeddings = output projection weights). C
|
|
| 42 |
## Additional Notes
|
| 43 |
Phase exploration on embeddings will be replaced soon with stronger version which includes statistical analysis.
|
| 44 |
|
| 45 |
-
Causal ablation codes and models will be added soon.
|
|
|
|
|
|
|
|
|
| 42 |
## Additional Notes
|
| 43 |
Phase exploration on embeddings will be replaced soon with stronger version which includes statistical analysis.
|
| 44 |
|
| 45 |
+
Causal ablation codes and models will be added soon.
|
| 46 |
+
|
| 47 |
+
Cleaned WPT training codes, mixed precision bf16 and fp32 performance test codes will be also added.
|