Update README.md
README.md
CHANGED
@@ -1,6 +1,8 @@
----
-license: mit
-
+---
+license: mit
+tags:
+- oxford-legacy
+---
 
 # Model
 MiniALBERT is a recursive transformer model which uses cross-layer parameter sharing, embedding factorisation, and bottleneck adapters to achieve high parameter efficiency.
@@ -43,4 +45,4 @@ If you use the model, please cite our paper:
 pages={1161--1173},
 year={2023}
 }
-```
+```