Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,8 @@ inference:
|
|
| 14 |
|
| 15 |
## Model sheet for AstraQuasar-4B
|
| 16 |
|
| 17 |
-
**AstraQuasar-4B** is our first pre-trained Large Language Model (LLM) for text generation.
|
|
|
|
| 18 |
AstraQuasar-4B-v.0.1 is built upon the foundation of the Phi-2 architecture, with **significant enhancements including an increased number of layers and the innovative introduction of a novel technique known as the duplicate trick.**
|
| 19 |
|
| 20 |
<p align="center">
|
|
|
|
| 14 |
|
| 15 |
## Model sheet for AstraQuasar-4B
|
| 16 |
|
| 17 |
+
**AstraQuasar-4B** is our first pre-trained Large Language Model (LLM) for text generation.
|
| 18 |
+
It is a model with **4B parameters**, whithout embeddings.
|
| 19 |
AstraQuasar-4B-v.0.1 is built upon the foundation of the Phi-2 architecture, with **significant enhancements including an increased number of layers and the innovative introduction of a novel technique known as the duplicate trick.**
|
| 20 |
|
| 21 |
<p align="center">
|