Update README.md
README.md CHANGED
@@ -18,6 +18,8 @@ datasets:
 
 This is a Deepseek-inspired model trained on TinyStories dataset, featuring Mixture of Experts (MoE) architecture.
 
+Github: https://github.com/sky-2002/Generative-Modelling/tree/master/deepseek
+
 ## Model Details
 
 - **Model Type**: Autoregressive Language Model with Mixture of Experts