Update README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,9 @@ language:
|
|
| 12 |
|
| 13 |
## Model description
|
| 14 |
|
| 15 |
-
**Stockmark-2-100B-Instruct-beta** is a 100B parameter large language model specialized in Japanese.
|
|
|
|
|
|
|
| 16 |
|
| 17 |
See [our blog](???) for the detail.
|
| 18 |
|
|
|
|
| 12 |
|
| 13 |
## Model description
|
| 14 |
|
| 15 |
+
**Stockmark-2-100B-Instruct-beta** is a 100B parameter large language model built from scratch, which is particulary specialized in Japanese. The model was pretrained on approximately 1.5 trillion tokens of data (60% English, 30% Japanese, 10% Code). After pretraining, the model underwent post-training using synthetic data to enhance its instruction-following abilities. The synthetic data was generated using Qwen2.5-32B-Instruct.
|
| 16 |
+
|
| 17 |
+
As a beta release, Stockmark-2-100b-Instruct-beta is still undergoing improvements and evaluations. Feedback and insights from users will help refine future versions.
|
| 18 |
|
| 19 |
See [our blog](???) for the detail.
|
| 20 |
|