Update README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,7 @@ language:
|
|
| 12 |
|
| 13 |
## Model description
|
| 14 |
|
| 15 |
-
**Stockmark-2-100B-Instruct-beta** is a
|
| 16 |
|
| 17 |
As a beta release, Stockmark-2-100b-Instruct-beta is still undergoing improvements and evaluations. Feedback and insights from users will help refine future versions.
|
| 18 |
|
|
|
|
| 12 |
|
| 13 |
## Model description
|
| 14 |
|
| 15 |
+
**Stockmark-2-100B-Instruct-beta** is a 100-billion-parameter large language model built from scratch, with a particular focus on Japanese. It was pre-trained on approximately 1.5 trillion tokens of data, consisting of 60% English, 30% Japanese, and 10% code. Following pretraining, the model underwent post-training with synthetic data in Japanese to enhance its ability to follow instructions. This synthetic data was generated using Qwen2.5-32B-Instruct.
|
| 16 |
|
| 17 |
As a beta release, Stockmark-2-100b-Instruct-beta is still undergoing improvements and evaluations. Feedback and insights from users will help refine future versions.
|
| 18 |
|