Update README.md
README.md
CHANGED
@@ -15,6 +15,9 @@ language:
 
 K2-V2 is our most capable fully open model to date, and one of the strongest open-weight models in its class. It uses a 70B-parameter dense transformer architecture and represents the latest advancement in the LLM360 model family.
 
+<img src="figures/paper_base_instruct_one_bar_plot.png" width="600" alt="K2-V2 SFT results"/>
+
+
 <img src="figures/sft-models.png" width="400" alt="K2-V2 SFT results"/>
 
 Beyond standard competencies such as factual knowledge and conversational ability, K2-V2 demonstrates strong long-context consistency, deep mathematical understanding, and robust reasoning skills. These capabilities serve as building blocks for sophisticated downstream applications, such as solving complex math problems and executing agentic workflows.