Commit d7af0a1 (verified) by hunterhector · 1 parent: 5d4b21d

Update README.md

Files changed (1): README.md (+3 -0)
@@ -15,6 +15,9 @@ language:
 
 K2-V2 is our most capable fully open model to date, and one of the strongest open-weight models in its class. It uses a 70B-parameter dense transformer architecture and represents the latest advancement in the LLM360 model family.
 
+<img src="figures/paper_base_instruct_one_bar_plot.png" width="600" alt="K2-V2 SFT results"/>
+
+
 <img src="figures/sft-models.png" width="400" alt="K2-V2 SFT results"/>
 
 Beyond standard competencies such as factual knowledge and conversational ability, K2-V2 demonstrates strong long-context consistency, deep mathematical understanding, and robust reasoning skills. These capabilities serve as building blocks for sophisticated downstream applications, such as solving complex math problems and executing agentic workflows.