LLM360/K2-V2

hunterhector committed on Jan 26

Commit d7af0a1 (verified) · 1 Parent(s): 5d4b21d

Update README.md

Files changed (1)

README.md CHANGED

@@ -15,6 +15,9 @@ language:
 K2-V2 is our most capable fully open model to date, and one of the strongest open-weight models in its class. It uses a 70B-parameter dense transformer architecture and represents the latest advancement in the LLM360 model family.
+<img src="figures/paper_base_instruct_one_bar_plot.png" width="600" alt="K2-V2 SFT results"/>
 <img src="figures/sft-models.png" width="400" alt="K2-V2 SFT results"/>
 Beyond standard competencies such as factual knowledge and conversational ability, K2-V2 demonstrates strong long-context consistency, deep mathematical understanding, and robust reasoning skills. These capabilities serve as building blocks for sophisticated downstream applications, such as solving complex math problems and executing agentic workflows.