Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@ license: apache-2.0
|
|
| 8 |
pipeline_tag: text-generation
|
| 9 |
---
|
| 10 |
|
| 11 |
-
# K2 Think
|
| 12 |
|
| 13 |
๐ [Blog]() - ๐ [Code](https://github.com/LLM360/Reasoning360) - ๐ข [Project Page](https://k2think.ai)
|
| 14 |
|
|
@@ -16,7 +16,7 @@ pipeline_tag: text-generation
|
|
| 16 |
|
| 17 |
<br>
|
| 18 |
|
| 19 |
-
K2 Think
|
| 20 |
|
| 21 |
# Quickstart
|
| 22 |
|
|
@@ -44,7 +44,7 @@ The chat template is directly inherited from K2-V2-Instruct, with the default `r
|
|
| 44 |
from transformers import pipeline
|
| 45 |
import torch
|
| 46 |
|
| 47 |
-
model_id = "LLM360/K2-Think-
|
| 48 |
|
| 49 |
pipe = pipeline(
|
| 50 |
"text-generation",
|
|
@@ -93,7 +93,7 @@ A more complete summary of evaluation results are reported in our [Blog]()
|
|
| 93 |
|
| 94 |
## Benchmarks (pass\@1, average over 16 runs)
|
| 95 |
|
| 96 |
-
| Domain | Benchmark
|
| 97 |
| ------- | -------------------- | -----------: |
|
| 98 |
| Math | AIME 2025 | 90.42 |
|
| 99 |
| Math | HMMT 2025 | 84.79 |
|
|
@@ -105,7 +105,7 @@ A more complete summary of evaluation results are reported in our [Blog]()
|
|
| 105 |
|
| 106 |
Aggregated across four safety dimensions (**Safety-4**):
|
| 107 |
|
| 108 |
-
K2 Think
|
| 109 |
|
| 110 |
| Safety Surface | Macro-Avg | Risk Level |
|
| 111 |
| ------------------------------- | --------: | ---------- |
|
|
@@ -136,7 +136,7 @@ If you use K2 Think (Jan '26) in your research, please use the following citatio
|
|
| 136 |
|
| 137 |
```bibtex
|
| 138 |
@misc{k2think2026k2think0126,
|
| 139 |
-
title={K2 {T}hink
|
| 140 |
author={K2 Think Team and Taylor W. Killian and Varad Pimpalkhute and Richard Fan and Haonan Li and Chengqian Gao and Ming Shan Hee and Xudong Han and John Maggs and Guowei He and Zhengzhong Liu and Eric P. Xing},
|
| 141 |
year={2026},
|
| 142 |
url={https://tbd.org},
|
|
|
|
| 8 |
pipeline_tag: text-generation
|
| 9 |
---
|
| 10 |
|
| 11 |
+
# K2 Think V2: A Fully-Sovereign Reasoning Model
|
| 12 |
|
| 13 |
๐ [Blog]() - ๐ [Code](https://github.com/LLM360/Reasoning360) - ๐ข [Project Page](https://k2think.ai)
|
| 14 |
|
|
|
|
| 16 |
|
| 17 |
<br>
|
| 18 |
|
| 19 |
+
K2 Think V2 is a 70 billion parameter open-weights general reasoning model with strong performance in competitive mathematical problem solving built on-top of [K2-V2-Instruct](huggingface.co/LLM360/K2-V2-Instruct), comprising a fully sovereign reasoning model.
|
| 20 |
|
| 21 |
# Quickstart
|
| 22 |
|
|
|
|
| 44 |
from transformers import pipeline
|
| 45 |
import torch
|
| 46 |
|
| 47 |
+
model_id = "LLM360/K2-Think-V2"
|
| 48 |
|
| 49 |
pipe = pipeline(
|
| 50 |
"text-generation",
|
|
|
|
| 93 |
|
| 94 |
## Benchmarks (pass\@1, average over 16 runs)
|
| 95 |
|
| 96 |
+
| Domain | Benchmark | K2 Think V2 |
|
| 97 |
| ------- | -------------------- | -----------: |
|
| 98 |
| Math | AIME 2025 | 90.42 |
|
| 99 |
| Math | HMMT 2025 | 84.79 |
|
|
|
|
| 105 |
|
| 106 |
Aggregated across four safety dimensions (**Safety-4**):
|
| 107 |
|
| 108 |
+
K2 Think V2 establishes a robust safety baseline while effectively resolving the "alignment tax" of [previous K2 Think](hf.co/LLM360/K2-Think) releases. Despite strong overall safety performance, there are still opportunities to improve the model with regard to handling sensitive personal information.
|
| 109 |
|
| 110 |
| Safety Surface | Macro-Avg | Risk Level |
|
| 111 |
| ------------------------------- | --------: | ---------- |
|
|
|
|
| 136 |
|
| 137 |
```bibtex
|
| 138 |
@misc{k2think2026k2think0126,
|
| 139 |
+
title={K2 {T}hink {V}2: A {F}ully-{S}overeign {R}easoning {M}odel},
|
| 140 |
author={K2 Think Team and Taylor W. Killian and Varad Pimpalkhute and Richard Fan and Haonan Li and Chengqian Gao and Ming Shan Hee and Xudong Han and John Maggs and Guowei He and Zhengzhong Liu and Eric P. Xing},
|
| 141 |
year={2026},
|
| 142 |
url={https://tbd.org},
|