Update README.md: link to paper
Browse files
README.md
CHANGED
|
@@ -32,6 +32,8 @@ pipeline_tag: text-generation
|
|
| 32 |
|
| 33 |
An open weights model combining the intelligence of R1 with the token efficiency of V3.
|
| 34 |
|
|
|
|
|
|
|
| 35 |
[Announcement on X](https://x.com/tngtech/status/1916284566127444468) | [LinkedIn post](https://www.linkedin.com/posts/tng-technology-consulting_on-the-weekend-we-released-deepseek-r1t-chimera-activity-7323008947236290560-Cf2m) | [Try it on OpenRouter](https://openrouter.ai/tngtech/deepseek-r1t-chimera:free)
|
| 36 |
|
| 37 |
|
|
|
|
| 32 |
|
| 33 |
An open weights model combining the intelligence of R1 with the token efficiency of V3.
|
| 34 |
|
| 35 |
+
For details on the construction process and analyses of Chimera model variants, please [read our paper](./paper/assembly_of_experts.pdf).
|
| 36 |
+
|
| 37 |
[Announcement on X](https://x.com/tngtech/status/1916284566127444468) | [LinkedIn post](https://www.linkedin.com/posts/tng-technology-consulting_on-the-weekend-we-released-deepseek-r1t-chimera-activity-7323008947236290560-Cf2m) | [Try it on OpenRouter](https://openrouter.ai/tngtech/deepseek-r1t-chimera:free)
|
| 38 |
|
| 39 |
|