Mxode
/

NanoTranslator-XXL

@@ -1,20 +1,22 @@
-# **NanoTranslator-XL**
 [English](README.md) | 简体中文
 ## Introduction
-这是 NanoTranslator 的 **X-Large** 型号，目前仅支持**英译中**。仓库中同时提供了 ONNX 版本的模型。
-| Size | P. | Arch. | Act. |  V.  |  H.  |  I.  |  L.  | A.H. | K.H. | Tie |
-| :--: | :-----: | :--: | :--: | :--: | :-----: | :---: | :------: | :----: | :----: | :--: |
-|  XL  |  100  |  LLaMA  |  SwiGLU  | 16K | 768  | 4096 |  8   |  24  |  8   | True |
-|  L   |  78  | LLaMA | GeGLU  | 16K | 768  | 4096 |  6   |  24  |  8   | True |
-| M2 | 22 | Qwen2 | GeGLU | 4K  | 432  | 2304 |  6   |  24  |  8   | True |
-|  M   |  22  |  LLaMA  |  SwiGLU  | 8K  | 256  | 1408 |  16  |  16  |  4   | True |
-|  S   | 9 | LLaMA | SwiGLU | 4K  | 168  | 896  |  16  |  12  |  4   | True |
-| XS | 2 | LLaMA | SwiGLU | 2K | 96 | 512 | 12 | 12 | 4 | True |
 - **P.** - Parameters (in million)
 - **V.** - vocab size
@@ -41,7 +43,7 @@ Prompt 格式如下：
 import torch
 from transformers import AutoTokenizer, AutoModelForCausalLM
-model_path = 'Mxode/NanoTranslator-XL'
 tokenizer = AutoTokenizer.from_pretrained(model_path)
 model = AutoModelForCausalLM.from_pretrained(model_path)
@@ -78,7 +80,7 @@ print(response)
 根据实际测试，使用 ONNX 模型推理会比直接使用 transformers 推理要**快 2～10 倍**。
-如果希望使用 ONNX 模型，那么你需要手动切换到 [onnx 分支](https://huggingface.co/Mxode/NanoTranslator-XL/tree/onnx)并从本地加载。
 参考文档：

+# **NanoTranslator-XXL**
 [English](README.md) | 简体中文
 ## Introduction
+这是 NanoTranslator 的 **XX-Large** 型号，目前仅支持**英译中**。仓库中同时提供了 ONNX 版本的模型。
+所有模型均收录于 [NanoTranslator Collection](https://huggingface.co/collections/Mxode/nanotranslator-66e1de2ba352e926ae865bd2) 中。
+|  | P. | Arch. | Act. |  V.  |  H.  |  I.  |  L.  | A.H. | K.H. | Tie |
+| :--: | :-----: | :--: | :--: | :--: | :-----: | :---: | :------: | :--: | :--: | :--: |
+|  [XXL](https://huggingface.co/Mxode/NanoTranslator-XXL)  |  100  |  LLaMA  |  SwiGLU  | 16000 | 768  | 4096 |  8   |  24  |  8   | True |
+|  [XL](https://huggingface.co/Mxode/NanoTranslator-XL)  |  78  | LLaMA | GeGLU  | 16000 | 768  | 4096 |  6   |  24  |  8   | True |
+| [L](https://huggingface.co/Mxode/NanoTranslator-L) |  49  | LLaMA | GeGLU  | 16000 | 512  | 2816 |  8   |  16  |  8   | True |
+| [M2](https://huggingface.co/Mxode/NanoTranslator-M2) | 22 | Qwen2 | GeGLU | 4000  | 432  | 2304 |  6   |  24  |  8   | True |
+|  [M](https://huggingface.co/Mxode/NanoTranslator-M)   |  22  |  LLaMA  |  SwiGLU  | 8000  | 256  | 1408 |  16  |  16  |  4   | True |
+|  [S](https://huggingface.co/Mxode/NanoTranslator-S)   | 9 | LLaMA | SwiGLU | 4000  | 168  | 896  |  16  |  12  |  4   | True |
+| [XS](https://huggingface.co/Mxode/NanoTranslator-XS) | 2 | LLaMA | SwiGLU | 2000 | 96 | 512 | 12 | 12 | 4 | True |
 - **P.** - Parameters (in million)
 - **V.** - vocab size
 import torch
 from transformers import AutoTokenizer, AutoModelForCausalLM
+model_path = 'Mxode/NanoTranslator-XXL'
 tokenizer = AutoTokenizer.from_pretrained(model_path)
 model = AutoModelForCausalLM.from_pretrained(model_path)
 根据实际测试，使用 ONNX 模型推理会比直接使用 transformers 推理要**快 2～10 倍**。
+如果希望使用 ONNX 模型，那么你需要手动切换到 [onnx 分支](https://huggingface.co/Mxode/NanoTranslator-XXL/tree/onnx)并从本地加载。
 参考文档：