Text Generation
Transformers
Safetensors
English
minimax_m2
conversational
custom_code
8-bit precision
quark
Instructions to use amd/MiniMax-M2.5-NVFP4 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use amd/MiniMax-M2.5-NVFP4 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="amd/MiniMax-M2.5-NVFP4", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("amd/MiniMax-M2.5-NVFP4", trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained("amd/MiniMax-M2.5-NVFP4", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use amd/MiniMax-M2.5-NVFP4 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "amd/MiniMax-M2.5-NVFP4" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "amd/MiniMax-M2.5-NVFP4", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/amd/MiniMax-M2.5-NVFP4
- SGLang
How to use amd/MiniMax-M2.5-NVFP4 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "amd/MiniMax-M2.5-NVFP4" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "amd/MiniMax-M2.5-NVFP4", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "amd/MiniMax-M2.5-NVFP4" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "amd/MiniMax-M2.5-NVFP4", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use amd/MiniMax-M2.5-NVFP4 with Docker Model Runner:
docker model run hf.co/amd/MiniMax-M2.5-NVFP4
add missing file for legal scan
#2
by linzhao-amd - opened
- LICENSE +0 -0
- LICENSE-MODEL +0 -56
- README.md +6 -113
- quark_profile.yaml +177 -0
- tokenizer.json +2 -2
- tokenizer_config.json +243 -240
LICENSE
ADDED
|
File without changes
|
LICENSE-MODEL
DELETED
|
@@ -1,56 +0,0 @@
|
|
| 1 |
-
MINIMAX MODEL LICENSE
|
| 2 |
-
MiniMax-M2.5 Version Release Date: 2026-02-13
|
| 3 |
-
|
| 4 |
-
1. Definitions
|
| 5 |
-
|
| 6 |
-
"Agreement" means the terms and conditions for use, reproduction, distribution and modification of the Model Materials set forth herein.
|
| 7 |
-
|
| 8 |
-
"Licensee" or "you" means you, or your employer or any other person or entity (if you are entering into this Agreement on such person or entity's behalf), of the age required under applicable laws, rules or regulations to provide legal consent and that has legal authority to bind your employer or such other person or entity if you are entering in this Agreement on their behalf.
|
| 9 |
-
|
| 10 |
-
"Model" means the foundational large language models and software and algorithms, including machine-learning model code, trained model weights, inference-enabling code, training-enabling code, fine-tuning enabling code and other elements of the foregoing distributed by MiniMax.
|
| 11 |
-
|
| 12 |
-
"Model Materials" means, collectively, the Model and any source code, scripts, specifications, manuals and documentation accompanying the Model (and any portion thereof) made available under this Agreement.
|
| 13 |
-
|
| 14 |
-
"MiniMax" or "we" means MiniMax AI.
|
| 15 |
-
|
| 16 |
-
2. License Rights and Redistribution
|
| 17 |
-
|
| 18 |
-
a. Grant of Rights. You are granted a non-exclusive, worldwide and royalty-free limited license under MiniMax's intellectual property or other rights owned by MiniMax embodied in the Model Materials to use, reproduce, distribute, copy, create derivative works of, and make modifications to the Model Materials.
|
| 19 |
-
|
| 20 |
-
b. Redistribution and Use.
|
| 21 |
-
|
| 22 |
-
i. If you distribute or make available the Model Materials (or any derivative works thereof), or a product or service that uses any of them, including another AI model, you shall provide a copy of this Agreement with any such the Model Materials or derivative works and cause any modified files to carry prominent notices stating that you changed the files. You may add your own copyright statement to your modifications and may provide additional or different license terms and conditions for use, reproduction, or distribution of your modifications, or for any such derivative works as a whole, provided your use, reproduction, and distribution of the work otherwise complies with the terms and conditions of this Agreement.
|
| 23 |
-
|
| 24 |
-
ii. You must retain in all copies of the Model Materials that you distribute the following attribution notice within a "Notice" text file distributed as a part of such copies: "MiniMax AI model is licensed under the MiniMax Model License, Copyright © MiniMax. All Rights Reserved."
|
| 25 |
-
|
| 26 |
-
iii. Your use of the Model Materials must comply with applicable laws and regulations (including trade compliance laws and regulations) and adhere to the Prohibited Uses Policy for the Model Materials, which is hereby incorporated by reference into this Agreement.
|
| 27 |
-
|
| 28 |
-
3. Disclaimer of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE MODEL MATERIALS AND ANY OUTPUT AND RESULTS THEREFROM ARE PROVIDED ON AN "AS IS" BASIS, WITHOUT WARRANTIES OF ANY KIND, AND MINIMAX DISCLAIMS ALL WARRANTIES OF ANY KIND, BOTH EXPRESS AND IMPLIED, INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR DETERMINING THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE MODEL MATERIALS AND ASSUME ANY RISKS ASSOCIATED WITH YOUR USE OF THE MODEL MATERIALS AND ANY OUTPUT AND RESULTS.
|
| 29 |
-
|
| 30 |
-
4. Limitation of Liability. IN NO EVENT WILL MINIMAX OR ITS AFFILIATES BE LIABLE UNDER ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY, OR OTHERWISE, ARISING OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT, SPECIAL, CONSEQUENTIAL, INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF MINIMAX OR ITS AFFILIATES HAVE BEEN ADVISED OF THE POSSIBILITY OF ANY OF THE FOREGOING.
|
| 31 |
-
|
| 32 |
-
5. Intellectual Property.
|
| 33 |
-
|
| 34 |
-
a. No trademark license is granted to use the trade names, trademarks, service marks, or product names of MiniMax, except as required to fulfill notice requirements under this Agreement.
|
| 35 |
-
|
| 36 |
-
b. Subject to MiniMax's ownership of the Model Materials and derivatives made by or for MiniMax, with respect to any derivative works and modifications of the Model Materials that are made by you, as between you and MiniMax, you are and will be the owner of such derivative works and modifications.
|
| 37 |
-
|
| 38 |
-
c. If you institute litigation or other proceedings against MiniMax or any entity (including a cross-claim or counterclaim in a lawsuit) alleging that the Model Materials or outputs or results, or any portion of any of the foregoing, constitutes infringement of intellectual property or other rights owned or licensable by you, then any licenses granted to you under this Agreement shall terminate as of the date such litigation or claim is filed or instituted. You will indemnify and hold harmless MiniMax from and against any claim by any third party arising out of or related to your use or distribution of the Model Materials.
|
| 39 |
-
|
| 40 |
-
6. Term and Termination. The term of this Agreement will commence upon your acceptance of this Agreement or access to the Model Materials and will continue in full force and effect until terminated in accordance with the terms and conditions herein. MiniMax may terminate this Agreement if you are in breach of any term or condition of this Agreement. Upon termination of this Agreement, you shall delete and cease use of the Model Materials. Sections 3, 4 and 7 shall survive the termination of this Agreement.
|
| 41 |
-
|
| 42 |
-
7. Governing Law and Jurisdiction. This Agreement will be governed and construed under the laws of Singapore without regard to choice of law principles, and the UN Convention on Contracts for the International Sale of Goods does not apply to this Agreement. Any dispute arising out of or in connection with this Agreement, including any question regarding its existence, validity or termination, shall be referred to and finally resolved by arbitration administered by the Singapore International Arbitration Centre ("SIAC") in accordance with the Arbitration Rules of the Singapore International Arbitration Centre ("SIAC Rules") for the time being in force, which rules are deemed to be incorporated by reference in this clause.
|
| 43 |
-
|
| 44 |
-
Prohibited Uses Policy
|
| 45 |
-
|
| 46 |
-
You agree you will not use, or allow others to use, the Models or any derivatives of the Models to:
|
| 47 |
-
|
| 48 |
-
1. Violate any applicable federal, state, local, or international law or regulation, or infringe upon the lawful rights or interests of any third party.
|
| 49 |
-
2. Assist with, engage in or otherwise support any military purpose.
|
| 50 |
-
3. Exploit, harm, or attempt to exploit or harm minors in any way.
|
| 51 |
-
4. Generate or disseminate false or misleading information with the intent to cause harm.
|
| 52 |
-
5. Generate or disseminate content prohibited by applicable laws or regulations.
|
| 53 |
-
6. Generate or disseminate personally identifiable information without proper authorization or for unlawful or unreasonable purposes.
|
| 54 |
-
7. Defame, disparage, harass, or cause harm to any individual or entity.
|
| 55 |
-
8. Conduct fully automated decision-making that adversely affects an individual's legal rights or creates or modifies a binding, enforceable obligation.
|
| 56 |
-
9. Promote discrimination, hate speech, or harmful behavior against individuals or groups based on race or ethnic origin, religion, disability, age, nationality and national origin, veteran status, sexual orientation, gender or gender identity, caste, immigration status, or any other characteristic that is associated with systemic discrimination or marginalization.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
README.md
CHANGED
|
@@ -1,118 +1,11 @@
|
|
| 1 |
---
|
| 2 |
-
base_model:
|
| 3 |
-
- MiniMaxAI/MiniMax-M2.5
|
| 4 |
-
language:
|
| 5 |
-
- en
|
| 6 |
-
library_name: transformers
|
| 7 |
license: other
|
| 8 |
-
license_name:
|
| 9 |
-
license_link:
|
| 10 |
---
|
| 11 |
|
| 12 |
-
|
| 13 |
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
- **Supported Hardware Microarchitecture:** AMD MI300/MI350/MI355 (emulation)
|
| 18 |
-
- **ROCm:** 7.2.2
|
| 19 |
-
- **PyTorch**: 2.10.0
|
| 20 |
-
- **Transformers**: 5.2.0
|
| 21 |
-
- **Operating System(s):** Linux
|
| 22 |
-
- **Inference Engine:** [SGLang](https://docs.sglang.ai/)/[vLLM](https://docs.vllm.ai/en/latest/)
|
| 23 |
-
- **Model Optimizer:** [AMD-Quark](https://quark.docs.amd.com/latest/index.html) (v0.12)
|
| 24 |
-
- **Quantized layers:** `experts`
|
| 25 |
-
- **Weight quantization:** NVFP4, Static
|
| 26 |
-
- **Activation quantization:** NVFP4, Dynamic
|
| 27 |
-
|
| 28 |
-
|
| 29 |
-
# Model Quantization
|
| 30 |
-
|
| 31 |
-
The model was quantized from [MiniMaxAI/MiniMax-M2.5](https://huggingface.co/MiniMaxAI/MiniMax-M2.5) by using [AMD-Quark](https://quark.docs.amd.com/latest/index.html). The weights and activations are quantized to NVFP4.
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
**Quantization scripts:**
|
| 35 |
-
```
|
| 36 |
-
cd Quark/examples/torch/language_modeling/llm_ptq/
|
| 37 |
-
export exclude_layers="lm_head *block_sparse_moe.gate* *self_attn*"
|
| 38 |
-
export CUDA_VISIBLE_DEVICES=0,1,2,3
|
| 39 |
-
python3 quantize_quark.py \
|
| 40 |
-
--model_dir MiniMaxAI/MiniMax-M2.5 \
|
| 41 |
-
--quant_scheme nvfp4 \
|
| 42 |
-
--num_calib_data 128 \
|
| 43 |
-
--exclude_layers $exclude_layers \
|
| 44 |
-
--model_export hf_format \
|
| 45 |
-
--trust_remote_code \
|
| 46 |
-
--multi_gpu \
|
| 47 |
-
--output_dir amd/MiniMax-M2.5-NVFP4
|
| 48 |
-
|
| 49 |
-
```
|
| 50 |
-
For further details or issues, please refer to the AMD-Quark documentation or contact the respective developers.
|
| 51 |
-
|
| 52 |
-
# Deployment
|
| 53 |
-
## Use with vLLM/SGLang
|
| 54 |
-
|
| 55 |
-
This model can be deployed efficiently using the [vLLM](https://docs.vllm.ai/en/latest/) and [SGLang](https://docs.sglang.ai/) backends.
|
| 56 |
-
|
| 57 |
-
## Evaluation
|
| 58 |
-
The model was evaluated on gsm8k benchmarks using the [vllm](https://github.com/vllm-project/vllm/tree/v0.13.0) framework.
|
| 59 |
-
|
| 60 |
-
### Accuracy
|
| 61 |
-
|
| 62 |
-
<table>
|
| 63 |
-
<tr>
|
| 64 |
-
<td><strong>Benchmark</strong>
|
| 65 |
-
</td>
|
| 66 |
-
<td><strong>MiniMaxAI/MiniMax-M2.5 </strong>
|
| 67 |
-
</td>
|
| 68 |
-
<td><strong>amd/MiniMax-M2.5-NVFP4(this model)</strong>
|
| 69 |
-
</td>
|
| 70 |
-
<td><strong>Recovery</strong>
|
| 71 |
-
</td>
|
| 72 |
-
</tr>
|
| 73 |
-
<tr>
|
| 74 |
-
<td>gsm8k (flexible-extract)
|
| 75 |
-
</td>
|
| 76 |
-
<td>91.51
|
| 77 |
-
</td>
|
| 78 |
-
<td>91.21
|
| 79 |
-
</td>
|
| 80 |
-
<td>99.67%
|
| 81 |
-
</td>
|
| 82 |
-
</tr>
|
| 83 |
-
</table>
|
| 84 |
-
|
| 85 |
-
|
| 86 |
-
|
| 87 |
-
### Reproduction
|
| 88 |
-
|
| 89 |
-
The GSM8K result was obtained using the `lm-evaluation-harness` framework, based on the Docker image `rocm/vllm-dev:nightly_main_20260603`.
|
| 90 |
-
|
| 91 |
-
Install the lm-eval `(Version: 0.4.12)` in container first.
|
| 92 |
-
```
|
| 93 |
-
pip install lm-eval
|
| 94 |
-
pip install lm-eval[api]
|
| 95 |
-
```
|
| 96 |
-
#### Launching server
|
| 97 |
-
```
|
| 98 |
-
VLLM_ROCM_USE_AITER=1 vllm serve amd/MiniMax-M2.5-NVFP4/ \
|
| 99 |
-
--tensor-parallel-size 2 \
|
| 100 |
-
--tool-call-parser minimax_m2 \
|
| 101 |
-
--reasoning-parser minimax_m2 \
|
| 102 |
-
--enable-auto-tool-choice \
|
| 103 |
-
--trust-remote-code
|
| 104 |
-
```
|
| 105 |
-
#### Evaluating model in a new terminal
|
| 106 |
-
```
|
| 107 |
-
lm_eval \
|
| 108 |
-
--model local-completions \
|
| 109 |
-
--model_args "model=amd/MiniMax-M2.5-NVFP4/,base_url=http://127.0.0.1:8000/v1/completions,tokenized_requests=False,tokenizer_backend=None,num_concurrent=32" \
|
| 110 |
-
--gen_kwargs temperature=1.0,top_p=0.95 \
|
| 111 |
-
--tasks gsm8k \
|
| 112 |
-
--num_fewshot 8 \
|
| 113 |
-
--batch_size 1
|
| 114 |
-
```
|
| 115 |
-
|
| 116 |
-
|
| 117 |
-
# License
|
| 118 |
-
Modifications Copyright(c) 2026 Advanced Micro Devices, Inc. All rights reserved.
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: other
|
| 3 |
+
license_name: other
|
| 4 |
+
license_link: LICENSE
|
| 5 |
---
|
| 6 |
|
| 7 |
+
**Disclaimer**
|
| 8 |
|
| 9 |
+
This model is provided for research and evaluation purposes only.
|
| 10 |
+
Quantization may introduce accuracy or behavioral differences compared to the original model.
|
| 11 |
+
Users are responsible for validating the model in their own environments and complying with the original model license.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
quark_profile.yaml
ADDED
|
@@ -0,0 +1,177 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Quark Profiling Results
|
| 2 |
+
|
| 3 |
+
memory_usage:
|
| 4 |
+
- step: "Start"
|
| 5 |
+
timestamp: 1780202462.5395215
|
| 6 |
+
relative_time_secs: 0.0
|
| 7 |
+
cpu_memory_mb: 3105.29
|
| 8 |
+
gpu_memory_mb: 3107.88
|
| 9 |
+
disk_read_mb: 0.0
|
| 10 |
+
disk_write_mb: 0.0
|
| 11 |
+
- step: "Model Loading Start"
|
| 12 |
+
timestamp: 1780202463.1930308
|
| 13 |
+
relative_time_secs: 0.6535093784332275
|
| 14 |
+
cpu_memory_mb: 3105.29
|
| 15 |
+
gpu_memory_mb: 3107.88
|
| 16 |
+
disk_read_mb: 0.0
|
| 17 |
+
disk_write_mb: 0.0
|
| 18 |
+
- step: "Model Loading End"
|
| 19 |
+
timestamp: 1780202557.2720962
|
| 20 |
+
relative_time_secs: 94.7325747013092
|
| 21 |
+
cpu_memory_mb: 4740.59
|
| 22 |
+
gpu_memory_mb: 223242.2
|
| 23 |
+
disk_read_mb: 0.0
|
| 24 |
+
disk_write_mb: 0.0
|
| 25 |
+
- step: "Dataset Loading Start"
|
| 26 |
+
timestamp: 1780202558.3775353
|
| 27 |
+
relative_time_secs: 95.8380138874054
|
| 28 |
+
cpu_memory_mb: 4855.08
|
| 29 |
+
gpu_memory_mb: 223242.2
|
| 30 |
+
disk_read_mb: 0.0
|
| 31 |
+
disk_write_mb: 0.0
|
| 32 |
+
- step: "Dataset Loading End"
|
| 33 |
+
timestamp: 1780202562.9039524
|
| 34 |
+
relative_time_secs: 100.36443090438843
|
| 35 |
+
cpu_memory_mb: 4936.2
|
| 36 |
+
gpu_memory_mb: 223244.2
|
| 37 |
+
disk_read_mb: 0.0
|
| 38 |
+
disk_write_mb: 0.0
|
| 39 |
+
- step: "Model Quantization Start"
|
| 40 |
+
timestamp: 1780202563.5159075
|
| 41 |
+
relative_time_secs: 100.97638607025146
|
| 42 |
+
cpu_memory_mb: 4936.2
|
| 43 |
+
gpu_memory_mb: 223244.2
|
| 44 |
+
disk_read_mb: 0.0
|
| 45 |
+
disk_write_mb: 0.0
|
| 46 |
+
- step: "Model Preparation Start"
|
| 47 |
+
timestamp: 1780202564.1855054
|
| 48 |
+
relative_time_secs: 101.64598393440247
|
| 49 |
+
cpu_memory_mb: 4936.2
|
| 50 |
+
gpu_memory_mb: 223244.2
|
| 51 |
+
disk_read_mb: 0.0
|
| 52 |
+
disk_write_mb: 0.0
|
| 53 |
+
- step: "Model Preparation End"
|
| 54 |
+
timestamp: 1780202616.292561
|
| 55 |
+
relative_time_secs: 153.75303959846497
|
| 56 |
+
cpu_memory_mb: 7562.85
|
| 57 |
+
gpu_memory_mb: 223569.68
|
| 58 |
+
disk_read_mb: 0.0
|
| 59 |
+
disk_write_mb: 0.0
|
| 60 |
+
- step: "Advanced Algorithms Start"
|
| 61 |
+
timestamp: 1780202616.9882076
|
| 62 |
+
relative_time_secs: 154.4486861228943
|
| 63 |
+
cpu_memory_mb: 7562.85
|
| 64 |
+
gpu_memory_mb: 223569.68
|
| 65 |
+
disk_read_mb: 0.0
|
| 66 |
+
disk_write_mb: 0.0
|
| 67 |
+
- step: "Advanced Algorithms End"
|
| 68 |
+
timestamp: 1780202617.6359434
|
| 69 |
+
relative_time_secs: 155.096421957016
|
| 70 |
+
cpu_memory_mb: 7562.85
|
| 71 |
+
gpu_memory_mb: 223569.68
|
| 72 |
+
disk_read_mb: 0.0
|
| 73 |
+
disk_write_mb: 0.0
|
| 74 |
+
- step: "Calibration Start"
|
| 75 |
+
timestamp: 1780202618.4098058
|
| 76 |
+
relative_time_secs: 155.87028431892395
|
| 77 |
+
cpu_memory_mb: 7562.85
|
| 78 |
+
gpu_memory_mb: 223569.68
|
| 79 |
+
disk_read_mb: 0.0
|
| 80 |
+
disk_write_mb: 0.0
|
| 81 |
+
- step: "Calibration End"
|
| 82 |
+
timestamp: 1780209356.3565843
|
| 83 |
+
relative_time_secs: 6893.817062854767
|
| 84 |
+
cpu_memory_mb: 18700.47
|
| 85 |
+
gpu_memory_mb: 400402.3
|
| 86 |
+
disk_read_mb: 0.0
|
| 87 |
+
disk_write_mb: 4381.69
|
| 88 |
+
- step: "Model Quantization End"
|
| 89 |
+
timestamp: 1780209705.8870807
|
| 90 |
+
relative_time_secs: 7243.347559213638
|
| 91 |
+
cpu_memory_mb: 107398.29
|
| 92 |
+
gpu_memory_mb: 399905.61
|
| 93 |
+
disk_read_mb: 0.0
|
| 94 |
+
disk_write_mb: 4381.69
|
| 95 |
+
- step: "Freeze Model Start"
|
| 96 |
+
timestamp: 1780209706.5831566
|
| 97 |
+
relative_time_secs: 7244.043635129929
|
| 98 |
+
cpu_memory_mb: 107398.29
|
| 99 |
+
gpu_memory_mb: 399905.61
|
| 100 |
+
disk_read_mb: 0.0
|
| 101 |
+
disk_write_mb: 4381.69
|
| 102 |
+
- step: "Freeze Model End"
|
| 103 |
+
timestamp: 1780209720.9122105
|
| 104 |
+
relative_time_secs: 7258.372689008713
|
| 105 |
+
cpu_memory_mb: 107725.61
|
| 106 |
+
gpu_memory_mb: 399905.61
|
| 107 |
+
disk_read_mb: 0.0
|
| 108 |
+
disk_write_mb: 4381.69
|
| 109 |
+
- step: "Export HF Safetensors Start"
|
| 110 |
+
timestamp: 1780209721.5354578
|
| 111 |
+
relative_time_secs: 7258.995936393738
|
| 112 |
+
cpu_memory_mb: 107725.61
|
| 113 |
+
gpu_memory_mb: 399905.61
|
| 114 |
+
disk_read_mb: 0.0
|
| 115 |
+
disk_write_mb: 4381.69
|
| 116 |
+
- step: "Export HF Safetensors End"
|
| 117 |
+
timestamp: 1780209935.171702
|
| 118 |
+
relative_time_secs: 7472.632180452347
|
| 119 |
+
cpu_memory_mb: 111636.27
|
| 120 |
+
gpu_memory_mb: 412467.14
|
| 121 |
+
disk_read_mb: 0.0
|
| 122 |
+
disk_write_mb: 132630.66
|
| 123 |
+
- step: "Model Evaluation Start"
|
| 124 |
+
timestamp: 1780209935.8758442
|
| 125 |
+
relative_time_secs: 7473.336322784424
|
| 126 |
+
cpu_memory_mb: 111636.27
|
| 127 |
+
gpu_memory_mb: 412467.14
|
| 128 |
+
disk_read_mb: 0.0
|
| 129 |
+
disk_write_mb: 132630.66
|
| 130 |
+
- step: "Model Evaluation End"
|
| 131 |
+
timestamp: 1780209957.401115
|
| 132 |
+
relative_time_secs: 7494.8615934848785
|
| 133 |
+
cpu_memory_mb: 111695.7
|
| 134 |
+
gpu_memory_mb: 412973.53
|
| 135 |
+
disk_read_mb: 0.01
|
| 136 |
+
disk_write_mb: 132630.66
|
| 137 |
+
- step: "End"
|
| 138 |
+
timestamp: 1780209958.0559528
|
| 139 |
+
relative_time_secs: 7495.5164313316345
|
| 140 |
+
cpu_memory_mb: 111695.73
|
| 141 |
+
gpu_memory_mb: 412973.53
|
| 142 |
+
disk_read_mb: 0.03
|
| 143 |
+
disk_write_mb: 132630.66
|
| 144 |
+
|
| 145 |
+
# Summary Metrics
|
| 146 |
+
total_quantization_time_seconds: 7495.5165
|
| 147 |
+
peak_memory_mb: 111695.73
|
| 148 |
+
peak_gpu_memory_mb: 412973.53
|
| 149 |
+
total_disk_read_mb: 0.03
|
| 150 |
+
total_disk_write_mb: 132630.66
|
| 151 |
+
|
| 152 |
+
# Metric Definitions:
|
| 153 |
+
#
|
| 154 |
+
# Checkpoint Metrics (per record):
|
| 155 |
+
# - step: Name of the profiling checkpoint. Common steps include:
|
| 156 |
+
# - "Start": Initial state when profiling begins
|
| 157 |
+
# - "Model Loaded": After loading the ONNX model into memory
|
| 158 |
+
# - "Pre-process Start/End": Before and after model preprocessing
|
| 159 |
+
# - "Calibration Start/End": Before and after calibration data collection
|
| 160 |
+
# - "Quantization (MatMulNBits) Start/End": MatMulNBits quantization phase
|
| 161 |
+
# - "Quantization (Static) Start/End": Static quantization phase
|
| 162 |
+
# - "Post-process Start/End": Before and after post-processing
|
| 163 |
+
# - "Fast Finetune Start/End": Before and after fast finetuning (if enabled)
|
| 164 |
+
# - timestamp: Unix timestamp (seconds since epoch) when this measurement was taken. Useful for correlating with external logs or events.
|
| 165 |
+
# - relative_time_secs: Time elapsed (in seconds) since the "Start" step. Useful for understanding the duration of each phase relative to the beginning of profiling.
|
| 166 |
+
# - cpu_memory_mb: Current Resident Set Size (RSS) in megabytes at this step. This includes memory from the main process and all child processes. RSS represents the portion of memory held in RAM (not swapped out).
|
| 167 |
+
# - gpu_memory_mb: Current GPU memory usage in megabytes. This represents actual GPU memory used by the process, including allocations from PyTorch, ONNX Runtime, TensorRT, and other frameworks. Only available when PyTorch with CUDA/ROCm is installed and GPU is available.
|
| 168 |
+
# - disk_read_mb: Cumulative disk bytes read (in megabytes) since the start of profiling. Measured relative to the baseline captured at the 'Start' checkpoint, including I/O from the main process and all child processes. Only available when psutil is installed and the OS exposes per-process I/O counters (Linux /proc/<pid>/io, Windows; not available on macOS without root).
|
| 169 |
+
# - disk_write_mb: Cumulative disk bytes written (in megabytes) since the start of profiling. Measured relative to the baseline captured at the 'Start' checkpoint, including I/O from the main process and all child processes. Only available when psutil is installed and the OS exposes per-process I/O counters (Linux /proc/<pid>/io, Windows; not available on macOS without root).
|
| 170 |
+
#
|
| 171 |
+
# Summary Metrics (overall):
|
| 172 |
+
# - total_quantization_time_seconds: Total elapsed time (in seconds) from the start of profiling to the end of the quantization process.
|
| 173 |
+
# - peak_memory_mb: Peak resident set size (RSS) in megabytes for the main process during the entire profiling session. On Linux, this is read from VmHWM (high water mark) in /proc/<pid>/status. On Windows, this is the peak working set size. This metric may not be available on all platforms.
|
| 174 |
+
# - peak_gpu_memory_mb: Peak GPU memory usage in megabytes during the entire profiling session. This is the maximum GPU memory used, including allocations from PyTorch, ONNX Runtime, TensorRT, and other frameworks. Only available when PyTorch with CUDA/ROCm is installed and GPU is available.
|
| 175 |
+
# - total_disk_read_mb: Total disk bytes read (in megabytes) during the entire profiling session. Computed as the difference between the final and baseline cumulative read counters, including I/O from the main process and all child processes. Only available when psutil is installed and the OS exposes per-process I/O counters (Linux /proc/<pid>/io, Windows; not available on macOS without root).
|
| 176 |
+
# - total_disk_write_mb: Total disk bytes written (in megabytes) during the entire profiling session. Computed as the difference between the final and baseline cumulative write counters, including I/O from the main process and all child processes. Only available when psutil is installed and the OS exposes per-process I/O counters (Linux /proc/<pid>/io, Windows; not available on macOS without root).
|
| 177 |
+
# - peak_cache_dir_disk_usage_mb: Highest peak increase in disk usage (in megabytes) among all cache directories created during the profiling session, relative to each cache directory's size when monitoring started. Sampled every 1 second by recursively summing file sizes with os.scandir().
|
tokenizer.json
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ad01040a70bcef384b88cf9a00f75c1b73106acf4ab506fe04d75a89591111dc
|
| 3 |
+
size 15523019
|
tokenizer_config.json
CHANGED
|
@@ -1,495 +1,498 @@
|
|
| 1 |
{
|
|
|
|
| 2 |
"added_tokens_decoder": {
|
| 3 |
-
|
| 4 |
"content": "]!p~[",
|
| 5 |
"lstrip": false,
|
| 6 |
"normalized": false,
|
| 7 |
"rstrip": false,
|
| 8 |
"single_word": false,
|
| 9 |
"special": true
|
| 10 |
-
|
| 11 |
-
|
| 12 |
"content": "<fim_prefix>",
|
| 13 |
"lstrip": false,
|
| 14 |
"normalized": false,
|
| 15 |
"rstrip": false,
|
| 16 |
"single_word": false,
|
| 17 |
"special": true
|
| 18 |
-
|
| 19 |
-
|
| 20 |
"content": "<fim_middle>",
|
| 21 |
"lstrip": false,
|
| 22 |
"normalized": false,
|
| 23 |
"rstrip": false,
|
| 24 |
"single_word": false,
|
| 25 |
"special": true
|
| 26 |
-
|
| 27 |
-
|
| 28 |
"content": "<fim_suffix>",
|
| 29 |
"lstrip": false,
|
| 30 |
"normalized": false,
|
| 31 |
"rstrip": false,
|
| 32 |
"single_word": false,
|
| 33 |
"special": true
|
| 34 |
-
|
| 35 |
-
|
| 36 |
"content": "<fim_pad>",
|
| 37 |
"lstrip": false,
|
| 38 |
"normalized": false,
|
| 39 |
"rstrip": false,
|
| 40 |
"single_word": false,
|
| 41 |
"special": true
|
| 42 |
-
|
| 43 |
-
|
| 44 |
"content": "<reponame>",
|
| 45 |
"lstrip": false,
|
| 46 |
"normalized": false,
|
| 47 |
"rstrip": false,
|
| 48 |
"single_word": false,
|
| 49 |
"special": true
|
| 50 |
-
|
| 51 |
-
|
| 52 |
"content": "<filename>",
|
| 53 |
"lstrip": false,
|
| 54 |
"normalized": false,
|
| 55 |
"rstrip": false,
|
| 56 |
"single_word": false,
|
| 57 |
"special": true
|
| 58 |
-
|
| 59 |
-
|
| 60 |
"content": "<gh_stars>",
|
| 61 |
"lstrip": false,
|
| 62 |
"normalized": false,
|
| 63 |
"rstrip": false,
|
| 64 |
"single_word": false,
|
| 65 |
"special": true
|
| 66 |
-
|
| 67 |
-
|
| 68 |
"content": "<issue_start>",
|
| 69 |
"lstrip": false,
|
| 70 |
"normalized": false,
|
| 71 |
"rstrip": false,
|
| 72 |
"single_word": false,
|
| 73 |
"special": true
|
| 74 |
-
|
| 75 |
-
|
| 76 |
"content": "<issue_comment>",
|
| 77 |
"lstrip": false,
|
| 78 |
"normalized": false,
|
| 79 |
"rstrip": false,
|
| 80 |
"single_word": false,
|
| 81 |
"special": true
|
| 82 |
-
|
| 83 |
-
|
| 84 |
"content": "<issue_closed>",
|
| 85 |
"lstrip": false,
|
| 86 |
"normalized": false,
|
| 87 |
"rstrip": false,
|
| 88 |
"single_word": false,
|
| 89 |
"special": true
|
| 90 |
-
|
| 91 |
-
|
| 92 |
-
|
| 93 |
-
|
| 94 |
-
|
| 95 |
-
|
| 96 |
-
|
| 97 |
-
|
| 98 |
-
|
| 99 |
-
|
| 100 |
"content": "<jupyter_text>",
|
| 101 |
"lstrip": false,
|
| 102 |
"normalized": false,
|
| 103 |
"rstrip": false,
|
| 104 |
"single_word": false,
|
| 105 |
"special": true
|
| 106 |
-
|
| 107 |
-
|
| 108 |
"content": "<jupyter_code>",
|
| 109 |
"lstrip": false,
|
| 110 |
"normalized": false,
|
| 111 |
"rstrip": false,
|
| 112 |
"single_word": false,
|
| 113 |
"special": true
|
| 114 |
-
|
| 115 |
-
|
| 116 |
"content": "<jupyter_output>",
|
| 117 |
"lstrip": false,
|
| 118 |
"normalized": false,
|
| 119 |
"rstrip": false,
|
| 120 |
"single_word": false,
|
| 121 |
"special": true
|
| 122 |
-
|
| 123 |
-
|
| 124 |
-
|
| 125 |
-
|
| 126 |
-
|
| 127 |
-
|
| 128 |
-
|
| 129 |
-
|
| 130 |
-
|
| 131 |
-
|
| 132 |
"content": "<commit_before>",
|
| 133 |
"lstrip": false,
|
| 134 |
"normalized": false,
|
| 135 |
"rstrip": false,
|
| 136 |
"single_word": false,
|
| 137 |
"special": true
|
| 138 |
-
|
| 139 |
-
|
| 140 |
"content": "<commit_msg>",
|
| 141 |
"lstrip": false,
|
| 142 |
"normalized": false,
|
| 143 |
"rstrip": false,
|
| 144 |
"single_word": false,
|
| 145 |
"special": true
|
| 146 |
-
|
| 147 |
-
|
| 148 |
"content": "<commit_after>",
|
| 149 |
"lstrip": false,
|
| 150 |
"normalized": false,
|
| 151 |
"rstrip": false,
|
| 152 |
"single_word": false,
|
| 153 |
"special": true
|
| 154 |
-
|
| 155 |
-
|
| 156 |
"content": "]~b]",
|
| 157 |
"lstrip": false,
|
| 158 |
"normalized": false,
|
| 159 |
"rstrip": false,
|
| 160 |
"single_word": false,
|
| 161 |
"special": true
|
| 162 |
-
|
| 163 |
-
|
| 164 |
"content": "[e~[",
|
| 165 |
"lstrip": false,
|
| 166 |
"normalized": false,
|
| 167 |
"rstrip": false,
|
| 168 |
"single_word": false,
|
| 169 |
"special": true
|
| 170 |
-
|
| 171 |
-
|
| 172 |
"content": "]!d~[",
|
| 173 |
"lstrip": false,
|
| 174 |
"normalized": false,
|
| 175 |
"rstrip": false,
|
| 176 |
"single_word": false,
|
| 177 |
"special": true
|
| 178 |
-
|
| 179 |
-
|
| 180 |
"content": "<function_call>",
|
| 181 |
"lstrip": false,
|
| 182 |
"normalized": false,
|
| 183 |
"rstrip": false,
|
| 184 |
"single_word": false,
|
| 185 |
"special": true
|
| 186 |
-
|
| 187 |
-
|
| 188 |
"content": "<code_interpreter>",
|
| 189 |
"lstrip": false,
|
| 190 |
"normalized": false,
|
| 191 |
"rstrip": false,
|
| 192 |
"single_word": false,
|
| 193 |
"special": true
|
| 194 |
-
|
| 195 |
-
|
| 196 |
"content": "]<]speech[>[",
|
| 197 |
"lstrip": false,
|
| 198 |
"normalized": false,
|
| 199 |
"rstrip": false,
|
| 200 |
"single_word": false,
|
| 201 |
"special": true
|
| 202 |
-
|
| 203 |
-
|
| 204 |
"content": "]<]image[>[",
|
| 205 |
"lstrip": false,
|
| 206 |
"normalized": false,
|
| 207 |
"rstrip": false,
|
| 208 |
"single_word": false,
|
| 209 |
"special": true
|
| 210 |
-
|
| 211 |
-
|
| 212 |
"content": "]<]video[>[",
|
| 213 |
"lstrip": false,
|
| 214 |
"normalized": false,
|
| 215 |
"rstrip": false,
|
| 216 |
"single_word": false,
|
| 217 |
"special": true
|
| 218 |
-
|
| 219 |
-
|
| 220 |
"content": "]<]start of speech[>[",
|
| 221 |
"lstrip": false,
|
| 222 |
"normalized": false,
|
| 223 |
"rstrip": false,
|
| 224 |
"single_word": false,
|
| 225 |
"special": true
|
| 226 |
-
|
| 227 |
-
|
| 228 |
"content": "]<]end of speech[>[",
|
| 229 |
"lstrip": false,
|
| 230 |
"normalized": false,
|
| 231 |
"rstrip": false,
|
| 232 |
"single_word": false,
|
| 233 |
"special": true
|
| 234 |
-
|
| 235 |
-
|
| 236 |
"content": "]<]start of image[>[",
|
| 237 |
"lstrip": false,
|
| 238 |
"normalized": false,
|
| 239 |
"rstrip": false,
|
| 240 |
"single_word": false,
|
| 241 |
"special": true
|
| 242 |
-
|
| 243 |
-
|
| 244 |
"content": "]<]end of image[>[",
|
| 245 |
"lstrip": false,
|
| 246 |
"normalized": false,
|
| 247 |
"rstrip": false,
|
| 248 |
"single_word": false,
|
| 249 |
"special": true
|
| 250 |
-
|
| 251 |
-
|
| 252 |
"content": "]<]start of video[>[",
|
| 253 |
"lstrip": false,
|
| 254 |
"normalized": false,
|
| 255 |
"rstrip": false,
|
| 256 |
"single_word": false,
|
| 257 |
"special": true
|
| 258 |
-
|
| 259 |
-
|
| 260 |
"content": "]<]end of video[>[",
|
| 261 |
"lstrip": false,
|
| 262 |
"normalized": false,
|
| 263 |
"rstrip": false,
|
| 264 |
"single_word": false,
|
| 265 |
"special": true
|
| 266 |
-
|
| 267 |
-
|
| 268 |
"content": "]<]vision pad[>[",
|
| 269 |
"lstrip": false,
|
| 270 |
"normalized": false,
|
| 271 |
"rstrip": false,
|
| 272 |
"single_word": false,
|
| 273 |
"special": true
|
| 274 |
-
|
| 275 |
-
|
| 276 |
"content": "]~!b[",
|
| 277 |
"lstrip": false,
|
| 278 |
"normalized": false,
|
| 279 |
"rstrip": false,
|
| 280 |
"single_word": false,
|
| 281 |
"special": true
|
| 282 |
-
|
| 283 |
-
|
| 284 |
"content": "<jupyter_error>",
|
| 285 |
"lstrip": false,
|
| 286 |
"normalized": false,
|
| 287 |
"rstrip": false,
|
| 288 |
"single_word": false,
|
| 289 |
"special": true
|
| 290 |
-
|
| 291 |
-
|
| 292 |
"content": "<add_file>",
|
| 293 |
-
"single_word": false,
|
| 294 |
"lstrip": false,
|
| 295 |
-
"rstrip": false,
|
| 296 |
"normalized": false,
|
|
|
|
|
|
|
| 297 |
"special": true
|
| 298 |
-
|
| 299 |
-
|
| 300 |
"content": "<delete_file>",
|
| 301 |
"lstrip": false,
|
| 302 |
"normalized": false,
|
| 303 |
"rstrip": false,
|
| 304 |
"single_word": false,
|
| 305 |
"special": true
|
| 306 |
-
|
| 307 |
-
|
| 308 |
"content": "<rename_file>",
|
| 309 |
"lstrip": false,
|
| 310 |
"normalized": false,
|
| 311 |
"rstrip": false,
|
| 312 |
"single_word": false,
|
| 313 |
"special": true
|
| 314 |
-
|
| 315 |
-
|
| 316 |
"content": "<edit_file>",
|
| 317 |
"lstrip": false,
|
| 318 |
"normalized": false,
|
| 319 |
"rstrip": false,
|
| 320 |
"single_word": false,
|
| 321 |
"special": true
|
| 322 |
-
|
| 323 |
-
|
| 324 |
"content": "<commit_message>",
|
| 325 |
"lstrip": false,
|
| 326 |
"normalized": false,
|
| 327 |
"rstrip": false,
|
| 328 |
"single_word": false,
|
| 329 |
"special": true
|
| 330 |
-
|
| 331 |
-
|
| 332 |
"content": "<empty_source_file>",
|
| 333 |
"lstrip": false,
|
| 334 |
"normalized": false,
|
| 335 |
"rstrip": false,
|
| 336 |
"single_word": false,
|
| 337 |
"special": true
|
| 338 |
-
|
| 339 |
-
|
| 340 |
"content": "<repo_struct>",
|
| 341 |
"lstrip": false,
|
| 342 |
"normalized": false,
|
| 343 |
"rstrip": false,
|
| 344 |
"single_word": false,
|
| 345 |
"special": true
|
| 346 |
-
|
| 347 |
-
|
| 348 |
-
|
| 349 |
-
|
| 350 |
-
|
| 351 |
-
|
| 352 |
-
|
| 353 |
-
|
| 354 |
-
|
| 355 |
-
|
| 356 |
-
|
| 357 |
-
|
| 358 |
-
|
| 359 |
-
|
| 360 |
-
|
| 361 |
-
|
| 362 |
-
|
| 363 |
-
|
| 364 |
-
|
| 365 |
-
|
| 366 |
-
|
| 367 |
-
|
| 368 |
-
|
| 369 |
-
|
| 370 |
-
|
| 371 |
-
|
| 372 |
-
|
| 373 |
-
|
| 374 |
-
|
| 375 |
-
|
| 376 |
-
|
| 377 |
-
|
| 378 |
-
|
| 379 |
-
|
| 380 |
-
|
| 381 |
-
|
| 382 |
-
|
| 383 |
-
|
| 384 |
-
|
| 385 |
-
|
| 386 |
-
|
| 387 |
-
|
| 388 |
-
|
| 389 |
-
|
| 390 |
-
|
| 391 |
-
|
| 392 |
-
|
| 393 |
-
|
| 394 |
-
|
| 395 |
-
|
| 396 |
-
|
| 397 |
-
|
| 398 |
-
|
| 399 |
-
|
| 400 |
-
|
| 401 |
-
|
| 402 |
-
|
| 403 |
-
|
| 404 |
-
|
| 405 |
-
|
| 406 |
-
|
| 407 |
-
|
| 408 |
-
|
| 409 |
-
|
| 410 |
-
|
| 411 |
-
|
| 412 |
-
|
| 413 |
-
|
| 414 |
-
|
| 415 |
-
|
| 416 |
-
|
| 417 |
-
|
| 418 |
-
|
| 419 |
-
|
| 420 |
-
|
| 421 |
-
|
| 422 |
-
|
| 423 |
-
|
| 424 |
-
|
| 425 |
-
|
| 426 |
-
|
| 427 |
-
|
| 428 |
-
|
| 429 |
-
|
| 430 |
-
|
| 431 |
-
|
| 432 |
-
|
| 433 |
-
|
| 434 |
-
|
| 435 |
},
|
| 436 |
"additional_special_tokens": [
|
| 437 |
-
|
| 438 |
-
|
| 439 |
-
|
| 440 |
-
|
| 441 |
-
|
| 442 |
-
|
| 443 |
-
|
| 444 |
-
|
| 445 |
-
|
| 446 |
-
|
| 447 |
-
|
| 448 |
-
|
| 449 |
-
|
| 450 |
-
|
| 451 |
-
|
| 452 |
-
|
| 453 |
-
|
| 454 |
-
|
| 455 |
-
|
| 456 |
-
|
| 457 |
-
|
| 458 |
-
|
| 459 |
-
|
| 460 |
-
|
| 461 |
-
|
| 462 |
-
|
| 463 |
-
|
| 464 |
-
|
| 465 |
-
|
| 466 |
-
|
| 467 |
-
|
| 468 |
-
|
| 469 |
-
|
| 470 |
-
|
| 471 |
-
|
| 472 |
-
|
| 473 |
-
|
| 474 |
-
|
| 475 |
-
|
| 476 |
-
|
| 477 |
-
|
| 478 |
-
|
| 479 |
-
|
| 480 |
-
|
| 481 |
-
|
| 482 |
-
|
| 483 |
-
|
| 484 |
-
|
| 485 |
-
|
| 486 |
-
|
| 487 |
-
|
| 488 |
-
"add_prefix_space": false,
|
| 489 |
"bos_token": "]~!b[",
|
| 490 |
"clean_up_tokenization_spaces": false,
|
| 491 |
"eos_token": "[e~[",
|
|
|
|
| 492 |
"model_max_length": 40960000,
|
|
|
|
|
|
|
| 493 |
"tokenizer_class": "GPT2Tokenizer",
|
| 494 |
"unk_token": "]!d~["
|
| 495 |
}
|
|
|
|
| 1 |
{
|
| 2 |
+
"add_prefix_space": false,
|
| 3 |
"added_tokens_decoder": {
|
| 4 |
+
"200000": {
|
| 5 |
"content": "]!p~[",
|
| 6 |
"lstrip": false,
|
| 7 |
"normalized": false,
|
| 8 |
"rstrip": false,
|
| 9 |
"single_word": false,
|
| 10 |
"special": true
|
| 11 |
+
},
|
| 12 |
+
"200001": {
|
| 13 |
"content": "<fim_prefix>",
|
| 14 |
"lstrip": false,
|
| 15 |
"normalized": false,
|
| 16 |
"rstrip": false,
|
| 17 |
"single_word": false,
|
| 18 |
"special": true
|
| 19 |
+
},
|
| 20 |
+
"200002": {
|
| 21 |
"content": "<fim_middle>",
|
| 22 |
"lstrip": false,
|
| 23 |
"normalized": false,
|
| 24 |
"rstrip": false,
|
| 25 |
"single_word": false,
|
| 26 |
"special": true
|
| 27 |
+
},
|
| 28 |
+
"200003": {
|
| 29 |
"content": "<fim_suffix>",
|
| 30 |
"lstrip": false,
|
| 31 |
"normalized": false,
|
| 32 |
"rstrip": false,
|
| 33 |
"single_word": false,
|
| 34 |
"special": true
|
| 35 |
+
},
|
| 36 |
+
"200004": {
|
| 37 |
"content": "<fim_pad>",
|
| 38 |
"lstrip": false,
|
| 39 |
"normalized": false,
|
| 40 |
"rstrip": false,
|
| 41 |
"single_word": false,
|
| 42 |
"special": true
|
| 43 |
+
},
|
| 44 |
+
"200005": {
|
| 45 |
"content": "<reponame>",
|
| 46 |
"lstrip": false,
|
| 47 |
"normalized": false,
|
| 48 |
"rstrip": false,
|
| 49 |
"single_word": false,
|
| 50 |
"special": true
|
| 51 |
+
},
|
| 52 |
+
"200006": {
|
| 53 |
"content": "<filename>",
|
| 54 |
"lstrip": false,
|
| 55 |
"normalized": false,
|
| 56 |
"rstrip": false,
|
| 57 |
"single_word": false,
|
| 58 |
"special": true
|
| 59 |
+
},
|
| 60 |
+
"200007": {
|
| 61 |
"content": "<gh_stars>",
|
| 62 |
"lstrip": false,
|
| 63 |
"normalized": false,
|
| 64 |
"rstrip": false,
|
| 65 |
"single_word": false,
|
| 66 |
"special": true
|
| 67 |
+
},
|
| 68 |
+
"200008": {
|
| 69 |
"content": "<issue_start>",
|
| 70 |
"lstrip": false,
|
| 71 |
"normalized": false,
|
| 72 |
"rstrip": false,
|
| 73 |
"single_word": false,
|
| 74 |
"special": true
|
| 75 |
+
},
|
| 76 |
+
"200009": {
|
| 77 |
"content": "<issue_comment>",
|
| 78 |
"lstrip": false,
|
| 79 |
"normalized": false,
|
| 80 |
"rstrip": false,
|
| 81 |
"single_word": false,
|
| 82 |
"special": true
|
| 83 |
+
},
|
| 84 |
+
"200010": {
|
| 85 |
"content": "<issue_closed>",
|
| 86 |
"lstrip": false,
|
| 87 |
"normalized": false,
|
| 88 |
"rstrip": false,
|
| 89 |
"single_word": false,
|
| 90 |
"special": true
|
| 91 |
+
},
|
| 92 |
+
"200011": {
|
| 93 |
+
"content": "<jupyter_start>",
|
| 94 |
+
"lstrip": false,
|
| 95 |
+
"normalized": false,
|
| 96 |
+
"rstrip": false,
|
| 97 |
+
"single_word": false,
|
| 98 |
+
"special": true
|
| 99 |
+
},
|
| 100 |
+
"200012": {
|
| 101 |
"content": "<jupyter_text>",
|
| 102 |
"lstrip": false,
|
| 103 |
"normalized": false,
|
| 104 |
"rstrip": false,
|
| 105 |
"single_word": false,
|
| 106 |
"special": true
|
| 107 |
+
},
|
| 108 |
+
"200013": {
|
| 109 |
"content": "<jupyter_code>",
|
| 110 |
"lstrip": false,
|
| 111 |
"normalized": false,
|
| 112 |
"rstrip": false,
|
| 113 |
"single_word": false,
|
| 114 |
"special": true
|
| 115 |
+
},
|
| 116 |
+
"200014": {
|
| 117 |
"content": "<jupyter_output>",
|
| 118 |
"lstrip": false,
|
| 119 |
"normalized": false,
|
| 120 |
"rstrip": false,
|
| 121 |
"single_word": false,
|
| 122 |
"special": true
|
| 123 |
+
},
|
| 124 |
+
"200015": {
|
| 125 |
+
"content": "<empty_output>",
|
| 126 |
+
"lstrip": false,
|
| 127 |
+
"normalized": false,
|
| 128 |
+
"rstrip": false,
|
| 129 |
+
"single_word": false,
|
| 130 |
+
"special": true
|
| 131 |
+
},
|
| 132 |
+
"200016": {
|
| 133 |
"content": "<commit_before>",
|
| 134 |
"lstrip": false,
|
| 135 |
"normalized": false,
|
| 136 |
"rstrip": false,
|
| 137 |
"single_word": false,
|
| 138 |
"special": true
|
| 139 |
+
},
|
| 140 |
+
"200017": {
|
| 141 |
"content": "<commit_msg>",
|
| 142 |
"lstrip": false,
|
| 143 |
"normalized": false,
|
| 144 |
"rstrip": false,
|
| 145 |
"single_word": false,
|
| 146 |
"special": true
|
| 147 |
+
},
|
| 148 |
+
"200018": {
|
| 149 |
"content": "<commit_after>",
|
| 150 |
"lstrip": false,
|
| 151 |
"normalized": false,
|
| 152 |
"rstrip": false,
|
| 153 |
"single_word": false,
|
| 154 |
"special": true
|
| 155 |
+
},
|
| 156 |
+
"200019": {
|
| 157 |
"content": "]~b]",
|
| 158 |
"lstrip": false,
|
| 159 |
"normalized": false,
|
| 160 |
"rstrip": false,
|
| 161 |
"single_word": false,
|
| 162 |
"special": true
|
| 163 |
+
},
|
| 164 |
+
"200020": {
|
| 165 |
"content": "[e~[",
|
| 166 |
"lstrip": false,
|
| 167 |
"normalized": false,
|
| 168 |
"rstrip": false,
|
| 169 |
"single_word": false,
|
| 170 |
"special": true
|
| 171 |
+
},
|
| 172 |
+
"200021": {
|
| 173 |
"content": "]!d~[",
|
| 174 |
"lstrip": false,
|
| 175 |
"normalized": false,
|
| 176 |
"rstrip": false,
|
| 177 |
"single_word": false,
|
| 178 |
"special": true
|
| 179 |
+
},
|
| 180 |
+
"200022": {
|
| 181 |
"content": "<function_call>",
|
| 182 |
"lstrip": false,
|
| 183 |
"normalized": false,
|
| 184 |
"rstrip": false,
|
| 185 |
"single_word": false,
|
| 186 |
"special": true
|
| 187 |
+
},
|
| 188 |
+
"200023": {
|
| 189 |
"content": "<code_interpreter>",
|
| 190 |
"lstrip": false,
|
| 191 |
"normalized": false,
|
| 192 |
"rstrip": false,
|
| 193 |
"single_word": false,
|
| 194 |
"special": true
|
| 195 |
+
},
|
| 196 |
+
"200024": {
|
| 197 |
"content": "]<]speech[>[",
|
| 198 |
"lstrip": false,
|
| 199 |
"normalized": false,
|
| 200 |
"rstrip": false,
|
| 201 |
"single_word": false,
|
| 202 |
"special": true
|
| 203 |
+
},
|
| 204 |
+
"200025": {
|
| 205 |
"content": "]<]image[>[",
|
| 206 |
"lstrip": false,
|
| 207 |
"normalized": false,
|
| 208 |
"rstrip": false,
|
| 209 |
"single_word": false,
|
| 210 |
"special": true
|
| 211 |
+
},
|
| 212 |
+
"200026": {
|
| 213 |
"content": "]<]video[>[",
|
| 214 |
"lstrip": false,
|
| 215 |
"normalized": false,
|
| 216 |
"rstrip": false,
|
| 217 |
"single_word": false,
|
| 218 |
"special": true
|
| 219 |
+
},
|
| 220 |
+
"200027": {
|
| 221 |
"content": "]<]start of speech[>[",
|
| 222 |
"lstrip": false,
|
| 223 |
"normalized": false,
|
| 224 |
"rstrip": false,
|
| 225 |
"single_word": false,
|
| 226 |
"special": true
|
| 227 |
+
},
|
| 228 |
+
"200028": {
|
| 229 |
"content": "]<]end of speech[>[",
|
| 230 |
"lstrip": false,
|
| 231 |
"normalized": false,
|
| 232 |
"rstrip": false,
|
| 233 |
"single_word": false,
|
| 234 |
"special": true
|
| 235 |
+
},
|
| 236 |
+
"200029": {
|
| 237 |
"content": "]<]start of image[>[",
|
| 238 |
"lstrip": false,
|
| 239 |
"normalized": false,
|
| 240 |
"rstrip": false,
|
| 241 |
"single_word": false,
|
| 242 |
"special": true
|
| 243 |
+
},
|
| 244 |
+
"200030": {
|
| 245 |
"content": "]<]end of image[>[",
|
| 246 |
"lstrip": false,
|
| 247 |
"normalized": false,
|
| 248 |
"rstrip": false,
|
| 249 |
"single_word": false,
|
| 250 |
"special": true
|
| 251 |
+
},
|
| 252 |
+
"200031": {
|
| 253 |
"content": "]<]start of video[>[",
|
| 254 |
"lstrip": false,
|
| 255 |
"normalized": false,
|
| 256 |
"rstrip": false,
|
| 257 |
"single_word": false,
|
| 258 |
"special": true
|
| 259 |
+
},
|
| 260 |
+
"200032": {
|
| 261 |
"content": "]<]end of video[>[",
|
| 262 |
"lstrip": false,
|
| 263 |
"normalized": false,
|
| 264 |
"rstrip": false,
|
| 265 |
"single_word": false,
|
| 266 |
"special": true
|
| 267 |
+
},
|
| 268 |
+
"200033": {
|
| 269 |
"content": "]<]vision pad[>[",
|
| 270 |
"lstrip": false,
|
| 271 |
"normalized": false,
|
| 272 |
"rstrip": false,
|
| 273 |
"single_word": false,
|
| 274 |
"special": true
|
| 275 |
+
},
|
| 276 |
+
"200034": {
|
| 277 |
"content": "]~!b[",
|
| 278 |
"lstrip": false,
|
| 279 |
"normalized": false,
|
| 280 |
"rstrip": false,
|
| 281 |
"single_word": false,
|
| 282 |
"special": true
|
| 283 |
+
},
|
| 284 |
+
"200035": {
|
| 285 |
"content": "<jupyter_error>",
|
| 286 |
"lstrip": false,
|
| 287 |
"normalized": false,
|
| 288 |
"rstrip": false,
|
| 289 |
"single_word": false,
|
| 290 |
"special": true
|
| 291 |
+
},
|
| 292 |
+
"200036": {
|
| 293 |
"content": "<add_file>",
|
|
|
|
| 294 |
"lstrip": false,
|
|
|
|
| 295 |
"normalized": false,
|
| 296 |
+
"rstrip": false,
|
| 297 |
+
"single_word": false,
|
| 298 |
"special": true
|
| 299 |
+
},
|
| 300 |
+
"200037": {
|
| 301 |
"content": "<delete_file>",
|
| 302 |
"lstrip": false,
|
| 303 |
"normalized": false,
|
| 304 |
"rstrip": false,
|
| 305 |
"single_word": false,
|
| 306 |
"special": true
|
| 307 |
+
},
|
| 308 |
+
"200038": {
|
| 309 |
"content": "<rename_file>",
|
| 310 |
"lstrip": false,
|
| 311 |
"normalized": false,
|
| 312 |
"rstrip": false,
|
| 313 |
"single_word": false,
|
| 314 |
"special": true
|
| 315 |
+
},
|
| 316 |
+
"200039": {
|
| 317 |
"content": "<edit_file>",
|
| 318 |
"lstrip": false,
|
| 319 |
"normalized": false,
|
| 320 |
"rstrip": false,
|
| 321 |
"single_word": false,
|
| 322 |
"special": true
|
| 323 |
+
},
|
| 324 |
+
"200040": {
|
| 325 |
"content": "<commit_message>",
|
| 326 |
"lstrip": false,
|
| 327 |
"normalized": false,
|
| 328 |
"rstrip": false,
|
| 329 |
"single_word": false,
|
| 330 |
"special": true
|
| 331 |
+
},
|
| 332 |
+
"200041": {
|
| 333 |
"content": "<empty_source_file>",
|
| 334 |
"lstrip": false,
|
| 335 |
"normalized": false,
|
| 336 |
"rstrip": false,
|
| 337 |
"single_word": false,
|
| 338 |
"special": true
|
| 339 |
+
},
|
| 340 |
+
"200042": {
|
| 341 |
"content": "<repo_struct>",
|
| 342 |
"lstrip": false,
|
| 343 |
"normalized": false,
|
| 344 |
"rstrip": false,
|
| 345 |
"single_word": false,
|
| 346 |
"special": true
|
| 347 |
+
},
|
| 348 |
+
"200043": {
|
| 349 |
+
"content": "<code_context>",
|
| 350 |
+
"lstrip": false,
|
| 351 |
+
"normalized": false,
|
| 352 |
+
"rstrip": false,
|
| 353 |
+
"single_word": false,
|
| 354 |
+
"special": true
|
| 355 |
+
},
|
| 356 |
+
"200044": {
|
| 357 |
+
"content": "<file_content>",
|
| 358 |
+
"lstrip": false,
|
| 359 |
+
"normalized": false,
|
| 360 |
+
"rstrip": false,
|
| 361 |
+
"single_word": false,
|
| 362 |
+
"special": true
|
| 363 |
+
},
|
| 364 |
+
"200045": {
|
| 365 |
+
"content": "<source_files>",
|
| 366 |
+
"lstrip": false,
|
| 367 |
+
"normalized": false,
|
| 368 |
+
"rstrip": false,
|
| 369 |
+
"single_word": false,
|
| 370 |
+
"special": true
|
| 371 |
+
},
|
| 372 |
+
"200046": {
|
| 373 |
+
"content": "<pr_start>",
|
| 374 |
+
"lstrip": false,
|
| 375 |
+
"normalized": false,
|
| 376 |
+
"rstrip": false,
|
| 377 |
+
"single_word": false,
|
| 378 |
+
"special": true
|
| 379 |
+
},
|
| 380 |
+
"200047": {
|
| 381 |
+
"content": "<review_comment>",
|
| 382 |
+
"lstrip": false,
|
| 383 |
+
"normalized": false,
|
| 384 |
+
"rstrip": false,
|
| 385 |
+
"single_word": false,
|
| 386 |
+
"special": true
|
| 387 |
+
},
|
| 388 |
+
"200048": {
|
| 389 |
+
"content": "<filepath>",
|
| 390 |
+
"lstrip": false,
|
| 391 |
+
"normalized": false,
|
| 392 |
+
"rstrip": false,
|
| 393 |
+
"single_word": false,
|
| 394 |
+
"special": true
|
| 395 |
+
},
|
| 396 |
+
"200049": {
|
| 397 |
+
"content": "<file_sep>",
|
| 398 |
+
"lstrip": false,
|
| 399 |
+
"normalized": false,
|
| 400 |
+
"rstrip": false,
|
| 401 |
+
"single_word": false,
|
| 402 |
+
"special": true
|
| 403 |
+
},
|
| 404 |
+
"200050": {
|
| 405 |
+
"content": "<think>",
|
| 406 |
+
"lstrip": false,
|
| 407 |
+
"normalized": false,
|
| 408 |
+
"rstrip": false,
|
| 409 |
+
"single_word": false,
|
| 410 |
+
"special": false
|
| 411 |
+
},
|
| 412 |
+
"200051": {
|
| 413 |
+
"content": "</think>",
|
| 414 |
+
"lstrip": false,
|
| 415 |
+
"normalized": false,
|
| 416 |
+
"rstrip": false,
|
| 417 |
+
"single_word": false,
|
| 418 |
+
"special": false
|
| 419 |
+
},
|
| 420 |
+
"200052": {
|
| 421 |
+
"content": "<minimax:tool_call>",
|
| 422 |
+
"lstrip": false,
|
| 423 |
+
"normalized": false,
|
| 424 |
+
"rstrip": false,
|
| 425 |
+
"single_word": false,
|
| 426 |
+
"special": false
|
| 427 |
+
},
|
| 428 |
+
"200053": {
|
| 429 |
+
"content": "</minimax:tool_call>",
|
| 430 |
+
"lstrip": false,
|
| 431 |
+
"normalized": false,
|
| 432 |
+
"rstrip": false,
|
| 433 |
+
"single_word": false,
|
| 434 |
+
"special": false
|
| 435 |
+
}
|
| 436 |
},
|
| 437 |
"additional_special_tokens": [
|
| 438 |
+
"<code_interpreter>",
|
| 439 |
+
"<commit_after>",
|
| 440 |
+
"<commit_before>",
|
| 441 |
+
"<commit_msg>",
|
| 442 |
+
"<empty_output>",
|
| 443 |
+
"<filename>",
|
| 444 |
+
"<fim_middle>",
|
| 445 |
+
"<fim_pad>",
|
| 446 |
+
"<fim_prefix>",
|
| 447 |
+
"<fim_suffix>",
|
| 448 |
+
"<function_call>",
|
| 449 |
+
"<gh_stars>",
|
| 450 |
+
"]<]speech[>[",
|
| 451 |
+
"]<]image[>[",
|
| 452 |
+
"]<]video[>[",
|
| 453 |
+
"]<]start of speech[>[",
|
| 454 |
+
"]<]end of speech[>[",
|
| 455 |
+
"]<]start of image[>[",
|
| 456 |
+
"]<]end of image[>[",
|
| 457 |
+
"]<]start of video[>[",
|
| 458 |
+
"]<]end of video[>[",
|
| 459 |
+
"]<]vision pad[>[",
|
| 460 |
+
"]~!b[",
|
| 461 |
+
"<issue_closed>",
|
| 462 |
+
"<issue_comment>",
|
| 463 |
+
"<issue_start>",
|
| 464 |
+
"<jupyter_code>",
|
| 465 |
+
"<jupyter_output>",
|
| 466 |
+
"<jupyter_start>",
|
| 467 |
+
"<jupyter_text>",
|
| 468 |
+
"<reponame>",
|
| 469 |
+
"[e~[",
|
| 470 |
+
"]!d~[",
|
| 471 |
+
"]!p~[",
|
| 472 |
+
"]~b]",
|
| 473 |
+
"<jupyter_error>",
|
| 474 |
+
"<add_file>",
|
| 475 |
+
"<delete_file>",
|
| 476 |
+
"<rename_file>",
|
| 477 |
+
"<edit_file>",
|
| 478 |
+
"<commit_message>",
|
| 479 |
+
"<empty_source_file>",
|
| 480 |
+
"<repo_struct>",
|
| 481 |
+
"<code_context>",
|
| 482 |
+
"<file_content>",
|
| 483 |
+
"<source_files>",
|
| 484 |
+
"<pr_start>",
|
| 485 |
+
"<review_comment>",
|
| 486 |
+
"<filepath>",
|
| 487 |
+
"<file_sep>"
|
| 488 |
+
],
|
|
|
|
| 489 |
"bos_token": "]~!b[",
|
| 490 |
"clean_up_tokenization_spaces": false,
|
| 491 |
"eos_token": "[e~[",
|
| 492 |
+
"extra_special_tokens": {},
|
| 493 |
"model_max_length": 40960000,
|
| 494 |
+
"pad_token": "[e~[",
|
| 495 |
+
"padding_side": "left",
|
| 496 |
"tokenizer_class": "GPT2Tokenizer",
|
| 497 |
"unk_token": "]!d~["
|
| 498 |
}
|