guanwenyu1995 commited on
Commit
6c7c9ae
Β·
verified Β·
1 Parent(s): 41544f6

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -20,9 +20,9 @@ library_name: transformers
20
 
21
  ## Introduction
22
 
23
- BitCPM4-CANN-0.5B-unquantized is the **unquantized QAT training checkpoint** of the BitCPM4-CANN-0.5B model. This model stores the raw quantization-aware training (QAT) parameters **before** fake-quantizer fusionβ€”the ternary fake quantizers are defined in `modeling.py` and applied during forward propagation.
24
 
25
- > ⚠️ **This model is NOT intended for direct inference.** It is designed as the starting point for fine-tuning BitCPM4-CANN. If you need a model for inference, please use the pseudo-quantized version: [openbmb/BitCPM4-CANN-0.5B](https://huggingface.co/openbmb/BitCPM4-CANN-0.5B).
26
 
27
  ### Key Characteristics
28
 
@@ -47,7 +47,7 @@ This model is designed for fine-tuning with frameworks that support custom model
47
 
48
  #### Supported Fine-tuning Frameworks
49
 
50
- - **DeepSpeed** (recommended): See [example](https://huggingface.co/openbmb/BitCPM4-CANN-0.5B-unquantized/tree/main/example)
51
  - **LLaMA Factory**: Supports custom model loading with `trust_remote_code=True`
52
  - **Other Frameworks**: Any framework that supports HuggingFace-compatible model loading with custom modeling code
53
 
@@ -60,7 +60,7 @@ When fine-tuning, you **must** ensure:
60
  ```python
61
  from transformers import AutoModelForCausalLM, AutoTokenizer
62
 
63
- path = 'openbmb/BitCPM4-CANN-0.5B-unquantized'
64
  tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
65
  model = AutoModelForCausalLM.from_pretrained(
66
  path,
@@ -90,7 +90,7 @@ The converted model can then be loaded for inference in the same way as [openbmb
90
 
91
  ```
92
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
93
- β”‚ BitCPM4-CANN-0.5B-unquantized β”‚ ← This model (QAT parameters + fake quantizer in modeling.py)
94
  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
95
  β”‚
96
  β–Ό Fine-tune (DeepSpeed / LLaMA Factory / ...)
 
20
 
21
  ## Introduction
22
 
23
+ BitCPM4-CANN-8B-unquantized is the **unquantized QAT training checkpoint** of the BitCPM4-CANN-8B model. This model stores the raw quantization-aware training (QAT) parameters **before** fake-quantizer fusionβ€”the ternary fake quantizers are defined in `modeling.py` and applied during forward propagation.
24
 
25
+ > ⚠️ **This model is NOT intended for direct inference.** It is designed as the starting point for fine-tuning BitCPM4-CANN. If you need a model for inference, please use the pseudo-quantized version: [openbmb/BitCPM4-CANN-8B](https://huggingface.co/openbmb/BitCPM4-CANN-8B).
26
 
27
  ### Key Characteristics
28
 
 
47
 
48
  #### Supported Fine-tuning Frameworks
49
 
50
+ - **DeepSpeed** (recommended): See [example](https://huggingface.co/openbmb/BitCPM4-CANN-8B-unquantized/tree/main/example)
51
  - **LLaMA Factory**: Supports custom model loading with `trust_remote_code=True`
52
  - **Other Frameworks**: Any framework that supports HuggingFace-compatible model loading with custom modeling code
53
 
 
60
  ```python
61
  from transformers import AutoModelForCausalLM, AutoTokenizer
62
 
63
+ path = 'openbmb/BitCPM4-CANN-8B-unquantized'
64
  tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
65
  model = AutoModelForCausalLM.from_pretrained(
66
  path,
 
90
 
91
  ```
92
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
93
+ β”‚ BitCPM4-CANN-8B-unquantized β”‚ ← This model (QAT parameters + fake quantizer in modeling.py)
94
  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
95
  β”‚
96
  β–Ό Fine-tune (DeepSpeed / LLaMA Factory / ...)