huihui-ai commited on
Commit
af2487a
·
verified ·
1 Parent(s): 5f3d08b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ Just a simple merge, without any inference code, and does not indicate whether t
13
  2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
14
  3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.
15
  4. **Grouping weights by layer**: Organizes weights by model layer, with special weights (e.g., `lm_head.weight`, `model.embed_tokens.weight`, and `model.norm.weight`) handled separately.
16
- 5. **Saving merged weights**: Saves the grouped weights as new safetensors files and generates a new index file `pytorch_model.bin.index.json`.
17
 
18
  ### Features
19
  - **Input**: Safetensors files in the `xai-org/grok-2` model directory.
 
13
  2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
14
  3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.
15
  4. **Grouping weights by layer**: Organizes weights by model layer, with special weights (e.g., `lm_head.weight`, `model.embed_tokens.weight`, and `model.norm.weight`) handled separately.
16
+ 5. **Saving merged weights**: Saves the grouped weights as new safetensors files and generates a new index file [pytorch_model.bin.index.json](https://huggingface.co/huihui-ai/grok-2/blob/main/pytorch_model.bin.index.json).
17
 
18
  ### Features
19
  - **Input**: Safetensors files in the `xai-org/grok-2` model directory.