Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ Just a simple merge, without any inference code, and does not indicate whether t
 2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
 3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.
 4. **Grouping weights by layer**: Organizes weights by model layer, with special weights (e.g., `lm_head.weight`, `model.embed_tokens.weight`, and `model.norm.weight`) handled separately.
-5. **Saving merged weights**: Saves the grouped weights as new safetensors files and generates a new index file
+5. **Saving merged weights**: Saves the grouped weights as new safetensors files and generates a new index file [pytorch_model.bin.index.json](https://huggingface.co/huihui-ai/grok-2/blob/main/pytorch_model.bin.index.json).
 
 ### Features
 - **Input**: Safetensors files in the `xai-org/grok-2` model directory.
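
The merge steps listed in this hunk (merging TP shards along a dimension, checking the merged shape, grouping keys by layer, and writing an index) can be sketched roughly as below. This is a minimal, standard-library-only illustration and not the repository's actual script: tensors are stood in for by nested lists, the `model.layers.N.` key pattern, the helper names, and the `layer-N.safetensors` file names are all assumptions, and the `weight_map` layout only loosely follows the index format used for sharded Hugging Face checkpoints.

```python
import json
import re
from collections import defaultdict

# Keys the README says are handled separately from per-layer weights.
SPECIAL_KEYS = {"lm_head.weight", "model.embed_tokens.weight", "model.norm.weight"}
# Assumed key pattern for per-layer weights (hypothetical, not confirmed by the source).
LAYER_RE = re.compile(r"^model\.layers\.(\d+)\.")


def shape(tensor):
    """Return (rows, cols) of a 2-D tensor stored as a list of rows."""
    return (len(tensor), len(tensor[0]) if tensor else 0)


def concat_shards(shards, dim):
    """Concatenate 2-D shards along dim 0 (rows) or dim 1 (columns)."""
    if dim == 0:
        return [row[:] for shard in shards for row in shard]
    if dim == 1:
        n_rows = len(shards[0])
        if any(len(s) != n_rows for s in shards):
            raise ValueError("shards disagree on row count")
        return [sum((s[i] for s in shards), []) for i in range(n_rows)]
    raise ValueError(f"unsupported dim: {dim}")


def merge_and_check(shards, dim, expected_shape):
    """Merge shards and verify the merged shape, as step 3 describes."""
    merged = concat_shards(shards, dim)
    if shape(merged) != expected_shape:
        raise ValueError(f"got shape {shape(merged)}, expected {expected_shape}")
    return merged


def group_by_layer(keys):
    """Group weight names by layer; special or unmatched keys go to 'special'."""
    groups = defaultdict(list)
    for key in keys:
        match = LAYER_RE.match(key)
        if key in SPECIAL_KEYS or match is None:
            groups["special"].append(key)
        else:
            groups[f"layer-{int(match.group(1))}"].append(key)
    return dict(groups)


def build_index(groups):
    """Map every weight name to the (hypothetical) file its group is saved in."""
    weight_map = {}
    for group_name, keys in sorted(groups.items()):
        for key in keys:
            weight_map[key] = f"{group_name}.safetensors"
    return {"weight_map": weight_map}


if __name__ == "__main__":
    # Two toy shards standing in for TP shards of one weight.
    toy_shards = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
    merged = merge_and_check(toy_shards, dim=1, expected_shape=(2, 4))
    groups = group_by_layer(["lm_head.weight", "model.layers.0.example.weight"])
    print(json.dumps(build_index(groups), indent=2))
```

A real script would presumably load and save tensors with the `safetensors` library and concatenate with a tensor library rather than nested lists; which dimension each weight is merged along depends on how the original model was sharded, which the README does not spell out.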