justinj92 commited on
Commit
fd891ac
·
verified ·
1 Parent(s): 1c4c67e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -254,7 +254,7 @@ print(response)
254
  - **Library**: [Open-R1](https://github.com/huggingface/open-r1)
255
  - **Total Training Tokens**: 100B Tokens
256
  - **Framework**: PyTorch with Transformers and TRL
257
- - **Optimization**: DeepSpeed ZeRO-3
258
  - **Memory Optimization**: Gradient checkpointing, Liger kernels
259
  - **Monitoring**: Weights & Biases integration
260
  - **Hardware Used**: 8xB200 GPUs
@@ -300,4 +300,4 @@ This model is released under the Llama 3.1 Community License. Please see the [of
300
 
301
  ## Model Card Contact
302
 
303
- For questions about this model card or the model itself, please open an issue in the model repository or contact [your-email@domain.com].
 
254
  - **Library**: [Open-R1](https://github.com/huggingface/open-r1)
255
  - **Total Training Tokens**: 100B Tokens
256
  - **Framework**: PyTorch with Transformers and TRL
257
+ - **Optimization**: DeepSpeed ZeRO-2
258
  - **Memory Optimization**: Gradient checkpointing, Liger kernels
259
  - **Monitoring**: Weights & Biases integration
260
  - **Hardware Used**: 8xB200 GPUs
 
300
 
301
  ## Model Card Contact
302
 
303
+ For questions about this model card or the model itself, please open an issue in the model repository.