amd/AMD-OLMo-1B-SFT

Text Generation

Prakamya committed on Oct 31, 2024

Commit 2a5bb07 (verified) · 1 Parent(s): b373b2f

Update README.md

Files changed (1)

README.md +11 -0

@@ -277,6 +277,17 @@ hf-align/scripts/run_dpo.py hf-align/recipes/AMD-OLMo-1B-dpo.yaml \
 | **AlpacaEval 2 (LC Win Rate)**     |       Length Control Win Rate  (weighted_alpaca_eval_gpt4_turbo)        |
 | **MTBench**   |       Average score for single-answer grading (2 turns)          |
+Feel free to cite our AMD-OLMo models:
+```bibtex
+@misc{AMD-OLMo,
+    title = {AMD-OLMo: A series of 1B language models trained from scratch by AMD on AMD Instinct™ MI250 GPUs.},
+    url = {https://huggingface.co/amd/AMD-OLMo},
+    author = {Jiang Liu and Jialian Wu and Prakamya Mishra and Zicheng Liu and Sudhanshu Ranjan and Pratik Prabhanjan Brahma and Yusheng Su and Gowtham Ramesh and Peng Sun and Zhe Li and Dong Li and Lu Tian and Emad Barsoum},
+    month = {October},
+    year = {2024}
+}
+```
 #### License
 Copyright (c) 2018-2024 Advanced Micro Devices, Inc. All Rights Reserved.