Update README.md
Browse files
README.md
CHANGED
|
@@ -92,8 +92,6 @@ OpenBioLLM-8B is an advanced open source language model designed specifically fo
|
|
| 92 |
<img width="1200px" src="https://cdn-uploads.huggingface.co/production/uploads/5f3fe13d79c1ba4c353d0c19/oPchsJsEpQoGcGXVbh7YS.png">
|
| 93 |
</div>
|
| 94 |
|
| 95 |
-
|
| 96 |
-
- **Reward Model**: [Nexusflow/Starling-RM-34B](https://huggingface.co/Nexusflow/Starling-RM-34B)
|
| 97 |
- **Policy Optimization**: [Fine-Tuning Language Models from Human Preferences (PPO)](https://arxiv.org/abs/1909.08593)
|
| 98 |
- **Ranking Dataset**: [berkeley-nest/Nectar](https://huggingface.co/datasets/berkeley-nest/Nectar)
|
| 99 |
- **Fine-tuning dataset**: Custom Medical Instruct dataset (We plan to release a sample training dataset in our upcoming paper; please stay updated)
|
|
@@ -107,7 +105,7 @@ This combination of cutting-edge techniques enables OpenBioLLM-8B to align with
|
|
| 107 |
- **Language(s) (NLP):** en
|
| 108 |
- **Developed By**: [Ankit Pal (Aaditya Ura)](https://aadityaura.github.io/) from Saama AI Labs
|
| 109 |
- **License:** Meta-Llama License
|
| 110 |
-
- **Fine-tuned from models:** [meta-llama/Meta-Llama-3-8B](meta-llama/Meta-Llama-3-8B)
|
| 111 |
- **Resources for more information:**
|
| 112 |
- Paper: Coming soon
|
| 113 |
|
|
|
|
| 92 |
<img width="1200px" src="https://cdn-uploads.huggingface.co/production/uploads/5f3fe13d79c1ba4c353d0c19/oPchsJsEpQoGcGXVbh7YS.png">
|
| 93 |
</div>
|
| 94 |
|
|
|
|
|
|
|
| 95 |
- **Policy Optimization**: [Fine-Tuning Language Models from Human Preferences (PPO)](https://arxiv.org/abs/1909.08593)
|
| 96 |
- **Ranking Dataset**: [berkeley-nest/Nectar](https://huggingface.co/datasets/berkeley-nest/Nectar)
|
| 97 |
- **Fine-tuning dataset**: Custom Medical Instruct dataset (We plan to release a sample training dataset in our upcoming paper; please stay updated)
|
|
|
|
| 105 |
- **Language(s) (NLP):** en
|
| 106 |
- **Developed By**: [Ankit Pal (Aaditya Ura)](https://aadityaura.github.io/) from Saama AI Labs
|
| 107 |
- **License:** Meta-Llama License
|
| 108 |
+
- **Fine-tuned from models:** [meta-llama/Meta-Llama-3-8B](meta-llama/Meta-Llama-3-8B)
|
| 109 |
- **Resources for more information:**
|
| 110 |
- Paper: Coming soon
|
| 111 |
|