Update README.md
README.md
CHANGED
@@ -10,6 +10,10 @@ language:
 
 [GitHub](https://github.com/topic-overwrite/topic-level-overwrite/tree/main) | [Paper](https://arxiv.org/abs/2411.17265)
 
+## Model Details
+
+The model, trained using the RLHF/RLAIF methods proposed in the [TPO paper](https://arxiv.org/abs/2411.17265) by llava, has enhanced trustworthiness and reduced hallucinations.
+
 ## Model Description
 
 - **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)