Fancy-MLLM
/

R1-Onevision-7B

Image-Text-to-Text

Model card Files Files and versions

Fancy-MLLM commited on Feb 20, 2025

Commit

fb867be

·

verified ·

1 Parent(s): de3f7c6

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -7,6 +7,11 @@ base_model:
 pipeline_tag: image-text-to-text
 ---
 ## Model Overview
 This is a multimodal large language model fine-tuned from Qwen2.5-VL on the **R1-Onevision** dataset. The model enhances vision-language understanding and reasoning capabilities, making it suitable for various tasks such as visual reasoning, image understanding. With its robust ability to perform multimodal reasoning, R1-Onevision emerges as a powerful AI assistant capable of addressing a wide range of problem-solving challenges across different domains.

 pipeline_tag: image-text-to-text
 ---
+## R1-Onevision
+[\[📂 GitHub\]](https://github.com/Fancy-MLLM/R1-Onevision)
+[\[🤗 HF Dataset\]](https://huggingface.co/datasets/Fancy-MLLM/R1-onevision)  [\[🤗 Reasoning Benchmark\]](https://huggingface.co/datasets/Fancy-MLLM/R1-OneVision-Bench) [\[🤗 HF Demo\]](https://huggingface.co/spaces/Fancy-MLLM/R1-OneVision)
 ## Model Overview
 This is a multimodal large language model fine-tuned from Qwen2.5-VL on the **R1-Onevision** dataset. The model enhances vision-language understanding and reasoning capabilities, making it suitable for various tasks such as visual reasoning, image understanding. With its robust ability to perform multimodal reasoning, R1-Onevision emerges as a powerful AI assistant capable of addressing a wide range of problem-solving challenges across different domains.