Fancy-MLLM
/

R1-Onevision-7B

Image-Text-to-Text

Model card Files Files and versions

Fancy-MLLM commited on Feb 25

Commit

cb24a13

·

verified ·

1 Parent(s): 7c5303f

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -27,6 +27,7 @@ cutoff_len: 8192
 per_device_train_batch_size: 1
 gradient_accumulation_steps: 16
 learning_rate: 1.0e-5
 num_train_epochs: 1.0
 lr_scheduler_type: cosine
 warmup_ratio: 0.05
@@ -107,7 +108,7 @@ print(output_text)
     We are working on the release of a smaller, more efficient 3B model, which is designed to provide a balance between performance and resource efficiency. This model aims to deliver strong multimodal reasoning capabilities while being more accessible and optimized for environments with limited computational resources, offering a more compact alternative to the current 7B model.
 ## R1-Onevision Authors
-- Yi Yang*, Xiaoxuan He*, Hongkun Pan*, Xiyan Jiang, Yan Deng, Xingtao Yang, Haoyu Lu, Minfeng Zhu†, Bo Zhang†, Wei Chen†
 - *Equal contribution. †Corresponding authors.
 ## Model Contact

 per_device_train_batch_size: 1
 gradient_accumulation_steps: 16
 learning_rate: 1.0e-5
 num_train_epochs: 1.0
 lr_scheduler_type: cosine
 warmup_ratio: 0.05
     We are working on the release of a smaller, more efficient 3B model, which is designed to provide a balance between performance and resource efficiency. This model aims to deliver strong multimodal reasoning capabilities while being more accessible and optimized for environments with limited computational resources, offering a more compact alternative to the current 7B model.
 ## R1-Onevision Authors
+- Yi Yang*, Xiaoxuan He*, Hongkun Pan*, Xiyan Jiang, Yan Deng, Xingtao Yang, Haoyu Lu, Minfeng Zhu†, Bo Zhang†
 - *Equal contribution. †Corresponding authors.
 ## Model Contact