DataCanvas
/

MMAlaya

text-generation

Model card Files Files and versions

bingwork commited on Feb 1, 2024

Commit

c2f157e

·

verified ·

1 Parent(s): 28d2bcd

Update README.md

Files changed (1) hide show

README.md +7 -4

README.md CHANGED Viewed

@@ -3,15 +3,18 @@ license: apache-2.0
 pipeline_tag: image-to-text
 ---
 # MMAlaya
-MMAlaya是基于大语言模型[Alaya](https://github.com/DataCanvasIO/Alaya)的多模态模型。
 MMAlaya包含以下三个模块：
 <br>1，大语言模型[Alaya-7B-Chat](https://huggingface.co/DataCanvas/Alaya-7B-Chat)。
 <br>2，图像文本特征编码器来自[blip2-opt-2.7b](https://huggingface.co/Salesforce/blip2-opt-2.7b)的EVA-G。
-<br>3，图像文本特征到大预言模型的适配器,使用的是来自[blip2-opt-2.7b](https://huggingface.co/Salesforce/blip2-opt-2.7b)的Qformer和线性投影器。
 模型的训练主要基于[LLaVA](https://github.com/haotian-liu/LLaVA)架构
-2024.01.23 最终在[MMBench](https://mmbench.opencompass.org.cn)线上测试中文测试集分数为56.9，英文测试集分数为59.8。
-推理可以参考 [inference.py](https://github.com/DataCanvasIO/MMAlaya/blob/main/inference.py)

 pipeline_tag: image-to-text
 ---
 # MMAlaya
+MMAlaya是基于大语言模型[Alaya](https://github.com/DataCanvasIO/Alaya)的多模态模型，模型权重文件在[DataCanvas/MMAlaya](https://huggingface.co/DataCanvas/MMAlaya/tree/main)
 MMAlaya包含以下三个模块：
 <br>1，大语言模型[Alaya-7B-Chat](https://huggingface.co/DataCanvas/Alaya-7B-Chat)。
 <br>2，图像文本特征编码器来自[blip2-opt-2.7b](https://huggingface.co/Salesforce/blip2-opt-2.7b)的EVA-G。
+<br>3，图像文本特征到大预言模型的连接器,使用的是来自[blip2-opt-2.7b](https://huggingface.co/Salesforce/blip2-opt-2.7b)的Qformer和线性投影器。
 模型的训练主要基于[LLaVA](https://github.com/haotian-liu/LLaVA)架构
+OpenCompass 评测榜单，均分41.1，排名25名。
+![opencompass-leaderboard-multimodal](https://github.com/DataCanvasIO/MMAlaya/blob/main/data/opencompass-leaderboard-multimodal.png)
+MMBench 评测榜单，开源开放的模型，中文测试集，均分58.6，排名25名。
+![opencompass-leaderboard-multimodal-cn](https://github.com/DataCanvasIO/MMAlaya/blob/main/data/opencompass-leaderboard-multimodal-cn.png)
+推理可以参考 [inference.py](https://github.com/DataCanvasIO/MMAlaya/blob/main/inference.py)