yanyoyo
/

InternVL

vision-language

Model card Files Files and versions

yanyoyo commited on Jan 8, 2025

Commit

06a473e

·

1 Parent(s): 8515c91

update

Files changed (1) hide show

README.md +49 -1

README.md CHANGED Viewed

@@ -1,3 +1,51 @@
 ---
-license: mit
 ---

 ---
+language:
+- zh
+- en
+tags:
+- internvl
+- multimodal
+- vision-language
+- food
+- finetuned
+license: apache-2.0
+datasets:
+- food-recognition
+model-index:
+- name: InternVL2-2B-Food-Finetuned
+  results:
+  - task:
+      type: vision-language-understanding
+      name: food-recognition
 ---
+# InternVL2-2B Food Recognition Finetuned Model
+## Model Description
+这是一个基于 InternVL2-2B 模型使用 LoRA 方法在食物识别数据集上微调的多模态模型。该模型专门优化了对食物图像的理解和描述能力。
+### Key Features
+- **基础模型**: InternVL2-2B
+- **微调方法**: LoRA (Low-Rank Adaptation)
+- **训练迭代**: 640 iterations
+- **特定领域**: 食物识别与描述
+- **多模态能力**: 图像理解和文本生成
+## Training Details
+### Base Model
+- **架构**: InternVL2
+- **参数量**: 2B
+- **类型**: 视觉-语言多模态模型
+### Fine-tuning
+- **方法**: LoRA
+- **配置文件**: internvl_v2_internlm2_2b_lora_finetune_food.py
+- **训练步数**: 640
+- **学习率**: 3.5e-5
+- **训练轮数**: 10 epochs
+## Usage