---
license: apache-2.0
language:
- en
pipeline_tag: image-to-text
library_name: transformers
base_model: Qwen/Qwen-VL
tags:
- vision-language
- chart-understanding
- chart-question-answering
- document-understanding
- multimodal
datasets:
- custom
metrics:
- accuracy
model-index:
- name: ChartQwen
  results: []
---

# ChartQwen

## Model Description

ChartQwen is a vision-language model fine-tuned from **Qwen/Qwen-VL** for chart understanding tasks.  
The model is designed to interpret visual charts such as bar charts, line graphs, and plots, and answer natural language questions related to them.

It supports multimodal reasoning by jointly processing images and text prompts.

---

## Intended Use

This model can be used for:

- Chart question answering
- Chart data interpretation
- Visual reasoning over plots and graphs
- Document and report analysis involving charts

---

## Training Details

- **Base model:** Qwen/Qwen-VL
- **Modality:** Image + Text
- **Fine-tuning type:** Supervised fine-tuning on chart-related visual-question pairs
- **Dataset:** Custom chart dataset (generated and curated for chart understanding)

## Limitations

- Performance may degrade on low-resolution or highly cluttered charts
- The model may struggle with handwritten charts or uncommon chart styles
- Numerical precision depends on chart clarity

---