Sayeem26s
/

Chartqwen

vision-language

chart-understanding

chart-question-answering

document-understanding

Model card Files Files and versions

Chartqwen / README.md

Sayeem26s's picture

Upload fine-tuned Qwen2.5-VL error bar detector

7ade315 verified 20 days ago

|

history blame contribute delete

1.35 kB

	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: image-to-text
	library_name: transformers
	base_model: Qwen/Qwen-VL
	tags:
	- vision-language
	- chart-understanding
	- chart-question-answering
	- document-understanding
	- multimodal
	datasets:
	- custom
	metrics:
	- accuracy
	model-index:
	- name: ChartQwen
	results: []
	---

	# ChartQwen

	## Model Description

	ChartQwen is a vision-language model fine-tuned from Qwen/Qwen-VL for chart understanding tasks.
	The model is designed to interpret visual charts such as bar charts, line graphs, and plots, and answer natural language questions related to them.

	It supports multimodal reasoning by jointly processing images and text prompts.

	---

	## Intended Use

	This model can be used for:

	- Chart question answering
	- Chart data interpretation
	- Visual reasoning over plots and graphs
	- Document and report analysis involving charts

	---

	## Training Details

	- Base model: Qwen/Qwen-VL
	- Modality: Image + Text
	- Fine-tuning type: Supervised fine-tuning on chart-related visual-question pairs
	- Dataset: Custom chart dataset (generated and curated for chart understanding)

	## Limitations

	- Performance may degrade on low-resolution or highly cluttered charts
	- The model may struggle with handwritten charts or uncommon chart styles
	- Numerical precision depends on chart clarity

	---