metadata
language:
  - ko
pipeline_tag: image-to-text

deplot_kr

deplot_kr is a Korean image-to-data model based on Google's Pix2Struct architecture: it takes a chart image and outputs the underlying data table as text. It was fine-tuned from DePlot on a dataset of 300,000 Korean chart image-text pairs.

How to use
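
A minimal inference sketch using the Pix2Struct classes from the Hugging Face `transformers` library. Several things here are assumptions rather than facts from this card: the repository id `brainventures/deplot_kr`, the prompt (taken from the original DePlot model), and the output layout that `parse_table` expects (DePlot's convention of `<0x0A>` between rows and `|` between cells). The helper names `chart_to_text` and `parse_table` are hypothetical.

```python
MODEL_ID = "brainventures/deplot_kr"  # assumed repository id
PROMPT = "Generate underlying data table of the figure below:"  # DePlot's prompt, assumed to carry over


def chart_to_text(image_path, model_id=MODEL_ID, max_new_tokens=512):
    """Run the model on one chart image and return the raw generated text."""
    # Heavy dependencies are imported lazily so parse_table can be used without them.
    from PIL import Image
    from transformers import Pix2StructForConditionalGeneration, Pix2StructProcessor

    processor = Pix2StructProcessor.from_pretrained(model_id)
    model = Pix2StructForConditionalGeneration.from_pretrained(model_id)

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, text=PROMPT, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True)


def parse_table(text):
    """Split DePlot-style output into rows of cells (assumed output layout)."""
    rows = [row.strip() for row in text.split("<0x0A>")]
    return [[cell.strip() for cell in row.split("|")] for row in rows if row]
```

Typical use would be `parse_table(chart_to_text("chart.png"))`, where `chart.png` is a placeholder path to a local chart image.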

Training data

Preprocessing

Train

The model was trained in a TPU environment.

  • num_warmup_steps : 1,000
  • num_training_steps : 40,000
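
To illustrate how these two values shape a learning-rate schedule, here is a small sketch. Only the step counts come from this card. The schedule shape (linear warmup followed by linear decay), the helper name `lr_scale`, and the absence of any peak learning rate are assumptions, since the card does not state them.

```python
NUM_WARMUP_STEPS = 1_000     # from this card
NUM_TRAINING_STEPS = 40_000  # from this card


def lr_scale(step, num_warmup_steps=NUM_WARMUP_STEPS,
             num_training_steps=NUM_TRAINING_STEPS):
    """Multiplier on the peak learning rate at a given step.

    Assumed shape: linear ramp from 0 to 1 during warmup, then linear
    decay back to 0 at the final training step.
    """
    if step < num_warmup_steps:
        return step / max(1, num_warmup_steps)
    remaining = num_training_steps - step
    return max(0.0, remaining / max(1, num_training_steps - num_warmup_steps))
```

Under this assumed shape the multiplier reaches 1.0 at step 1,000 and falls to 0.0 at step 40,000.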

Evaluation Results

This model achieves the following results:

| Metric | Score (%) |
|---|---|
| RNSS (Relative Number Set Similarity) | 99.5483 |
| RMS F1 (Relative Mapping Similarity) | 16.6401 |
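
For readers unfamiliar with RNSS, the sketch below follows the definition commonly given for it: with predicted numbers p_1..p_N and target numbers t_1..t_M, RNSS = 1 - (minimum total matching cost) / max(N, M), where the cost of pairing p with t is min(1, |p - t| / |t|). This is not the evaluation code used for this model. The handling of zero targets, the choice to let unmatched numbers add no cost, and the brute-force matching are assumptions made for illustration.

```python
from itertools import permutations


def relative_distance(pred, target):
    """Relative error of pred against target, capped at 1."""
    if target == 0:
        # Assumption: a zero target matches only an exactly zero prediction.
        return 0.0 if pred == 0 else 1.0
    return min(1.0, abs(pred - target) / abs(target))


def rnss(predicted, target):
    """Relative Number Set Similarity as a fraction in [0, 1]."""
    n, m = len(predicted), len(target)
    if max(n, m) == 0:
        return 1.0  # two empty sets are treated as identical
    cost = [[relative_distance(p, t) for t in target] for p in predicted]
    # Brute-force minimum-cost one-to-one matching; only viable for small sets.
    if n <= m:
        best = min(sum(cost[i][j] for i, j in enumerate(perm))
                   for perm in permutations(range(m), n))
    else:
        best = min(sum(cost[i][j] for j, i in enumerate(perm))
                   for perm in permutations(range(n), m))
    # Assumption: numbers left unmatched add no cost in this sketch.
    return 1.0 - best / max(n, m)
```

Multiply the result by 100 to compare with the percentages in the table. For tables of realistic size, an assignment solver such as `scipy.optimize.linear_sum_assignment` would replace the factorial-time search.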