jjmcarrascosa
/

vit_receipts_classifier

Image Classification

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

jjmcarrascosa commited on Sep 23, 2022

Commit

c1f238c

·

1 Parent(s): d26b16c

Update README.md

Files changed (1) hide show

README.md +6 -4

README.md CHANGED Viewed

@@ -6,13 +6,15 @@ tags:
 metrics:
 - f1
 model-index:
-- name: vit_tickers_binaryclf
   results: []
 ---
-# vit_tickers_binaryclf
-This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the cord, rvl-cdip, visual-genome and an external receipt tickers dataset, to carry out Binary Classification (`ticket` vs `no_ticket`).
 It achieves the following results on the evaluation set, which contain pictures from the above datasets in scanned, photography or mobile picture formats (color and grayscale):
 - Loss: 0.0116
@@ -20,7 +22,7 @@ It achieves the following results on the evaluation set, which contain pictures
 ## Model description
-This model is a Binary Classifier finetuned version of ViT, to predict if an input image is a picture / scan of ticket(s) o something else.
 ## Intended uses & limitations

 metrics:
 - f1
 model-index:
+- name: vit_receipts_classifier
   results: []
 ---
+# vit_receipts_classifier
+This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the cord, rvl-cdip, visual-genome and an external receipt dataset to carry out Binary Classification (`ticket` vs `no_ticket`).
+Ticket here is used as a synonym to "receipt".
 It achieves the following results on the evaluation set, which contain pictures from the above datasets in scanned, photography or mobile picture formats (color and grayscale):
 - Loss: 0.0116
 ## Model description
+This model is a Binary Classifier finetuned version of ViT, to predict if an input image is a picture / scan of receipts(s) o something else.
 ## Intended uses & limitations