INVERTO
/

bird-captioning-cub200

@@ -1,4 +1,16 @@
 # Bird Captioning and Classification Model (CUB-200-2011)
 This is a fine-tuned VisionEncoderDecoderModel based on `nlpconnect/vit-gpt2-image-captioning`, trained on the CUB-200-2011 dataset for bird species classification and image captioning.
@@ -13,7 +25,7 @@ This is a fine-tuned VisionEncoderDecoderModel based on `nlpconnect/vit-gpt2-ima
 - **Best Validation Loss**: 0.0690 (Epoch 3)
 ## Files
-- `pytorch_model.bin`: Trained model weights
 - `config.json`: Model configuration
 - `preprocessor_config.json`: ViTImageProcessor settings
 - `tokenizer_config.json`, `vocab.json`: GPT2 tokenizer files

+---
+language: en
+license: mit
+tags:
+  - vision
+  - image-captioning
+  - image-classification
+  - bird-species
+datasets:
+  - cub-200-2011
+---
 # Bird Captioning and Classification Model (CUB-200-2011)
 This is a fine-tuned VisionEncoderDecoderModel based on `nlpconnect/vit-gpt2-image-captioning`, trained on the CUB-200-2011 dataset for bird species classification and image captioning.
 - **Best Validation Loss**: 0.0690 (Epoch 3)
 ## Files
+- `model.safetensors`: Trained model weights
 - `config.json`: Model configuration
 - `preprocessor_config.json`: ViTImageProcessor settings
 - `tokenizer_config.json`, `vocab.json`: GPT2 tokenizer files