| title: FoodExtract-Vision Fine-tuned VLM Structured Data Extractor | |
| emoji: πβ‘οΈπ | |
| colorFrom: green | |
| colorTo: blue | |
| sdk: gradio | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| A fine-tuned SmolVLM2-500M model that extracts food and drink items from images. | |
| The input can be any image. The output is a formatted string. For an image that is not of food, it looks like this: | |
| ```json | |
| {'is_food': 0, 'image_title': '', 'food_items': [], 'drink_items': []} | |
| ``` | |
| Or for an image of food: | |
| ```json | |
| {'is_food': 1, 'image_title': 'fried calamari', 'food_items': ['fried calamari'], 'drink_items': []} | |
| ``` | |
| Note: This README.md was authored in a live tutorial recorded for YouTube (link coming soon). | |
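
The example outputs above use single quotes, so they read as Python dictionary literals rather than strict JSON, and `json.loads` would reject them as written. The following is a minimal sketch of one way to turn such a string into a dictionary. It assumes the model output matches the examples exactly; the helper name `parse_food_extract_output` and the `EXPECTED_KEYS` set are hypothetical and are not taken from this Space's `app.py`.

```python
import ast

# Keys taken from the example outputs in this README.
EXPECTED_KEYS = {"is_food", "image_title", "food_items", "drink_items"}


def parse_food_extract_output(raw: str) -> dict:
    """Parse an output string shaped like the README examples (hypothetical helper)."""
    # literal_eval only evaluates Python literals, so it is safer than eval().
    parsed = ast.literal_eval(raw.strip())
    if not isinstance(parsed, dict):
        raise ValueError("expected a dictionary-shaped output string")
    missing = EXPECTED_KEYS - set(parsed)
    if missing:
        raise ValueError(f"output is missing keys: {sorted(missing)}")
    return parsed


example = (
    "{'is_food': 1, 'image_title': 'fried calamari', "
    "'food_items': ['fried calamari'], 'drink_items': []}"
)
result = parse_food_extract_output(example)
print(result["food_items"])
```

If the model ever emits malformed text, `ast.literal_eval` raises `ValueError` or `SyntaxError`, so callers may want to catch both.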