Sleeping Agents FoodExtract-Vision Fine-tuned VLM Structued Data Extractor π Extract food and drink items from any image as structured JSON
jhkim3217/FoodExtract-Vision-SmolVLM2-500M-fine-tune-v1-VIDEO Image-Text-to-Text β’ 0.5B β’ Updated 23 days ago β’ 19