FinalTry / README.md
arshjeevs's picture
Upload folder using huggingface_hub
1910736 verified
metadata
language:
  - en
license: apache-2.0
library_name: llama.cpp
tags:
  - gguf
  - llama.cpp
  - multimodal
  - vision-language-model
  - smolvlm
  - cytology
  - medical-imaging
pipeline_tag: image-text-to-text
base_model:
  - HuggingFaceTB/SmolVLM-500M-Instruct

SmolVLM Cytology GGUF

Fine-tuned SmolVLM multimodal model for cytology image analysis.

Files

  • SmolVLM-Cytology-Q4_K_M.gguf
  • mmproj-SmolVLM-Cytology-f16.gguf

Usage

llama-mtmd-cli \
  -m SmolVLM-Cytology-Q4_K_M.gguf \
  --mmproj mmproj-SmolVLM-Cytology-f16.gguf \
  --image test.png \
  -p "<image> Describe this image"

Notes

  • Quantized using llama.cpp
  • Compatible with llama-mtmd-cli
  • Vision encoder exported separately as mmproj GGUF