File size: 727 Bytes
1910736
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
language:
- en

license: apache-2.0

library_name: llama.cpp

tags:
- gguf
- llama.cpp
- multimodal
- vision-language-model
- smolvlm
- cytology
- medical-imaging

pipeline_tag: image-text-to-text

base_model:
- HuggingFaceTB/SmolVLM-500M-Instruct

---

# SmolVLM Cytology GGUF

Fine-tuned SmolVLM multimodal model for cytology image analysis.

## Files

- SmolVLM-Cytology-Q4_K_M.gguf
- mmproj-SmolVLM-Cytology-f16.gguf

## Usage

```bash
llama-mtmd-cli \
  -m SmolVLM-Cytology-Q4_K_M.gguf \
  --mmproj mmproj-SmolVLM-Cytology-f16.gguf \
  --image test.png \
  -p "<image> Describe this image"
```

## Notes

- Quantized using llama.cpp
- Compatible with llama-mtmd-cli
- Vision encoder exported separately as mmproj GGUF