Skywork/Skywork-R1V3-38B-GGUF


catpp committed on Jul 15, 2025

Commit ffcd0ee · verified · 1 Parent(s): 5efff8c

Upload README.md

Files changed (1)

README.md +36 -0

README.md ADDED

	@@ -0,0 +1,36 @@

+---
+license: mit
+---
+# 🧠 Skywork-R1V3-38B - GGUF Quantized
+This repository provides a GGUF-quantized version of the [Skywork-R1V3-38B](https://huggingface.co/Skywork/Skywork-R1V3-38B) model, converted using the `master` branch of [llama.cpp](https://github.com/ggerganov/llama.cpp) (the latest at the time of conversion). It is intended for **fast, memory-efficient local inference** on CPU or GPU.
+## 💻 How to Use
+Start the `llama-server` binary from [`llama.cpp`](https://github.com/ggerganov/llama.cpp), passing the quantized model and its multimodal projector file (`--mmproj`):
+```bash
+./llama-server -m /path/to/Skywork-R1V3-38B-Q8_0.gguf --mmproj /path/to/mmproj-Skywork-R1V3-38B-f16.gguf --port 8080
+```
+Once the server is running, you can query its OpenAI-compatible chat completions endpoint with any HTTP client, such as curl:
+```bash
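+# Encode the image as single-line base64 (`-w 0` disables wrapping in GNU coreutils; other base64 implementations may use a different flag)
+BASE64_IMAGE=$(base64 -w 0 /path/to/image)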
+curl -X POST http://localhost:8080/v1/chat/completions \
+  -H "Content-Type: application/json" \
+  -d '{
+    "model": "Skywork-R1V3",
+    "messages": [
+      {
+        "role": "user",
+        "content": [
+          {"type": "text", "text": "Please describe this image."},
+          {"type": "image_url", "image_url": {"url": "data:image/jpeg;base64,'"${BASE64_IMAGE}"'" }}
+        ]
+      }
+    ],
+    "temperature": 0.7,
+    "max_tokens": 512
+  }'
+```
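
The curl request in the README above labels every image as `image/jpeg` and relies on the GNU-specific `base64 -w 0` flag. The following is a minimal sketch of two helpers that build the data URL more portably. The function names `mime_for` and `data_url_for` are hypothetical and are not part of the README or of llama.cpp, and whether the server inspects the declared MIME type at all is an assumption left unverified here.

```shell
# Hypothetical helpers; names are illustrative, not from the README or llama.cpp.

# Map a file extension to an image MIME type.
# Falls back to image/jpeg, the type the README's example hardcodes.
mime_for() {
  local ext
  # Lowercase the extension so "photo.PNG" and "photo.png" behave the same.
  ext=$(printf '%s' "${1##*.}" | tr '[:upper:]' '[:lower:]')
  case "$ext" in
    png)      echo "image/png" ;;
    gif)      echo "image/gif" ;;
    jpg|jpeg) echo "image/jpeg" ;;
    *)        echo "image/jpeg" ;;
  esac
}

# Print a data URL for the image file given as the first argument.
# Piping through tr -d '\n' removes any line wrapping, so the result does not
# depend on the GNU-only `-w 0` flag.
data_url_for() {
  printf 'data:%s;base64,%s' "$(mime_for "$1")" "$(base64 < "$1" | tr -d '\n')"
}
```

With these defined, `$(data_url_for /path/to/image)` could stand in for the `data:image/jpeg;base64,${BASE64_IMAGE}` string in the request body.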