Update README.md
README.md
CHANGED
@@ -78,14 +78,14 @@ IDEFICS-2 exhibits strong performance for a model of its size (8B parameters) wh
 | [LLaVa-NeXT-34B](https://huggingface.co/liuhaotian/llava-v1.6-34b) | ✅ | 34B | 2880 | 51.1/44.7 | 46.5 | 69.5 | 79.3 | 83.7 | - | - |
 | MM1-Chat-7B | ❌ | 7B | 720 | 37.0/35.6 | 35.9 | 72.8 | 72.3 | - | - |
 | MM1-Chat-30B | ❌ | 30B | 720 | 44.7/40.3 | 39.4 | 73.5 | 75.1 | 83.7 | |
-| Gemini 1.0 Pro | ❌ |
-| Gemini 1.5 Pro | ❌ |
-| Claude 3 Haiku | ❌ |
+| Gemini 1.0 Pro | ❌ | 🤷‍♂️ | 🤷‍♂️ | 47.9/- | 45.2 | 74.6 | - | 71.2 | 88.1 |
+| Gemini 1.5 Pro | ❌ | 🤷‍♂️ | 🤷‍♂️ | 58.5/- | 52.1 | 73.5 | - | 73.2 | 86.5 |
+| Claude 3 Haiku | ❌ | 🤷‍♂️ | 🤷‍♂️ | 50.2/- | 46.4 | - | - | - | 88.8 |
 | | | | | | | |
-| [
+| [IDEFICS-1 instruct](https://huggingface.co/HuggingFaceM4/idefics-80b-instruct) (32-shots) | ✅ | 80B | - | - | - | 39.3 | - | 68.8 | - |
 | | | | | | | |
-| **
-| **
+| **IDEFICS-2** (w/o im. split) | ✅ | 8B | 64 | 43.5/37.9 | 51.6 | 70.4 | 76.8 | 80.8 | 67.3 |
+| **IDEFICS-2** (w/ im. split) | ✅ | 8B | 320 | 43.0/37.7 | 51.4 | 73.0 | 76.7 | 81.2 | 74.0 |
 
 </details>
 