Image-Text-to-Text
Transformers
Safetensors
llava_phi
text-generation
multimodal
vision-language-model
llava
instruction-tuned
phi-4
vqa
conversational
custom_code
Instructions to use aimagelab/LLaVA_MORE-phi_4-finetuning with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use aimagelab/LLaVA_MORE-phi_4-finetuning with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="aimagelab/LLaVA_MORE-phi_4-finetuning", trust_remote_code=True) messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("aimagelab/LLaVA_MORE-phi_4-finetuning", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use aimagelab/LLaVA_MORE-phi_4-finetuning with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "aimagelab/LLaVA_MORE-phi_4-finetuning" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "aimagelab/LLaVA_MORE-phi_4-finetuning", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/aimagelab/LLaVA_MORE-phi_4-finetuning
- SGLang
How to use aimagelab/LLaVA_MORE-phi_4-finetuning with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "aimagelab/LLaVA_MORE-phi_4-finetuning" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "aimagelab/LLaVA_MORE-phi_4-finetuning", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "aimagelab/LLaVA_MORE-phi_4-finetuning" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "aimagelab/LLaVA_MORE-phi_4-finetuning", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use aimagelab/LLaVA_MORE-phi_4-finetuning with Docker Model Runner:
docker model run hf.co/aimagelab/LLaVA_MORE-phi_4-finetuning
Improve model card for LLaVA_MORE-phi_4-finetuning
#1
by nielsr HF Staff - opened
This PR significantly improves the model card for aimagelab/LLaVA_MORE-phi_4-finetuning by:
- Adding detailed metadata including the
license,pipeline_tag,base_model, and relevanttagsfor better discoverability and functionality on the Hub. - Populating the model card content with a comprehensive description derived from the paper abstract and GitHub README.
- Providing direct links to the paper, the GitHub repository, the project website, the Hugging Face model collection, and a demo space.
- Including detailed performance benchmarks, training data and procedure, acknowledgments, and academic citation.
- Adding a robust "How to Get Started" section with a runnable Python code example that demonstrates model inference including necessary image preprocessing steps.
This update will make the model much more accessible, informative, and usable for the Hugging Face community.
fede97 changed pull request status to merged