Add comprehensive model card for Osprey

by nielsr HF Staff - opened Sep 9, 2025

←

This PR adds a comprehensive model card for the Osprey model, significantly improving its documentation on the Hugging Face Hub.

Key improvements include:

Linking the model to its official paper: Osprey: Pixel Understanding with Visual Instruction Tuning.
Including the paper's abstract for quick understanding.
Adding pipeline_tag: image-text-to-text to enable discoverability on the Hub.
Specifying library_name: transformers based on the LlavaLlamaForCausalLM architecture found in config.json, integrating it with the Hugging Face transformers library ecosystem.
Including a link to the official GitHub repository for code access and further details.
Incorporating a detailed introduction, core features, and the complete "Try Our Demo" section (online and offline demo setup) directly from the original GitHub repository to provide robust usage instructions.
All relevant sections from the GitHub README have been adapted to the model card for a holistic view.

Please review and merge this PR to enhance the model's documentation on the Hugging Face Hub.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment