Image-Text-to-Text
Transformers.js
ONNX
florence2
vision
text-generation
text2text-generation
image-to-text
Instructions to use onnx-community/Florence-2-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers.js
How to use onnx-community/Florence-2-base with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('image-text-to-text', 'onnx-community/Florence-2-base');
optimization for mobile
#7
by vlad-m-dev - opened
Hello. My task is to integrate the most optimal mobile model (in terms of speed and quality) that can generate descriptive text (caption) from a photo.
I have implemented the solution from this repository, but I was not satisfied with the speed.
You have a lot of experience in this field. Could you please tell me what you would do in my place to solve this problem? Which model and format would you choose, and how would you integrate it on mobile device?
I would greatly appreciate any recommendations for solving my task!