Commit da2a069
Parent(s): 210def2
fix: added
- README.md +1 -1
- docker-compose.yml +1 -2
- requirements.txt +1 -1
- src/model.py +1 -0
README.md
CHANGED
@@ -10,4 +10,4 @@ pinned: false
 
 Image captioning API using `microsoft/Florence-2-base` with a Python FastAPI backend. Open `/docs` for Swagger UI.
 
-Speed tuning env vars: `DEFAULT_MAX_TOKENS` (default `64`), `MAX_IMAGE_SIDE` (default `896`), `MAX_MAX_TOKENS` (default `256`), `MODEL_ID` (default `microsoft/Florence-2-base`), `MODEL_REVISION` (
+Speed tuning env vars: `DEFAULT_MAX_TOKENS` (default `64`), `MAX_IMAGE_SIDE` (default `896`), `MAX_MAX_TOKENS` (default `256`), `MODEL_ID` (default `microsoft/Florence-2-base`), `MODEL_REVISION` (pin to a commit SHA, e.g. `5ca5edf5bd017b9919c05d08aebef5e4c7ac3bac`).
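For reviewers who want to see how these variables might be consumed, here is a minimal sketch. The variable names and defaults come from the README line above; the `Settings` class, `load_settings`, and `clamp_max_tokens` are hypothetical names, and the clamping rule (fall back to the default, cap at `MAX_MAX_TOKENS`) is an assumption about the intended behavior, not the repository's actual code.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    # Hypothetical container; field names mirror the README env vars.
    default_max_tokens: int
    max_image_side: int
    max_max_tokens: int
    model_id: str
    model_revision: Optional[str]


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the tuning variables, falling back to the README defaults."""
    revision = env.get("MODEL_REVISION", "").strip()
    return Settings(
        default_max_tokens=int(env.get("DEFAULT_MAX_TOKENS", "64")),
        max_image_side=int(env.get("MAX_IMAGE_SIDE", "896")),
        max_max_tokens=int(env.get("MAX_MAX_TOKENS", "256")),
        model_id=env.get("MODEL_ID", "microsoft/Florence-2-base"),
        # Assumption: an unset or empty MODEL_REVISION means "no pin".
        model_revision=revision or None,
    )


def clamp_max_tokens(requested: Optional[int], settings: Settings) -> int:
    """Assumed rule: use the default when unset, never exceed the hard cap."""
    value = settings.default_max_tokens if requested is None else requested
    return max(1, min(value, settings.max_max_tokens))
```

Passing a plain dict instead of `os.environ` keeps the function easy to exercise in tests, for example `load_settings({"MAX_MAX_TOKENS": "128"})`.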
docker-compose.yml
CHANGED
@@ -9,6 +9,5 @@ services:
       - MAX_IMAGE_SIDE=896
       - MAX_MAX_TOKENS=256
       - MODEL_ID=microsoft/Florence-2-base
-
-      # - MODEL_REVISION=<commit_sha>
+      - MODEL_REVISION=5ca5edf5bd017b9919c05d08aebef5e4c7ac3bac
     restart: unless-stopped
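Since the compose file now hard-codes a revision, a startup check that the value really is a full commit SHA may be worth considering: branch names and tags can move, while a full SHA cannot. The sketch below is an illustration only; `is_full_commit_sha` is a hypothetical helper and nothing in this commit shows such a check.

```python
import re

# A full Git commit SHA-1 is exactly 40 lowercase hexadecimal characters.
_FULL_SHA = re.compile(r"[0-9a-f]{40}")


def is_full_commit_sha(revision: str) -> bool:
    """Return True only for a complete 40-character hex commit hash."""
    return _FULL_SHA.fullmatch(revision) is not None
```

With this helper, the placeholder `<commit_sha>` from the old compose file and a short hash such as `da2a069` would both be rejected, while the value pinned in this commit passes.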
requirements.txt
CHANGED
@@ -1,6 +1,6 @@
 fastapi==0.116.1
 uvicorn[standard]==0.35.0
-transformers==4.
+transformers==4.49.0
 torch==2.8.0
 pillow==11.3.0
 python-multipart==0.0.20
src/model.py
CHANGED
@@ -69,6 +69,7 @@ def load_model() -> tuple[Any, Any]:
     _model = AutoModelForCausalLM.from_pretrained(
         MODEL_ID,
         torch_dtype=_dtype,
+        attn_implementation="eager",
         **pretrained_kwargs,
     ).to(_device)
     _model.eval()
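The hunk does not show how `pretrained_kwargs` is built, so the following is only a guess at how the pinned revision could reach `from_pretrained` alongside the new `attn_implementation="eager"` argument, which selects the plain PyTorch attention path rather than SDPA or FlashAttention. `build_pretrained_kwargs` and `load_model_sketch` are hypothetical names, and the `trust_remote_code=True` entry is an assumption based on Florence-2 shipping its modeling code in the model repository.

```python
from typing import Any, Optional


def build_pretrained_kwargs(revision: Optional[str] = None) -> dict[str, Any]:
    """Assumed contents of pretrained_kwargs; the real dict is not in the diff."""
    # Assumption: Florence-2 loads custom modeling code from the Hub repo.
    kwargs: dict[str, Any] = {"trust_remote_code": True}
    if revision:
        # Pin weights and remote code to one commit of the model repo.
        kwargs["revision"] = revision
    return kwargs


def load_model_sketch(model_id: str, revision: Optional[str] = None):
    # Imported lazily so the helper above works without torch/transformers.
    import torch
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float32,
        attn_implementation="eager",
        **build_pretrained_kwargs(revision),
    )
    model.eval()
    return model
```

Calling `load_model_sketch("microsoft/Florence-2-base", "5ca5edf5bd017b9919c05d08aebef5e4c7ac3bac")` would download the model, so only the kwargs helper is cheap to unit-test.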