khushalcodiste commited on
Commit
da2a069
·
1 Parent(s): 210def2

fix: added

Browse files
Files changed (4) hide show
  1. README.md +1 -1
  2. docker-compose.yml +1 -2
  3. requirements.txt +1 -1
  4. src/model.py +1 -0
README.md CHANGED
@@ -10,4 +10,4 @@ pinned: false
10
 
11
  Image captioning API using `microsoft/Florence-2-base` with a Python FastAPI backend. Open `/docs` for Swagger UI.
12
 
13
- Speed tuning env vars: `DEFAULT_MAX_TOKENS` (default `64`), `MAX_IMAGE_SIDE` (default `896`), `MAX_MAX_TOKENS` (default `256`), `MODEL_ID` (default `microsoft/Florence-2-base`), `MODEL_REVISION` (optional commit SHA to pin remote model code).
 
10
 
11
  Image captioning API using `microsoft/Florence-2-base` with a Python FastAPI backend. Open `/docs` for Swagger UI.
12
 
13
+ Speed tuning env vars: `DEFAULT_MAX_TOKENS` (default `64`), `MAX_IMAGE_SIDE` (default `896`), `MAX_MAX_TOKENS` (default `256`), `MODEL_ID` (default `microsoft/Florence-2-base`), `MODEL_REVISION` (pin to a commit SHA, e.g. `5ca5edf5bd017b9919c05d08aebef5e4c7ac3bac`).
docker-compose.yml CHANGED
@@ -9,6 +9,5 @@ services:
9
  - MAX_IMAGE_SIDE=896
10
  - MAX_MAX_TOKENS=256
11
  - MODEL_ID=microsoft/Florence-2-base
12
- # Optional: pin to a specific commit SHA from huggingface.co/microsoft/Florence-2-base
13
- # - MODEL_REVISION=<commit_sha>
14
  restart: unless-stopped
 
9
  - MAX_IMAGE_SIDE=896
10
  - MAX_MAX_TOKENS=256
11
  - MODEL_ID=microsoft/Florence-2-base
12
+ - MODEL_REVISION=5ca5edf5bd017b9919c05d08aebef5e4c7ac3bac
 
13
  restart: unless-stopped
requirements.txt CHANGED
@@ -1,6 +1,6 @@
1
  fastapi==0.116.1
2
  uvicorn[standard]==0.35.0
3
- transformers==4.55.4
4
  torch==2.8.0
5
  pillow==11.3.0
6
  python-multipart==0.0.20
 
1
  fastapi==0.116.1
2
  uvicorn[standard]==0.35.0
3
+ transformers==4.49.0
4
  torch==2.8.0
5
  pillow==11.3.0
6
  python-multipart==0.0.20
src/model.py CHANGED
@@ -69,6 +69,7 @@ def load_model() -> tuple[Any, Any]:
69
  _model = AutoModelForCausalLM.from_pretrained(
70
  MODEL_ID,
71
  torch_dtype=_dtype,
 
72
  **pretrained_kwargs,
73
  ).to(_device)
74
  _model.eval()
 
69
  _model = AutoModelForCausalLM.from_pretrained(
70
  MODEL_ID,
71
  torch_dtype=_dtype,
72
+ attn_implementation="eager",
73
  **pretrained_kwargs,
74
  ).to(_device)
75
  _model.eval()