Spaces: Perth0603/phishwatch-proxy

Perth0603 committed on Sep 25, 2025

Commit 9265bd4 (verified) · 1 parent: 0b9dff4

giid

Files changed (3)

README.md CHANGED

@@ -1,12 +1,28 @@
----
-title: Phishwatch Proxy
-emoji: 📊
-colorFrom: green
-colorTo: blue
-sdk: gradio
-sdk_version: 5.47.0
-app_file: app.py
-pinned: false
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# Hugging Face Space - Phishing Text Classifier (FastAPI)
+This Space exposes a minimal `/predict` endpoint for your MobileBERT phishing model so the Flutter app can call it reliably.
+## Files
+- app.py - FastAPI app that loads the model and returns `{ label, score }`.
+- requirements.txt - Python dependencies.
+## How to deploy
+1. Create a new Space on Hugging Face (type: FastAPI).
+2. Upload the contents of this `hf_space/` folder to the Space root.
+3. In Space Settings → Variables, add:
+   - MODEL_ID = Perth0603/phishing-email-mobilebert
+4. Wait for the Space to finish building and show as running, then test:
+   - GET `/` should return `{"status": "ok", "model": "..."}`
+   - POST `/predict` with `{"inputs": "Win an iPhone! Click here"}` should return `{"label": "...", "score": ...}`
+## Flutter app config
+Set the Space URL in `hf.env.json` so the app targets the Space instead of the Hosted Inference API:
+```
+{"HF_SPACE_URL":"https://<your-space>.hf.space"}
+```
+Run the app:
+```
+flutter run --dart-define-from-file=hf.env.json
+```
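
The test in step 4 of the README can also be scripted. The following is a minimal sketch using only the Python standard library, assuming the Space accepts the JSON body `{"inputs": ...}` on `/predict` as described above. `SPACE_URL`, `build_predict_request`, and `predict` are hypothetical names introduced for illustration and are not part of this commit.

```python
import json
import urllib.request

# Hypothetical placeholder; substitute the real Space URL before calling predict().
SPACE_URL = "https://<your-space>.hf.space"


def build_predict_request(base_url: str, text: str) -> urllib.request.Request:
    """Build a POST /predict request carrying the JSON body the README describes."""
    body = json.dumps({"inputs": text}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/predict",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def predict(base_url: str, text: str, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON reply (expected keys: label, score)."""
    request = build_predict_request(base_url, text)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (needs network access and a running Space):
# print(predict(SPACE_URL, "Win an iPhone! Click here"))
```

The generous default timeout is a guess: because `app.py` loads the model lazily, the first `/predict` call is likely slower than later ones.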

app.py ADDED

+from fastapi import FastAPI
+from pydantic import BaseModel
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import torch
+import os
+MODEL_ID = os.environ.get("MODEL_ID", "Perth0603/phishing-email-mobilebert")
+app = FastAPI(title="Phishing Text Classifier", version="1.0.0")
+class PredictPayload(BaseModel):
+    inputs: str
+# Lazy singletons for model/tokenizer
+_tokenizer = None
+_model = None
+def _load_model():
+    global _tokenizer, _model
+    if _tokenizer is None or _model is None:
+        _tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
+        _model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID)
+        # Warm-up
+        with torch.no_grad():
+            _ = _model(**_tokenizer(["warm up"], return_tensors="pt")).logits
+@app.get("/")
+def root():
+    return {"status": "ok", "model": MODEL_ID}
+@app.post("/predict")
+def predict(payload: PredictPayload):
+    _load_model()
+    # Truncate to the tokenizer's configured maximum length so long inputs do not overflow the model
+    encoded = _tokenizer([payload.inputs], return_tensors="pt", truncation=True)
+    with torch.no_grad():
+        logits = _model(**encoded).logits
+        # Softmax over the class dimension, then take the most probable class
+        probs = torch.softmax(logits, dim=-1)[0]
+        score, idx = torch.max(probs, dim=0)
+    # Map common ids to labels (kept generic; your config also has these)
+    id2label = {0: "LEGIT", 1: "PHISH"}
+    label = id2label.get(int(idx), str(int(idx)))
+    return {"label": label, "score": float(score)}
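
The post-processing in `predict` (softmax over the logits, pick the highest probability, map the index to a label) can be shown without loading the model. This is a pure-Python sketch of that step, assuming the same generic two-class mapping as `app.py`. `ID2LABEL`, `softmax`, and `to_prediction` are hypothetical names used only here, and the logits are made-up inputs, not model output.

```python
import math

# Generic mapping copied from app.py; the real label names depend on the model config.
ID2LABEL = {0: "LEGIT", 1: "PHISH"}


def softmax(logits: list[float]) -> list[float]:
    """Convert raw logits to probabilities, subtracting the max for numerical stability."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def to_prediction(logits: list[float]) -> dict:
    """Pick the most probable class and report its label and probability."""
    probs = softmax(logits)
    idx = max(range(len(probs)), key=probs.__getitem__)
    # Unknown indices fall back to their string form, mirroring app.py
    label = ID2LABEL.get(idx, str(idx))
    return {"label": label, "score": probs[idx]}
```

For example, `to_prediction([0.0, 2.0])` selects index 1 (`PHISH`) with a score of about 0.88. The score is the probability of the winning class only, so a `LEGIT` result with score 0.9 means a 0.1 probability of `PHISH`.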

requirements.txt ADDED

+fastapi==0.115.0
+uvicorn==0.30.6
+transformers==4.46.3
+torch>=2.0.0