---
title: Med Gemma
emoji: 🏥
colorFrom: red
colorTo: red
sdk: docker
app_port: 8501
tags:
  - streamlit
  - medical
  - chatbot
pinned: false
short_description: Med-Gemma Medical Assistant Chat Interface
---

πŸ₯ Med-Gemma Medical Assistant

A Streamlit chat interface for interacting with the Med-Gemma medical language model deployed on HuggingFace Inference Endpoints.

## 🚀 Deployment to HuggingFace Spaces

### Step 1: Configure Secrets

1. Go to your Space settings on HuggingFace
2. Navigate to **Settings → Variables and secrets**
3. Add these two secrets:
   - Name: `HF_TOKEN` | Value: your HuggingFace API token
   - Name: `INFERENCE_ENDPOINT` | Value: your inference endpoint URL (e.g., `https://xxx.endpoints.huggingface.cloud`)

### Step 2: Get Your Credentials

**HuggingFace API Token:**

1. Go to HuggingFace **Settings → Access Tokens**
2. Click "New token", give it a name, and select "read" permissions
3. Copy the token

**Inference Endpoint URL:**

1. Go to HuggingFace Inference Endpoints
2. Find your Med-Gemma endpoint (it must be in the "Running" state)
3. Copy the endpoint URL
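Before wiring the credentials into the app, it can help to sanity-check the URL and headers you will send. A minimal Python sketch (the `/v1/chat/completions` path assumes a vLLM endpoint exposing the OpenAI-compatible API, as this app uses; `build_chat_url` and `build_auth_headers` are hypothetical helper names, not part of the app):

```python
def build_chat_url(endpoint: str) -> str:
    """Join the endpoint base URL with the OpenAI-compatible chat path."""
    return endpoint.rstrip("/") + "/v1/chat/completions"

def build_auth_headers(token: str) -> dict:
    """HuggingFace Inference Endpoints expect a Bearer token."""
    return {"Authorization": f"Bearer {token}", "Content-Type": "application/json"}

# Example with placeholder values:
url = build_chat_url("https://xxx.endpoints.huggingface.cloud/")
headers = build_auth_headers("hf_your_token_here")
print(url)  # https://xxx.endpoints.huggingface.cloud/v1/chat/completions
```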

### Step 3: Deploy

1. Push your code to the HuggingFace Space repository
2. The Space will build and deploy automatically
3. Once ready, users can start chatting immediately; no configuration is needed!

## 🛠️ Local Development

### Setup

1. Create and activate a virtual environment:

   ```bash
   python -m venv venv
   # Windows
   venv\Scripts\activate
   # macOS/Linux
   source venv/bin/activate
   ```

2. Install dependencies:

   ```bash
   pip install -r requirements.txt
   ```

3. Configure credentials by creating a `.env` file in the project root:

   ```
   HF_TOKEN=your_token_here
   INFERENCE_ENDPOINT=your_endpoint_url_here
   ```

4. Run the app:

   ```bash
   streamlit run src/streamlit_app.py
   ```

   The app will open at http://localhost:8501.
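The credential lookup described in step 3 can be sketched like this (a non-authoritative sketch: it assumes the `python-dotenv` package is installed locally; on HuggingFace Spaces, secrets arrive as plain environment variables, so the `.env` load is skipped):

```python
import os

try:
    # python-dotenv reads KEY=value pairs from a local .env file into
    # os.environ; this is only needed for local development.
    from dotenv import load_dotenv
    load_dotenv()
except ImportError:
    pass  # on HuggingFace Spaces, secrets are already injected as env vars

HF_TOKEN = os.environ.get("HF_TOKEN", "")
INFERENCE_ENDPOINT = os.environ.get("INFERENCE_ENDPOINT", "")

# Warn early if either credential is missing rather than failing mid-chat.
missing = [name for name in ("HF_TOKEN", "INFERENCE_ENDPOINT")
           if not os.environ.get(name)]
if missing:
    print("Missing credentials:", ", ".join(missing))
```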

πŸ“ Features

  • πŸ’¬ Real-time chat interface with Med-Gemma
  • βš™οΈ Adjustable model parameters (temperature, top_p, max tokens)
  • πŸ“ Chat history (persists during session)
  • πŸ—‘οΈ Clear chat history button
  • πŸ”’ Secure credential management via environment variables
  • βœ… OpenAI-compatible API format for vLLM endpoints
  • 🎨 Clean, professional UI
  • πŸš€ Docker-ready for HuggingFace Spaces

## ⚙️ Model Parameters

- **Max Tokens**: maximum length of the generated response (50-2048)
- **Temperature**: controls randomness (0.0 = deterministic, 2.0 = very random)
- **Top P**: controls diversity via nucleus sampling (0.0-1.0)
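These parameters map directly onto an OpenAI-style request body. A hedged sketch of how such a payload might be assembled (`build_payload` is a hypothetical helper, and the default values shown are assumptions, not the app's actual defaults):

```python
def build_payload(prompt: str, max_tokens: int = 512,
                  temperature: float = 0.7, top_p: float = 0.9) -> dict:
    """Build an OpenAI-compatible chat request, clamping each parameter
    to the ranges exposed in the sidebar."""
    max_tokens = max(50, min(2048, max_tokens))
    temperature = max(0.0, min(2.0, temperature))
    top_p = max(0.0, min(1.0, top_p))
    return {
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": temperature,
        "top_p": top_p,
    }

payload = build_payload("What are common symptoms of dehydration?", temperature=3.5)
print(payload["temperature"])  # clamped to 2.0
```

Clamping out-of-range values keeps the request valid even if the UI state ever drifts outside the documented bounds.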

## 🔒 Security

- ✅ The `.env` file is listed in `.gitignore` and is never committed
- ✅ Use HuggingFace Spaces secrets for production
- ✅ The local `.env` file is for development only
- ⚠️ Never share your HuggingFace API tokens

## 📚 Additional Resources