πŸš€ Deployment Guide for Fara-7B Space

Quick Start

Your Hugging Face Space is ready to deploy! Follow these steps:

Step 1: Create a New Space on Hugging Face

  1. Go to huggingface.co/new-space
  2. Fill in the details:
    • Space name: fara-7b-chat (or any name you prefer)
    • License: MIT
    • Select SDK: Gradio
    • Space hardware: CPU Basic (free) - this is fine since we're using the Inference API
    • Visibility: Public or Private (your choice)
  3. Click Create Space

Step 2: Upload Your Files

You have two options:

Option A: Git Upload (Recommended)

```bash
# Navigate to your space folder
cd "c:/Users/Amir/OneDrive - Digital Health CRC Limited/Projects/url2md/fara-7b-space"

# Initialize git repository
git init
git add .
git commit -m "Initial commit: Fara-7B chat interface"

# Add your Hugging Face Space as remote
# Replace YOUR_USERNAME and YOUR_SPACE_NAME
git remote add origin https://huggingface.co/spaces/YOUR_USERNAME/YOUR_SPACE_NAME

# When prompted for credentials, use your HF username and an access token as the password
git push -u origin main
```

Option B: Web Upload

  1. In your newly created Space, click Files β†’ Add file β†’ Upload files
  2. Drag and drop these files:
    • app.py
    • requirements.txt
    • README.md
    • .gitignore
  3. Click Commit changes to main

Step 3: Add Your HuggingFace Token as a Secret

This is CRITICAL - the app won't work without this:

  1. In your Space, go to Settings (gear icon)
  2. Scroll to Variables and secrets
  3. Click New secret
  4. Enter:
    • Name: HF_TOKEN
    • Value: Your Hugging Face token
    • ⚠️ Make sure this is marked as Secret (not a variable)
  5. Click Save
  6. The Space will automatically rebuild
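Secrets are injected into the Space as environment variables, so app.py can read the token with `os.environ`. A minimal sketch of that startup check (the function name is illustrative, not taken from app.py):

```python
import os

def get_hf_token(env=os.environ):
    """Return the HF_TOKEN secret, raising a clear error when it is missing."""
    token = env.get("HF_TOKEN")
    if not token:
        raise RuntimeError(
            "HF_TOKEN not found - add it as a Secret under "
            "Settings -> Variables and secrets, then restart the Space"
        )
    return token
```

Unlike plain variables, secrets are kept private and never shown in the Space's UI, which is why the token must go in as a Secret.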

Step 4: Wait for Build

  • The Space will install dependencies and start
  • This usually takes 1-3 minutes
  • Watch the Logs tab to see progress
  • Once you see "Running on local URL", it's ready!

Step 5: Test Your Space

  1. Go to the App tab
  2. Try a test message: "Help me find a good coffee shop"
  3. You should see Fara-7B respond!

Troubleshooting

Error: "HF_TOKEN not found"

  • Make sure you added the token as a Secret, not a Variable
  • Restart the Space after adding the secret

Error: "Model not found"

  • Check if microsoft/Fara-7B is publicly available
  • Ensure your token has inference permissions

Error: "Rate limit exceeded"

  • You're using the free inference tier
  • Wait a few minutes and try again
  • Consider upgrading to Inference Endpoints for production use
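If you call the Inference API from your own scripts, rate-limit errors can be smoothed over with exponential backoff rather than manual waiting. A generic sketch (the helper name, attempt count, and delays are illustrative):

```python
import time

def with_retries(call, max_attempts=4, base_delay=1.0, sleep=time.sleep):
    """Run a zero-argument callable, retrying with exponential backoff.

    Useful around API calls that may raise on rate limiting (HTTP 429).
    """
    for attempt in range(max_attempts):
        try:
            return call()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts - surface the original error
            sleep(base_delay * (2 ** attempt))  # wait 1s, 2s, 4s, ...
```

In production you would catch only the specific rate-limit exception your client library raises, rather than bare `Exception`.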

Space is slow

  • The free CPU tier is sufficient since inference happens on HF servers
  • Response time depends on model inference, not your Space hardware

Optional Enhancements

1. Request GPU Hardware

If you want faster Space loading (not needed for inference):

  • Settings β†’ Hardware β†’ Select a GPU tier
  • Note: This costs money, and the Inference API is billed separately

2. Add Custom Examples

Edit app.py and add example buttons:

```python
gr.Examples(
    examples=[
        "Find Italian restaurants in Seattle",
        "Help me search for running shoes",
        "What's the process to book a hotel?",
    ],
    inputs=msg,  # msg is the chat input Textbox defined in app.py
)
```

3. Enable Analytics

  • Settings β†’ Enable visitor analytics
  • Track usage of your Space

Cost Breakdown

  • Space hosting: FREE (CPU Basic tier)
  • Inference API:
    • Free tier: Limited requests/day
    • Pro account: Higher request limits
    • Inference Endpoints: ~$0.60-1.00/hour for dedicated
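For budgeting a dedicated endpoint, the hourly rate above translates to a monthly cost roughly as follows (rates are this guide's estimates; check current Hugging Face pricing):

```python
def monthly_cost(hourly_rate, hours_per_day=24, days=30):
    """Rough monthly cost of an always-on Inference Endpoint."""
    return round(hourly_rate * hours_per_day * days, 2)

# At the guide's $0.60-1.00/hour range, an always-on endpoint
# comes to roughly $432-$720 per month.
```

Pausing the endpoint when idle reduces this proportionally, since billing is per hour of uptime.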

Next Steps

Once deployed, you can:

  1. Share your Space URL with others
  2. Embed it in websites
  3. Use the API endpoint programmatically
  4. Duplicate and customize for other models
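For embedding or programmatic calls you need the Space's direct public URL. Public Spaces are served from an `hf.space` subdomain; the exact normalization sketched here (owner and name joined by a dash, lowercased) is an assumption worth confirming in your Space's Embed dialog:

```python
def space_url(owner, space_name):
    """Direct URL of a public Space, assuming the hf.space subdomain scheme."""
    subdomain = f"{owner}-{space_name}".lower()
    return f"https://{subdomain}.hf.space"

# e.g. space_url("alice", "fara-7b-chat") -> "https://alice-fara-7b-chat.hf.space"
```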

Need Help?

Check the logs in your Space for detailed error messages, or refer to the Hugging Face Spaces documentation.

Your Space folder: c:/Users/Amir/OneDrive - Digital Health CRC Limited/Projects/url2md/fara-7b-space

Happy deploying! πŸš€