kkkai123456 committed
Commit 1ca85db · verified · 1 Parent(s): 434a1b5

Update README.md

Files changed (1):
  1. README.md +4 -114
README.md CHANGED
@@ -50,13 +50,13 @@ Interactive conversations about image content with context retention.
 ## 📸 Demo Screenshots
 
 ### Image Captioning
-![Image Captioning](source/image%20(1).png)
+![Image Captioning](source/image%20(4).png)
 
 ### Visual Question Answering
-![Visual Question Answering](source/image%20(1).png)
+![Visual Question Answering](source/image%20(3).png)
 
 ### Zero-Shot Classification
-![Zero-Shot Classification](source/image%20(1).png)
+![Zero-Shot Classification](source/image%20(2).png)
 
 ### Multimodal Chat
 ![Multimodal Chat](source/image%20(1).png)
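The screenshot links in this section percent-encode the space in each filename (`image (4).png` becomes `image%20(4).png`). A minimal sketch of generating such Markdown links with the standard library (the helper name is my own, not from the repo):

```python
from urllib.parse import quote

def md_image(alt_text: str, path: str) -> str:
    # Percent-encode spaces and other unsafe characters so the Markdown
    # image link resolves; keep "/" and parentheses literal, matching
    # the style of the links in this README.
    return f"![{alt_text}]({quote(path, safe='/()')})"

print(md_image("Image Captioning", "source/image (4).png"))
# -> ![Image Captioning](source/image%20(4).png)
```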
@@ -76,7 +76,6 @@ Access at `http://localhost:7860`
 
 ### Deploy to Hugging Face Spaces
 
-#### Method 1: Web Interface
 1. Go to https://huggingface.co/spaces
 2. Click **"Create new Space"**
 3. Fill in:
@@ -91,28 +90,7 @@ Access at `http://localhost:7860`
 - `source/` folder (with screenshots)
 5. Space will auto-deploy in 5-10 minutes
 
-#### Method 2: Git
-```bash
-# Clone your space repository
-git clone https://huggingface.co/spaces/YOUR_USERNAME/YOUR_SPACE_NAME
-cd YOUR_SPACE_NAME
-
-# Copy your files
-cp app.py requirements.txt README.md ./
-cp -r source ./
-
-# Push to Hugging Face
-git add .
-git commit -m "Initial commit"
-git push
-```
-
-#### Enable GPU (Optional)
-1. Go to **Settings** → **Hardware**
-2. Select **GPU** option
-3. Restart the Space
 
-GPU provides 10-50x faster processing and better user experience.
 
 ## 🛠️ Models Used
 
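Besides the web upload and the Git method shown above, the same files can be pushed to a Space programmatically. A minimal sketch assuming `huggingface_hub` is installed and you are logged in; the repo-id placeholders mirror the ones in the Git example, and `upload_space` is my own helper name:

```python
def space_repo_id(username: str, space_name: str) -> str:
    # A Space is addressed as "<user>/<name>", the same path used in
    # its git URL: https://huggingface.co/spaces/<user>/<name>
    return f"{username}/{space_name}"

def upload_space(folder: str, repo_id: str) -> None:
    # upload_folder pushes the folder's contents to the Space in one commit.
    from huggingface_hub import HfApi
    HfApi().upload_folder(folder_path=folder, repo_id=repo_id, repo_type="space")

if __name__ == "__main__":
    try:
        upload_space(".", space_repo_id("YOUR_USERNAME", "YOUR_SPACE_NAME"))
    except Exception as exc:  # package missing or not logged in
        print(f"upload skipped: {exc}")
```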
@@ -222,80 +200,6 @@ building: 0.00%
 - Build on previous responses
 - Keep questions related to the image
 
----
-
-## ⚙️ Advanced Configuration
-
-### Change Models
-Edit `app.py` to use different models:
-
-```python
-# Use larger BLIP model for better quality
-caption_model = BlipForConditionalGeneration.from_pretrained(
-    "Salesforce/blip-image-captioning-large"  # 990MB, better quality
-)
-
-# Use larger CLIP model
-clip_model = CLIPModel.from_pretrained(
-    "openai/clip-vit-large-patch14"  # 1.7GB, more accurate
-)
-```
-
-### Customize Interface Style
-Modify `custom_css` in `app.py`:
-
-```python
-custom_css = """
-#title {
-    background: linear-gradient(90deg, #FF6B6B 0%, #4ECDC4 100%);
-    font-size: 3.5em;
-}
-"""
-```
-
-### Adjust Generation Parameters
-Control model behavior:
-
-```python
-# Generate longer captions
-out = caption_model.generate(**inputs, max_length=100)
-
-# More accurate but slower VQA
-out = vqa_model.generate(**inputs, max_length=50, num_beams=5)
-```
-
-## 🐛 Troubleshooting
-
-### Common Issues
-
-**Models downloading slowly**
-```bash
-# Set cache directory to a location with more space
-export HF_HOME=/path/to/large/storage
-python app.py
-```
-
-**Out of memory error**
-```python
-# Add at the start of app.py
-import torch
-torch.cuda.empty_cache()
-
-# Or force CPU usage
-device = "cpu"
-```
-
-**Port already in use**
-```bash
-# Use different port
-python app.py --server-port 8080
-```
-
-**Space build failing**
-- Check `requirements.txt` for correct package versions
-- Verify all files are uploaded correctly
-- Check build logs in Space settings
-
 ### Getting Help
 - 📖 [Gradio Documentation](https://gradio.app/docs/)
 - 🤗 [Hugging Face Forums](https://discuss.huggingface.co/)
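The "Port already in use" entry removed in this hunk can also be checked up front before launching. A small stdlib sketch (function name is my own) for testing whether a port is free:

```python
import socket

def port_is_free(port: int, host: str = "127.0.0.1") -> bool:
    # Binding succeeds only if nothing is already bound to the port,
    # so a successful bind means the port is available.
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        try:
            s.bind((host, port))
            return True
        except OSError:
            return False

# Pick 7860 (Gradio's default) if free, otherwise fall back to 8080.
port = 7860 if port_is_free(7860) else 8080
```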
@@ -324,7 +228,6 @@ MIT License - See [LICENSE](LICENSE) file for details.
 - **BLIP**: BSD-3-Clause License
 - **CLIP**: MIT License
 
-All models are free for commercial use.
 
 ## 🙏 Acknowledgments
 
@@ -334,18 +237,5 @@ Built with amazing open-source projects:
 - [Hugging Face Transformers](https://huggingface.co/docs/transformers) - Model hub and inference
 - [Gradio](https://gradio.app/) - Beautiful web interfaces
 
-## 🔗 Links
-
-- **Live Demo**: [Your Space URL]
-- **GitHub Repository**: [Your Repo URL]
-- **Report Issues**: [GitHub Issues]
-
----
-
-<div align="center">
-
-**⭐ If you find this project helpful, please star it! ⭐**
-
-Made with ❤️ by the open-source community
 
-</div>
+---
 