Spaces:

skshimada
/

Hello

Sleeping

App Files Files Community

skshimada commited on 10 days ago

Commit

5919f51

verified ·

1 Parent(s): b40dd10

Update README.md

Browse files

Files changed (1) hide show

README.md +14 -3

README.md CHANGED Viewed

@@ -1,4 +1,15 @@
-# 🍸 LocalAGI: The AI Sommelier
 ## 📖 Overview
 LocalAGI is a multimodal Retrieval-Augmented Generation (RAG) application that acts as an intelligent, interactive bartender. By combining state-of-the-art computer vision with vector search, the application allows users to upload a photo of any liquor bottle and instantly receive curated cocktail recipes utilizing that specific spirit from a custom-ingested library.
@@ -10,10 +21,10 @@ Engineered to run entirely on CPU-bound cloud environments (like Hugging Face Sp
 * **Custom Knowledge Base (RAG):** Ingests raw `.txt` and `.pdf` recipe books, intelligently splitting them into discrete recipe chunks using RegEx and LangChain, and stores them in a local Chroma vector database.
 * **Smart Cropping Pipeline:** Implements YOLOv8 to locate bottles or glasses in an image, applying dynamic 25% padding to isolate the label and strip away background noise.
 * **Hardware-Optimized Processing:** Features custom logic to downscale images and restrict token generation limits, allowing complex 2-billion-parameter models to run efficiently on free-tier cloud CPUs.
-* **Interactive UI:** A Gradio 6.0 interface featuring a conversational chat format, session state memory, and a hidden "Vision Debug" gallery for real-time insight into the AI's detection process.
 ## 🛠️ Technical Stack
-* **Frontend/UI:** Gradio 6.0
 * **Computer Vision:** Ultralytics YOLOv8 (Object Detection)
 * **Vision-Language Model:** HuggingFaceTB/SmolVLM-Instruct (Label OCR & Context)
 * **Vector Database:** ChromaDB

+---
+title: LocalAGI AI Mixologist
+emoji: 🍸
+colorFrom: indigo
+colorTo: purple
+sdk: gradio
+sdk_version: "5.0"
+app_file: app.py
+pinned: false
+---
+# 🍸 LocalAGI: The AI Mixologist
 ## 📖 Overview
 LocalAGI is a multimodal Retrieval-Augmented Generation (RAG) application that acts as an intelligent, interactive bartender. By combining state-of-the-art computer vision with vector search, the application allows users to upload a photo of any liquor bottle and instantly receive curated cocktail recipes utilizing that specific spirit from a custom-ingested library.
 * **Custom Knowledge Base (RAG):** Ingests raw `.txt` and `.pdf` recipe books, intelligently splitting them into discrete recipe chunks using RegEx and LangChain, and stores them in a local Chroma vector database.
 * **Smart Cropping Pipeline:** Implements YOLOv8 to locate bottles or glasses in an image, applying dynamic 25% padding to isolate the label and strip away background noise.
 * **Hardware-Optimized Processing:** Features custom logic to downscale images and restrict token generation limits, allowing complex 2-billion-parameter models to run efficiently on free-tier cloud CPUs.
+* **Interactive UI:** A Gradio interface featuring a conversational chat format, session state memory, and a hidden "Vision Debug" gallery for real-time insight into the AI's detection process.
 ## 🛠️ Technical Stack
+* **Frontend/UI:** Gradio
 * **Computer Vision:** Ultralytics YOLOv8 (Object Detection)
 * **Vision-Language Model:** HuggingFaceTB/SmolVLM-Instruct (Label OCR & Context)
 * **Vector Database:** ChromaDB