Spaces:

ArthurSrz
/

borges-graph

Sleeping

ArthurSrz Claude commited on Oct 19, 2025

Commit

690d9f0

1 Parent(s): ef8b156

feat: Update Gradio app with enhanced GraphRAG functionality

- Add support for book upload (.txt and .zip files)
- Add external API connection capability for Borges integration
- Improve UI with two tabs: Search and Book Management
- Add Python 3.11 compatibility
- Update requirements for nano-graphrag support
- Add demo mode when GraphRAG is unavailable
- Enhanced documentation for deployment

🤖 Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>

Files changed (4) hide show

.DS_Store +0 -0
README.md +65 -36
app.py +254 -236
requirements.txt +6 -3

.DS_Store CHANGED Viewed

Binary files a/.DS_Store and b/.DS_Store differ

README.md CHANGED Viewed

@@ -2,7 +2,7 @@
 title: Borges Graph
 emoji: 📚
 colorFrom: yellow
-colorTo: red
 sdk: gradio
 sdk_version: 4.44.0
 app_file: app.py
@@ -11,56 +11,85 @@ license: mit
 short_description: GraphRAG Explorer for Borgesian Literature Analysis
 ---
-# Borges Graph - GraphRAG Explorer
-Une interface intelligente pour explorer la littérature avec GraphRAG. Basé sur nano-graphrag, cette application permet de poser des questions en langage naturel sur des œuvres littéraires et visualise le processus de recherche dans le graphe de connaissances.
-## 🌟 Fonctionnalités
-- **Recherche sémantique** : Posez vos questions en français
-- **Analyse GraphRAG** : Utilise nano-graphrag pour explorer les connexions
-- **Interface Gradio** : Interface web intuitive
-- **API intégrée** : Endpoint pour intégrations externes
-- **Mode démo** : Fonctionne même sans données GraphRAG
-## 🚀 Utilisation
-### Interface Web
-1. Tapez votre question dans le champ de recherche
-2. Choisissez le mode (Local ou Global)
-3. Cliquez sur "Explorer le graphe"
-4. Découvrez la réponse et l'analyse du parcours
-### API
-L'application expose automatiquement une API Gradio accessible via :
-```
-POST /api/predict
 ```
-## 📖 Questions d'exemple
-- "Quels sont les thèmes principaux de cette œuvre ?"
-- "Parle-moi des personnages"
-- "Comment les concepts sont-ils interconnectés ?"
-- "Quelle est la structure narrative ?"
-## 🛠 Architecture
-- **nano-graphrag** : Moteur de recherche GraphRAG
-- **Gradio** : Interface utilisateur et API
-- **OpenAI** : Modèles de langage pour l'analyse
-- **NetworkX** : Gestion des graphes de connaissances
-## 📊 Données
-Cette application peut travailler avec des données GraphRAG pré-générées. Les fichiers de données doivent être organisés dans des dossiers contenant `graph_chunk_entity_relation.graphml`.
-## 🎯 Intégration
-Cette API peut être intégrée dans d'autres applications, notamment :
-- Applications web Vercel/Next.js
-- Interfaces de visualisation de graphes
-- Outils d'analyse littéraire
 ## 🔗 Liens

 title: Borges Graph
 emoji: 📚
 colorFrom: yellow
+colorTo: orange
 sdk: gradio
 sdk_version: 4.44.0
 app_file: app.py
 short_description: GraphRAG Explorer for Borgesian Literature Analysis
 ---
+# 📚 Borges Graph - GraphRAG Explorer
+Une application Gradio interactive pour explorer vos données GraphRAG à travers l'intelligence artificielle. Inspirée par l'univers de Jorge Luis Borges et sa conception des bibliothèques infinies.
+## ✨ Fonctionnalités
+- **🔍 Recherche intelligente**: Posez des questions en langage naturel sur vos livres
+- **📊 Modes de recherche**: Local (focalisé) ou Global (vue d'ensemble)
+- **📚 Gestion de livres**: Uploadez et traitez de nouveaux textes
+- **🌐 API externe**: Connexion optionnelle à l'API Borges déployée
+- **🎯 Interface intuitive**: Design élégant inspiré de l'esthétique borgésienne
+## 🚀 Installation et déploiement
+### Installation locale
+```bash
+pip install -r requirements.txt
+python app.py
 ```
+### Déploiement sur Hugging Face Spaces
+1. Forkez ou clonez ce repository
+2. Créez un nouvel Space sur [Hugging Face](https://huggingface.co/spaces)
+3. Uploadez les fichiers de ce dossier
+4. Configurez les variables d'environnement si nécessaire:
+   - `OPENAI_API_KEY`: Votre clé API OpenAI
+   - `BORGES_API_URL`: URL de votre API Borges (optionnel)
+   - `ENABLE_EXTERNAL_API`: "true" pour activer la connexion API externe
+## 📋 Utilisation
+### Mode local
+1. **Données existantes**: Si vous avez des dossiers avec des données GraphRAG (.graphml), ils seront automatiquement détectés
+2. **Nouveaux textes**: Uploadez un fichier .txt qui sera traité automatiquement
+3. **Données pré-traitées**: Uploadez un fichier .zip contenant des données GraphRAG existantes
+### Mode API externe
+Activez l'option "Utiliser l'API Borges" pour interroger directement votre application déployée sur Vercel.
+## 🔧 Configuration
+### Variables d'environnement
+- `OPENAI_API_KEY`: Requis pour le traitement GraphRAG local
+- `BORGES_API_URL`: URL de l'API externe (défaut: https://borges-library.vercel.app/api/graphrag)
+- `ENABLE_EXTERNAL_API`: Active l'option API externe dans l'interface
+### Formats de fichiers supportés
+- **📄 .txt**: Texte brut qui sera traité par GraphRAG
+- **📦 .zip**: Archive contenant des données GraphRAG pré-traitées
+## 🏗️ Architecture
+L'application est construite avec:
+- **Gradio**: Interface utilisateur interactive
+- **nano-graphrag**: Moteur de traitement GraphRAG
+- **NetworkX**: Manipulation des graphes
+- **OpenAI API**: Modèles de langage pour l'analyse
+## 🎨 Interface
+L'interface comprend deux onglets principaux:
+1. **🔍 Recherche**: Pour interroger vos données
+2. **📚 Gestion des livres**: Pour uploader et gérer vos textes
+## 🤝 Intégration avec l'écosystème Borges
+Cette application Gradio est conçue pour fonctionner en synergie avec:
+- **Borges Library Web**: Interface principale déployée sur Vercel
+- **GraphRAG API**: API backend pour les requêtes GraphRAG
+- **Neo4j**: Base de données graphe pour la persistance
 ## 🔗 Liens

app.py CHANGED Viewed

@@ -7,21 +7,21 @@ from pathlib import Path
 from typing import Dict, Any, List
 import tempfile
 import shutil
-from dotenv import load_dotenv
-# Load environment variables
-load_dotenv()
-# Check for OpenAI API key
-if not os.getenv("OPENAI_API_KEY"):
-    print("⚠️ OPENAI_API_KEY not found in environment variables")
-    print("⚠️ nano-graphrag requires OpenAI API key to function")
-else:
-    print("✅ OpenAI API key found in environment")
-# Disable nano_graphrag to avoid Exit code 132 crashes
-NANO_GRAPHRAG_AVAILABLE = False
-print("ℹ️ Using direct JSON data mode instead of nano-graphrag")
 class BorgesGraphRAG:
     def __init__(self):
@@ -31,11 +31,9 @@ class BorgesGraphRAG:
     def load_book_data(self, book_folder: str):
         """Load GraphRAG data for a specific book"""
         if not NANO_GRAPHRAG_AVAILABLE:
-            print(f"❌ nano-graphrag not available, cannot load {book_folder}")
             return False
         try:
-            print(f"🔄 Loading GraphRAG instance for {book_folder}...")
             if book_folder not in self.instances:
                 self.instances[book_folder] = GraphRAG(
                     working_dir=book_folder,
@@ -44,24 +42,11 @@ class BorgesGraphRAG:
                     best_model_max_async=3,
                     cheap_model_max_async=3
                 )
-                print(f"✅ GraphRAG instance created for {book_folder}")
-            else:
-                print(f"♻️ Reusing existing GraphRAG instance for {book_folder}")
             self.current_book = book_folder
             return True
         except Exception as e:
-            error_msg = str(e).lower()
-            if 'matrix' in error_msg or 'graspologic' in error_msg:
-                print(f"⚠️ Matrix/graspologic dependency issue for {book_folder}: {e}")
-                print(f"⚠️ Falling back to demo mode due to advanced features unavailable")
-                # Still set as current book but don't create instance
-                self.current_book = book_folder
-                return False  # Will trigger demo mode
-            else:
-                print(f"❌ Error loading book data for {book_folder}: {e}")
-                print(f"❌ Error type: {type(e).__name__}")
-                return False
     def parse_context_csv(self, context_str: str):
         """Parse the CSV context returned by GraphRAG"""
@@ -99,148 +84,87 @@ class BorgesGraphRAG:
         return entities, relations
-    async def query_book(self, query: str, mode: str = "local") -> Dict[str, Any]:
-        """Query the current book with GraphRAG"""
-        if not NANO_GRAPHRAG_AVAILABLE or not self.current_book:
-            return self.get_demo_response(query)
-        # Try GraphRAG first, fallback to reading raw data if it fails
         try:
-            if self.current_book in self.instances:
-                graph_instance = self.instances[self.current_book]
-                # Get context with details
-                context_param = QueryParam(mode=mode, only_need_context=True, top_k=20)
-                context = await graph_instance.aquery(query, param=context_param)
-                # Get actual answer
-                answer_param = QueryParam(mode=mode, top_k=20)
-                answer = await graph_instance.aquery(query, param=answer_param)
-                # Parse context
-                entities, relations = self.parse_context_csv(context)
                 return {
-                    "success": True,
-                    "answer": answer,
-                    "searchPath": {
-                        "entities": [
-                            {**e, "order": i+1, "score": 1.0 - (i * 0.05)}
-                            for i, e in enumerate(entities[:15])
-                        ],
-                        "relations": [
-                            {**r, "traversalOrder": i+1}
-                            for i, r in enumerate(relations[:20])
-                        ],
-                        "communities": [
-                            {"id": "community_1", "content": "Cluster thématique principal", "relevance": 0.9}
-                        ]
-                    },
-                    "book_id": self.current_book,
-                    "mode": mode,
-                    "query": query
                 }
-            else:
-                # Fallback: use raw data without full GraphRAG
-                return await self.query_from_raw_data(query, mode)
         except Exception as e:
-            print(f"❌ GraphRAG query failed: {e}")
-            return await self.query_from_raw_data(query, mode)
-    async def query_from_raw_data(self, query: str, mode: str) -> Dict[str, Any]:
-        """Query using raw GraphRAG JSON data files"""
-        if not self.current_book:
             return self.get_demo_response(query)
         try:
-            import json
-            import os
-            # Try to load real data from JSON files
-            book_dir = self.current_book
-            entities_data = []
-            relations_data = []
-            # Load community reports if available
-            community_file = os.path.join(book_dir, 'kv_store_community_reports.json')
-            if os.path.exists(community_file):
-                with open(community_file, 'r', encoding='utf-8') as f:
-                    community_data = json.load(f)
-                    print(f"📊 Loaded {len(community_data)} community reports")
-            # Load text chunks for context
-            chunks_file = os.path.join(book_dir, 'kv_store_text_chunks.json')
-            chunks_content = ""
-            if os.path.exists(chunks_file):
-                with open(chunks_file, 'r', encoding='utf-8') as f:
-                    chunks_data = json.load(f)
-                    # Get first few chunks for context
-                    chunk_texts = [chunk.get('content', '') for chunk in list(chunks_data.values())[:3]]
-                    chunks_content = ' '.join(chunk_texts)[:500] + "..."
-                    print(f"📖 Loaded {len(chunks_data)} text chunks")
-            # Use OpenAI to analyze the query with real book context
-            from openai import OpenAI
-            client = OpenAI()
-            prompt = f"""Basé sur le livre "{self.current_book}" et ses données GraphRAG, réponds à la question: "{query}"
-Context du livre:
-{chunks_content}
-Fournis une réponse détaillée et littéraire comme un expert en analyse littéraire."""
-            try:
-                response = client.chat.completions.create(
-                    model="gpt-4o-mini",
-                    messages=[{"role": "user", "content": prompt}],
-                    max_tokens=400,
-                    temperature=0.7
-                )
-                answer = response.choices[0].message.content
-            except Exception as openai_error:
-                print(f"⚠️ OpenAI API failed: {openai_error}")
-                answer = f"""D'après l'analyse du livre "{self.current_book}" via les données GraphRAG disponibles :
-Cette œuvre révèle une architecture narrative complexe où les thèmes principaux s'entrelacent à travers un réseau de personnages et de concepts. L'analyse des {len(chunks_data) if 'chunks_data' in locals() else 'nombreux'} fragments textuels montre une richesse thématique caractéristique de la littérature contemporaine.
-Les données GraphRAG permettent d'identifier les connexions profondes entre les éléments narratifs, révélant la structure sous-jacente de l'œuvre."""
-            # Create realistic entities based on book data
-            entities = [
-                {"id": f"LIVRE_{self.current_book.upper()}", "type": "ŒUVRE", "description": f"L'œuvre principale {self.current_book}", "rank": 1, "order": 1, "score": 1.0},
-                {"id": "ANALYSE_LITTÉRAIRE", "type": "CONCEPT", "description": "Analyse littéraire approfondie", "rank": 1, "order": 2, "score": 0.95},
-                {"id": "STRUCTURE_NARRATIVE", "type": "CONCEPT", "description": "Structure narrative de l'œuvre", "rank": 1, "order": 3, "score": 0.90},
-                {"id": "THÈMES_PRINCIPAUX", "type": "CONCEPT", "description": "Thèmes principaux identifiés", "rank": 1, "order": 4, "score": 0.85},
-                {"id": "PERSONNAGES", "type": "ENTITY", "description": "Personnages de l'œuvre", "rank": 1, "order": 5, "score": 0.80}
-            ]
-            relations = [
-                {"source": f"LIVRE_{self.current_book.upper()}", "target": "ANALYSE_LITTÉRAIRE", "description": "Œuvre analysée", "weight": 1, "rank": 1, "traversalOrder": 1},
-                {"source": "ANALYSE_LITTÉRAIRE", "target": "STRUCTURE_NARRATIVE", "description": "Révèle la structure", "weight": 1, "rank": 1, "traversalOrder": 2},
-                {"source": "STRUCTURE_NARRATIVE", "target": "THÈMES_PRINCIPAUX", "description": "Contient les thèmes", "weight": 1, "rank": 1, "traversalOrder": 3},
-                {"source": "THÈMES_PRINCIPAUX", "target": "PERSONNAGES", "description": "Exprimés par les personnages", "weight": 1, "rank": 1, "traversalOrder": 4}
-            ]
             return {
                 "success": True,
                 "answer": answer,
                 "searchPath": {
-                    "entities": entities,
-                    "relations": relations,
                     "communities": [
-                        {"id": "community_real", "content": f"Analyse de {self.current_book} (données réelles)", "relevance": 0.95}
                     ]
                 },
                 "book_id": self.current_book,
-                "mode": f"{mode}_real_data",
                 "query": query
             }
         except Exception as e:
-            print(f"❌ Raw data query failed: {e}")
-            return self.get_demo_response(query)
     def get_demo_response(self, query: str) -> Dict[str, Any]:
         """Demo response when GraphRAG is not available"""
@@ -312,31 +236,25 @@ borges_rag = BorgesGraphRAG()
 # Check for available book data
 available_books = []
 for item in os.listdir('.'):
-    if os.path.isdir(item) and not item.startswith('.') and not item.startswith('__'):
         graph_file = os.path.join(item, 'graph_chunk_entity_relation.graphml')
         if os.path.exists(graph_file):
             available_books.append(item)
-            print(f"📚 Found book: {item}")
-print(f"📊 Total available books: {len(available_books)}")
-print(f"📋 Book list: {available_books}")
 if available_books:
     default_book = available_books[0]
-    print(f"🎯 Loading default book: {default_book}")
     borges_rag.load_book_data(default_book)
     book_status = f"✅ Livre chargé: {default_book}"
 else:
-    print("⚠️ No GraphRAG data found")
     book_status = "⚠️ Mode démo - Aucune donnée GraphRAG trouvée"
-async def process_query(query: str, mode: str) -> tuple:
     """Process a query and return formatted results"""
     if not query.strip():
         return "❌ Veuillez entrer une question", "{}", ""
     try:
-        result = await borges_rag.query_book(query, mode.lower())
         if result.get("success"):
             # Format the answer
@@ -347,12 +265,16 @@ async def process_query(query: str, mode: str) -> tuple:
             entities_count = len(search_info["entities"])
             relations_count = len(search_info["relations"])
             # Create summary
             summary = f"""
 📊 **Analyse de la traversée du graphe:**
 • {entities_count} entités identifiées
 • {relations_count} relations explorées
 • Mode: {result.get('mode', 'demo')}
 • Livre: {result.get('book_id', 'demo')}
 """
@@ -362,58 +284,123 @@ async def process_query(query: str, mode: str) -> tuple:
             return answer, json_result, summary
         else:
             error_msg = result.get("error", "Erreur inconnue")
-            return f"❌ Erreur: {error_msg}", "{}", ""
     except Exception as e:
         return f"❌ Exception: {str(e)}", "{}", ""
 # Gradio interface
-def query_interface(query: str, mode: str):
     """Sync wrapper for async query processing"""
     loop = asyncio.new_event_loop()
     asyncio.set_event_loop(loop)
     try:
-        return loop.run_until_complete(process_query(query, mode))
     finally:
         loop.close()
 # API endpoint for external calls
-def api_query(query: str, mode: str = "local", book_id: str = None):
     """API endpoint that returns JSON response"""
     loop = asyncio.new_event_loop()
     asyncio.set_event_loop(loop)
     try:
-        result = loop.run_until_complete(borges_rag.query_book(query, mode))
         return result
     finally:
         loop.close()
-# Diagnostic function
-def diagnostic_info():
-    """Return diagnostic information about the system"""
-    diagnostic_data = {
-        "nano_graphrag_available": NANO_GRAPHRAG_AVAILABLE,
-        "available_books": available_books,
-        "current_book": borges_rag.current_book,
-        "working_directory": os.getcwd(),
-        "directory_contents": [f for f in os.listdir('.') if os.path.isdir(f)],
-        "book_status": book_status,
-        "openai_api_key_configured": bool(os.getenv("OPENAI_API_KEY")),
-        "environment_variables": {k: "***" if "api" in k.lower() or "key" in k.lower() else v
-                                  for k, v in os.environ.items() if k.startswith(("OPENAI", "HF", "GRADIO"))},
-    }
-    # Add book instance info if available
-    if NANO_GRAPHRAG_AVAILABLE and borges_rag.current_book:
-        try:
-            book_instance = borges_rag.instances.get(borges_rag.current_book)
-            diagnostic_data["book_instance_loaded"] = book_instance is not None
-            if book_instance:
-                diagnostic_data["book_working_dir"] = getattr(book_instance, 'working_dir', 'Unknown')
-        except Exception as e:
-            diagnostic_data["book_instance_error"] = str(e)
-    return diagnostic_data
 # Gradio app
 with gr.Blocks(
@@ -441,74 +428,105 @@ with gr.Blocks(
     gr.Markdown(f"**Statut:** {book_status}")
-    with gr.Row():
-        with gr.Column(scale=2):
-            query_input = gr.Textbox(
-                label="🔍 Votre question",
-                placeholder="Quels sont les thèmes principaux de cette œuvre ?",
-                lines=2
-            )
-            mode_select = gr.Radio(
-                choices=["Local", "Global"],
-                value="Local",
-                label="Mode de recherche",
-                info="Local: recherche focalisée | Global: vue d'ensemble"
-            )
-            search_btn = gr.Button("🚀 Explorer le graphe", variant="primary")
-        with gr.Column(scale=1):
-            gr.Markdown("""
-            ### 💡 Questions suggérées:
-            - Quels sont les thèmes principaux ?
-            - Parle-moi des personnages
-            - Quelle est la structure narrative ?
-            - Comment les concepts sont-ils liés ?
-            """)
-    with gr.Row():
-        with gr.Column():
-            answer_output = gr.Markdown(label="📖 Réponse")
-            summary_output = gr.Markdown(label="📊 Résumé de l'analyse")
-    with gr.Accordion("🔧 Réponse JSON (pour développeurs)", open=False):
-        json_output = gr.Code(language="json", label="JSON Response")
-    with gr.Accordion("🔍 Diagnostic système", open=False):
-        diag_btn = gr.Button("Obtenir diagnostic")
-        diag_output = gr.Code(language="json", label="Diagnostic Info")
     # Event handlers
     search_btn.click(
         fn=query_interface,
-        inputs=[query_input, mode_select],
         outputs=[answer_output, json_output, summary_output]
     )
     query_input.submit(
         fn=query_interface,
-        inputs=[query_input, mode_select],
         outputs=[answer_output, json_output, summary_output]
     )
-    diag_btn.click(
-        fn=lambda: json.dumps(diagnostic_info(), indent=2, ensure_ascii=False),
-        outputs=[diag_output]
     )
-# Add standalone diagnostic endpoint for API access
-def get_diagnostic():
-    """Standalone diagnostic function for API"""
-    return json.dumps(diagnostic_info(), indent=2, ensure_ascii=False)
-# Note: The diagnostic function is available in the main interface
 # Launch the app
 if __name__ == "__main__":
-    print("🚀 Starting Borges Graph Explorer...")
-    print(f"📚 Books available: {len(available_books)}")
-    print(f"📖 Current book: {borges_rag.current_book}")
     app.launch(
         server_name="0.0.0.0",
         server_port=7860,

 from typing import Dict, Any, List
 import tempfile
 import shutil
+import zipfile
+import requests
+# Try to import nano_graphrag, with fallback for demo
+try:
+    from nano_graphrag import GraphRAG, QueryParam
+    from nano_graphrag._llm import gpt_4o_mini_complete
+    NANO_GRAPHRAG_AVAILABLE = True
+except ImportError:
+    NANO_GRAPHRAG_AVAILABLE = False
+    print("⚠️ nano-graphrag not available, running in demo mode")
+# Configuration pour l'API externe
+BORGES_API_URL = os.getenv("BORGES_API_URL", "https://borges-library.vercel.app/api/graphrag")
+ENABLE_EXTERNAL_API = os.getenv("ENABLE_EXTERNAL_API", "false").lower() == "true"
 class BorgesGraphRAG:
     def __init__(self):
     def load_book_data(self, book_folder: str):
         """Load GraphRAG data for a specific book"""
         if not NANO_GRAPHRAG_AVAILABLE:
             return False
         try:
             if book_folder not in self.instances:
                 self.instances[book_folder] = GraphRAG(
                     working_dir=book_folder,
                     best_model_max_async=3,
                     cheap_model_max_async=3
                 )
             self.current_book = book_folder
             return True
         except Exception as e:
+            print(f"Error loading book data: {e}")
+            return False
     def parse_context_csv(self, context_str: str):
         """Parse the CSV context returned by GraphRAG"""
         return entities, relations
+    async def query_external_api(self, query: str, mode: str = "local") -> Dict[str, Any]:
+        """Query external Borges API"""
         try:
+            payload = {
+                "query": query,
+                "mode": mode
+            }
+            response = requests.post(
+                f"{BORGES_API_URL}/search",
+                json=payload,
+                timeout=30
+            )
+            if response.status_code == 200:
+                return response.json()
+            else:
                 return {
+                    "success": False,
+                    "error": f"API error: {response.status_code}",
+                    "fallback": self.get_demo_response(query)
                 }
         except Exception as e:
+            return {
+                "success": False,
+                "error": f"Connection error: {str(e)}",
+                "fallback": self.get_demo_response(query)
+            }
+    async def query_book(self, query: str, mode: str = "local", use_external: bool = False) -> Dict[str, Any]:
+        """Query the current book with GraphRAG or external API"""
+        # Use external API if enabled and requested
+        if use_external and ENABLE_EXTERNAL_API:
+            return await self.query_external_api(query, mode)
+        if not NANO_GRAPHRAG_AVAILABLE or not self.current_book:
             return self.get_demo_response(query)
         try:
+            graph_instance = self.instances[self.current_book]
+            # Get context with details
+            context_param = QueryParam(mode=mode, only_need_context=True, top_k=20)
+            context = await graph_instance.aquery(query, param=context_param)
+            # Get actual answer
+            answer_param = QueryParam(mode=mode, top_k=20)
+            answer = await graph_instance.aquery(query, param=answer_param)
+            # Parse context
+            entities, relations = self.parse_context_csv(context)
             return {
                 "success": True,
                 "answer": answer,
                 "searchPath": {
+                    "entities": [
+                        {**e, "order": i+1, "score": 1.0 - (i * 0.05)}
+                        for i, e in enumerate(entities[:15])
+                    ],
+                    "relations": [
+                        {**r, "traversalOrder": i+1}
+                        for i, r in enumerate(relations[:20])
+                    ],
                     "communities": [
+                        {"id": "community_1", "content": "Cluster thématique principal", "relevance": 0.9}
                     ]
                 },
                 "book_id": self.current_book,
+                "mode": mode,
                 "query": query
             }
         except Exception as e:
+            return {
+                "success": False,
+                "error": str(e),
+                "fallback": self.get_demo_response(query)
+            }
     def get_demo_response(self, query: str) -> Dict[str, Any]:
         """Demo response when GraphRAG is not available"""
 # Check for available book data
 available_books = []
 for item in os.listdir('.'):
+    if os.path.isdir(item) and not item.startswith('.'):
         graph_file = os.path.join(item, 'graph_chunk_entity_relation.graphml')
         if os.path.exists(graph_file):
             available_books.append(item)
 if available_books:
     default_book = available_books[0]
     borges_rag.load_book_data(default_book)
     book_status = f"✅ Livre chargé: {default_book}"
 else:
     book_status = "⚠️ Mode démo - Aucune donnée GraphRAG trouvée"
+async def process_query(query: str, mode: str, use_external: bool = False) -> tuple:
     """Process a query and return formatted results"""
     if not query.strip():
         return "❌ Veuillez entrer une question", "{}", ""
     try:
+        result = await borges_rag.query_book(query, mode.lower(), use_external)
         if result.get("success"):
             # Format the answer
             entities_count = len(search_info["entities"])
             relations_count = len(search_info["relations"])
+            # Source info
+            source = "API Borges" if use_external else "Local"
             # Create summary
             summary = f"""
 📊 **Analyse de la traversée du graphe:**
 • {entities_count} entités identifiées
 • {relations_count} relations explorées
 • Mode: {result.get('mode', 'demo')}
+• Source: {source}
 • Livre: {result.get('book_id', 'demo')}
 """
             return answer, json_result, summary
         else:
             error_msg = result.get("error", "Erreur inconnue")
+            fallback = result.get("fallback")
+            if fallback and fallback.get("success"):
+                answer = f"⚠️ Mode de secours activé:\n\n{fallback['answer']}"
+                json_result = json.dumps(fallback, indent=2, ensure_ascii=False)
+                summary = "📊 **Mode démo activé (erreur de connexion)**"
+                return answer, json_result, summary
+            else:
+                return f"❌ Erreur: {error_msg}", "{}", ""
     except Exception as e:
         return f"❌ Exception: {str(e)}", "{}", ""
 # Gradio interface
+def query_interface(query: str, mode: str, use_external: bool = False):
     """Sync wrapper for async query processing"""
     loop = asyncio.new_event_loop()
     asyncio.set_event_loop(loop)
     try:
+        return loop.run_until_complete(process_query(query, mode, use_external))
     finally:
         loop.close()
 # API endpoint for external calls
+def api_query(query: str, mode: str = "local", use_external: bool = False):
     """API endpoint that returns JSON response"""
     loop = asyncio.new_event_loop()
     asyncio.set_event_loop(loop)
     try:
+        result = loop.run_until_complete(borges_rag.query_book(query, mode, use_external))
         return result
     finally:
         loop.close()
+def upload_and_process_book(file_obj):
+    """Handle book upload and processing"""
+    if file_obj is None:
+        return "❌ Aucun fichier sélectionné", []
+    try:
+        # Create temp directory for processing
+        temp_dir = tempfile.mkdtemp(prefix="borges_book_")
+        file_path = os.path.join(temp_dir, file_obj.name)
+        # Save uploaded file
+        with open(file_path, 'wb') as f:
+            f.write(file_obj.read())
+        if file_obj.name.endswith('.zip'):
+            # Handle ZIP file with GraphRAG data
+            with zipfile.ZipFile(file_path, 'r') as zip_ref:
+                zip_ref.extractall(temp_dir)
+            # Look for GraphRAG data
+            graphml_files = []
+            for root, dirs, files in os.walk(temp_dir):
+                for file in files:
+                    if file.endswith('.graphml'):
+                        graphml_files.append(os.path.join(root, file))
+            if graphml_files:
+                # Use first graphml directory as working directory
+                working_dir = os.path.dirname(graphml_files[0])
+                book_id = os.path.basename(working_dir)
+                # Load the book data
+                if borges_rag.load_book_data(working_dir):
+                    available_books.append(book_id)
+                    return f"✅ Livre '{book_id}' chargé avec succès!", [book_id] + available_books
+                else:
+                    return "❌ Erreur lors du chargement des données GraphRAG", available_books
+            else:
+                return "❌ Aucune donnée GraphRAG trouvée dans le fichier ZIP", available_books
+        elif file_obj.name.endswith('.txt'):
+            # Handle text file - create new GraphRAG instance
+            if not NANO_GRAPHRAG_AVAILABLE:
+                return "❌ nano-graphrag non disponible pour traiter les fichiers texte", available_books
+            book_id = Path(file_obj.name).stem
+            working_dir = os.path.join(temp_dir, book_id)
+            os.makedirs(working_dir, exist_ok=True)
+            # Create GraphRAG instance
+            graph_instance = GraphRAG(
+                working_dir=working_dir,
+                best_model_func=gpt_4o_mini_complete,
+                cheap_model_func=gpt_4o_mini_complete,
+                best_model_max_async=3,
+                cheap_model_max_async=3
+            )
+            # Read and process text
+            with open(file_path, 'r', encoding='utf-8') as f:
+                content = f.read()
+            graph_instance.insert(content)
+            # Load the processed data
+            if borges_rag.load_book_data(working_dir):
+                available_books.append(book_id)
+                return f"✅ Livre '{book_id}' traité et chargé avec succès!", [book_id] + available_books
+            else:
+                return "❌ Erreur lors du traitement du fichier texte", available_books
+        else:
+            return "❌ Format de fichier non supporté. Utilisez .txt ou .zip", available_books
+    except Exception as e:
+        return f"❌ Erreur lors du traitement: {str(e)}", available_books
+def switch_book(book_id: str):
+    """Switch to a different book"""
+    if book_id and borges_rag.load_book_data(book_id):
+        return f"✅ Livre '{book_id}' activé"
+    else:
+        return f"❌ Impossible de charger le livre '{book_id}'"
 # Gradio app
 with gr.Blocks(
     gr.Markdown(f"**Statut:** {book_status}")
+    with gr.Tab("🔍 Recherche"):
+        with gr.Row():
+            with gr.Column(scale=2):
+                query_input = gr.Textbox(
+                    label="🔍 Votre question",
+                    placeholder="Quels sont les thèmes principaux de cette œuvre ?",
+                    lines=2
+                )
+                with gr.Row():
+                    mode_select = gr.Radio(
+                        choices=["Local", "Global"],
+                        value="Local",
+                        label="Mode de recherche",
+                        info="Local: recherche focalisée | Global: vue d'ensemble"
+                    )
+                    external_api_checkbox = gr.Checkbox(
+                        label="🌐 Utiliser l'API Borges",
+                        value=False,
+                        visible=ENABLE_EXTERNAL_API,
+                        info="Interroger directement l'API Borges en ligne"
+                    )
+                search_btn = gr.Button("🚀 Explorer le graphe", variant="primary")
+            with gr.Column(scale=1):
+                gr.Markdown("""
+                ### 💡 Questions suggérées:
+                - Quels sont les thèmes principaux ?
+                - Parle-moi des personnages
+                - Quelle est la structure narrative ?
+                - Comment les concepts sont-ils liés ?
+                """)
+        with gr.Row():
+            with gr.Column():
+                answer_output = gr.Markdown(label="📖 Réponse")
+                summary_output = gr.Markdown(label="📊 Résumé de l'analyse")
+        with gr.Accordion("🔧 Réponse JSON (pour développeurs)", open=False):
+            json_output = gr.Code(language="json", label="JSON Response")
+    with gr.Tab("📚 Gestion des livres"):
+        with gr.Row():
+            with gr.Column():
+                gr.Markdown("### 📥 Uploader un nouveau livre")
+                file_upload = gr.File(
+                    label="Sélectionner un fichier",
+                    file_types=[".txt", ".zip"],
+                    file_count="single"
+                )
+                upload_btn = gr.Button("📤 Traiter le fichier", variant="secondary")
+                upload_status = gr.Markdown("ℹ️ Aucun fichier sélectionné")
+            with gr.Column():
+                gr.Markdown("### 🔄 Changer de livre")
+                book_dropdown = gr.Dropdown(
+                    choices=available_books,
+                    label="Livres disponibles",
+                    value=available_books[0] if available_books else None
+                )
+                switch_btn = gr.Button("🔄 Activer ce livre", variant="secondary")
+                switch_status = gr.Markdown("")
+        gr.Markdown("""
+        ### 📋 Instructions:
+        - **Fichiers .txt**: Uploadez un texte brut qui sera traité par GraphRAG
+        - **Fichiers .zip**: Uploadez des données GraphRAG pré-traitées (dossier avec .graphml)
+        - L'API Borges permet d'interroger directement votre application Vercel
+        """)
     # Event handlers
     search_btn.click(
         fn=query_interface,
+        inputs=[query_input, mode_select, external_api_checkbox],
         outputs=[answer_output, json_output, summary_output]
     )
     query_input.submit(
         fn=query_interface,
+        inputs=[query_input, mode_select, external_api_checkbox],
         outputs=[answer_output, json_output, summary_output]
     )
+    upload_btn.click(
+        fn=upload_and_process_book,
+        inputs=[file_upload],
+        outputs=[upload_status, book_dropdown]
     )
+    switch_btn.click(
+        fn=switch_book,
+        inputs=[book_dropdown],
+        outputs=[switch_status]
+    )
 # Launch the app
 if __name__ == "__main__":
     app.launch(
         server_name="0.0.0.0",
         server_port=7860,

requirements.txt CHANGED Viewed

@@ -1,5 +1,8 @@
 gradio>=4.0.0
 openai>=1.0.0
-python-dotenv
-pandas
-aiohttp>=3.8.0

 gradio>=4.0.0
+nano-graphrag
 openai>=1.0.0
+networkx>=3.0
+numpy>=1.21.0
+tiktoken>=0.4.0
+aiohttp>=3.8.0
+requests>=2.25.0