Spaces: tomo2chin2/ImageGenMCP

tomo2chin2 committed on Jun 1, 2025

Commit 672d01c · verified · 1 parent: f7515c9

Upload 9 files

Files changed (3)

DEVELOPMENT_LOG.md +10 -1
app.py +146 -101
requirements.txt +2 -1

DEVELOPMENT_LOG.md CHANGED

@@ -83,16 +83,25 @@
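 - **Cause**: The model requires both `IMAGE` and `TEXT`
 - **Fix**: Changed to `response_modalities=["IMAGE", "TEXT"]`
-### 2. Unstable image generation (under investigation)
 - **Symptom**: The first request succeeds, but from the second request onward the response contains no image data
 - **Measures taken**:
   - Inspect every part of the response
   - Also collect text content to identify the reason for the failure
 - **Suspected causes**:
   - Rate limiting
   - Content filtering
   - Rejection due to prompt content
 ## Environment setup notes
 - Hugging Face Spaces Secrets:
   - `GEMINI_API_KEY` - Gemini API key (required)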

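 - **Cause**: The model requires both `IMAGE` and `TEXT`
 - **Fix**: Changed to `response_modalities=["IMAGE", "TEXT"]`
+### 2. Unstable image generation (partially resolved)
 - **Symptom**: The first request succeeds, but from the second request onward the response contains no image data
 - **Measures taken**:
   - Inspect every part of the response
   - Also collect text content to identify the reason for the failure
+  - Added a system instruction to force image generation
 - **Suspected causes**:
   - Rate limiting
   - Content filtering
   - Rejection due to prompt content
+### 3. MCP endpoint recognition issue (under investigation)
+- **Symptom**: The `/api/health` and `/api/mcp/*` endpoints return the Gradio UI
+- **Cause**: Adding routes to the FastAPI app inside Gradio 5.31.0 does not work correctly
+- **Options under consideration**:
+  - Fully separate FastAPI and Gradio
+  - Proxy-style architecture
+  - Use the Gradio External API
 ## Environment setup notes
 - Hugging Face Spaces Secrets:
   - `GEMINI_API_KEY` - Gemini API key (required)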
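
Note on item 2: the measure "inspect every part of the response and collect text content" might look roughly like the sketch below. This is not the code from `app.py` (the diff does not show that function body); `split_parts` is a hypothetical helper, and it assumes each response part exposes `inline_data` (with a `data` bytes field) and `text` attributes, as google-genai response parts do. Stand-in objects are used so the sketch runs without an API key.

```python
from types import SimpleNamespace
from typing import Any, Iterable, List, Optional, Tuple


def split_parts(parts: Iterable[Any]) -> Tuple[Optional[bytes], List[str]]:
    """Return (first image payload or None, all text fragments)."""
    image_bytes: Optional[bytes] = None
    texts: List[str] = []
    for part in parts:
        inline = getattr(part, "inline_data", None)
        data = getattr(inline, "data", None) if inline is not None else None
        if data and image_bytes is None:
            image_bytes = data  # keep the first image, keep scanning for text
        text = getattr(part, "text", None)
        if text:
            texts.append(text)  # text often explains why no image was returned
    return image_bytes, texts


# Stand-in parts (hypothetical data): a text-only refusal and a mixed reply.
refusal = [SimpleNamespace(text="I cannot draw that.", inline_data=None)]
mixed = [
    SimpleNamespace(text="Here you go.", inline_data=None),
    SimpleNamespace(
        text=None,
        inline_data=SimpleNamespace(data=b"\x89PNG", mime_type="image/png"),
    ),
]

print(split_parts(refusal))
print(split_parts(mixed))
```

When no image comes back, the collected text can be returned to the caller as the failure message instead of a generic error.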

app.py CHANGED

@@ -10,7 +10,9 @@ import logging
 import traceback
 from typing import Optional
 from fastapi import FastAPI, Request
-from fastapi.responses import JSONResponse
 # Logging configuration
 logging.basicConfig(
@@ -19,6 +21,26 @@ logging.basicConfig(
 )
 logger = logging.getLogger(__name__)
 # Initialize the Gemini API client
 def get_gemini_client():
     api_key = os.environ.get("GEMINI_API_KEY")
@@ -151,6 +173,99 @@ def generate_image(prompt: str, previous_image: Optional[Image.Image] = None) ->
         logger.error(traceback.format_exc())
         return None, f"An error occurred: {str(e)}"
 # Create the Gradio interface
 def create_gradio_interface():
     with gr.Blocks(title="Image Generation MCP - Gemini 2.0 Flash") as demo:
@@ -159,6 +274,11 @@ def create_gradio_interface():
         This application is an MCP server intended mainly for use from ClaudeCode.
         It generates images using Gemini 2.0 Flash Preview.
         """)
         with gr.Tab("Image generation test"):
@@ -190,34 +310,25 @@ def create_gradio_interface():
         with gr.Tab("MCP API info"):
             gr.Markdown("""
-            ### MCP API endpoints
-            ClaudeCode can use the image generation feature through the following endpoints:
-            - **List tools**: `POST https://[your-space-name].hf.space/api/mcp/list_tools`
-            - **Run image generation**: `POST https://[your-space-name].hf.space/api/mcp/call_tool`
-            - **Health check**: `GET https://[your-space-name].hf.space/api/health`
-            ### Request example
-            #### Tool execution (call_tool)
             ```json
             {
-                "name": "generate_image",
-                "arguments": {
-                    "prompt": "A beautiful sunset landscape"
                 }
             }
             ```
-            ### Response example
-            ```json
-            {
-                "success": true,
-                "message": "Image generated successfully!",
-                "image_base64": "iVBORw0KGgo..."
-            }
-            ```
             """)
         # Event handlers
@@ -229,7 +340,6 @@ def create_gradio_interface():
     return demo
 # Main entry point
 if __name__ == "__main__":
     # API key check
@@ -245,88 +355,23 @@ if __name__ == "__main__":
             - Name: `GEMINI_API_KEY`
             - Value: your Gemini API key
             """)
-        # Settings for Hugging Face Spaces
-        demo.queue()
         demo.launch()
     else:
         # Create the Gradio interface
         logger.info("Starting the image generation MCP server...")
         demo = create_gradio_interface()
-        # Get Gradio's internal FastAPI app
-        app = demo.app
-        # Add the MCP endpoints directly
-        @app.post("/api/mcp/list_tools")
-        async def mcp_list_tools():
-            """Return the list of MCP tools"""
-            return {
-                "tools": [
-                    {
-                        "name": "generate_image",
-                        "description": "Generates an image using Gemini 2.0 Flash Preview",
-                        "inputSchema": {
-                            "type": "object",
-                            "properties": {
-                                "prompt": {
-                                    "type": "string",
-                                    "description": "Description of the image to generate"
-                                }
-                            },
-                            "required": ["prompt"]
-                        }
-                    }
-                ]
-            }
-        @app.post("/api/mcp/call_tool")
-        async def mcp_call_tool(request: Request):
-            """Run an MCP tool"""
-            try:
-                data = await request.json()
-                tool_name = data.get("name")
-                arguments = data.get("arguments", {})
-                if tool_name == "generate_image":
-                    prompt = arguments.get("prompt", "")
-                    # Run image generation
-                    image, message = generate_image(prompt)
-                    if image:
-                        # Base64-encode the image
-                        buffered = io.BytesIO()
-                        image.save(buffered, format="PNG")
-                        img_str = base64.b64encode(buffered.getvalue()).decode()
-                        return JSONResponse({
-                            "success": True,
-                            "message": message,
-                            "image_base64": img_str
-                        })
-                    else:
-                        return JSONResponse({
-                            "success": False,
-                            "message": message
-                        })
-                return JSONResponse({
-                    "success": False,
-                    "message": f"Unknown tool: {tool_name}"
-                }, status_code=400)
-            except Exception as e:
-                logger.error(f"MCP error: {str(e)}")
-                return JSONResponse({
-                    "success": False,
-                    "message": f"Error: {str(e)}"
-                }, status_code=500)
-        @app.get("/api/health")
-        async def health_check():
-            """Health check endpoint"""
-            return {"status": "healthy", "service": "image-gen-mcp"}
-        # Settings for Hugging Face Spaces
-        demo.queue()
-        demo.launch()

 import traceback
 from typing import Optional
 from fastapi import FastAPI, Request
+from fastapi.responses import JSONResponse, RedirectResponse
+from fastapi.middleware.cors import CORSMiddleware
+import uvicorn
 # Logging configuration
 logging.basicConfig(
 )
 logger = logging.getLogger(__name__)
+# Also capture Gradio's internal logs
+gradio_logger = logging.getLogger("gradio")
+gradio_logger.setLevel(logging.INFO)
+# Initialize the FastAPI application
+app = FastAPI(
+    title="ImageGenMCP Server",
+    version="1.0.0",
+    description="Gemini 2.0 Flash image generation MCP server"
+)
+# Tune the CORS configuration
+app.add_middleware(
+    CORSMiddleware,
+    allow_origins=["*"],
+    allow_credentials=True,
+    allow_methods=["*"],
+    allow_headers=["*"],
+)
 # Initialize the Gemini API client
 def get_gemini_client():
     api_key = os.environ.get("GEMINI_API_KEY")
         logger.error(traceback.format_exc())
         return None, f"An error occurred: {str(e)}"
+# MCP endpoints
+@app.post("/mcp/list_tools")
+async def mcp_list_tools():
+    """Return the list of MCP tools"""
+    logger.info("MCP list_tools called")
+    return {
+        "tools": [
+            {
+                "name": "generate_image",
+                "description": "Generates an image using Gemini 2.0 Flash Preview",
+                "inputSchema": {
+                    "type": "object",
+                    "properties": {
+                        "prompt": {
+                            "type": "string",
+                            "description": "Description of the image to generate"
+                        }
+                    },
+                    "required": ["prompt"]
+                }
+            }
+        ]
+    }
+@app.post("/mcp/call_tool")
+async def mcp_call_tool(request: Request):
+    """Run an MCP tool"""
+    try:
+        data = await request.json()
+        tool_name = data.get("name")
+        arguments = data.get("arguments", {})
+        logger.info(f"MCP call_tool called: {tool_name}, args: {arguments}")
+        if tool_name == "generate_image":
+            prompt = arguments.get("prompt", "")
+            # Run image generation
+            image, message = generate_image(prompt)
+            if image:
+                # Base64-encode the image
+                buffered = io.BytesIO()
+                image.save(buffered, format="PNG")
+                img_str = base64.b64encode(buffered.getvalue()).decode()
+                logger.info("MCP image generation succeeded")
+                return JSONResponse({
+                    "success": True,
+                    "message": message,
+                    "image_base64": img_str
+                })
+            else:
+                logger.warning(f"MCP image generation failed: {message}")
+                return JSONResponse({
+                    "success": False,
+                    "message": message
+                })
+        return JSONResponse({
+            "success": False,
+            "message": f"Unknown tool: {tool_name}"
+        }, status_code=400)
+    except Exception as e:
+        logger.error(f"MCP error: {str(e)}")
+        return JSONResponse({
+            "success": False,
+            "message": f"Error: {str(e)}"
+        }, status_code=500)
+# Health check endpoint
+@app.get("/health")
+async def health_check():
+    """Health check endpoint"""
+    return {
+        "status": "OK",
+        "version": "5.31.0",
+        "service": "image-gen-mcp",
+        "endpoints": ["/mcp/list_tools", "/mcp/call_tool", "/health"]
+    }
+# Proxy endpoints (for external access)
+@app.post("/gradio_api/mcp/list_tools")
+async def proxy_list_tools():
+    """Call list_tools via the Gradio API path"""
+    return await mcp_list_tools()
+@app.post("/gradio_api/mcp/call_tool")
+async def proxy_call_tool(request: Request):
+    """Call call_tool via the Gradio API path"""
+    return await mcp_call_tool(request)
 # Create the Gradio interface
 def create_gradio_interface():
     with gr.Blocks(title="Image Generation MCP - Gemini 2.0 Flash") as demo:
         This application is an MCP server intended mainly for use from ClaudeCode.
         It generates images using Gemini 2.0 Flash Preview.
+        ## API endpoints
+        - `POST /mcp/list_tools` - List tools
+        - `POST /mcp/call_tool` - Run image generation
+        - `GET /health` - Health check
         """)
         with gr.Tab("Image generation test"):
         with gr.Tab("MCP API info"):
             gr.Markdown("""
+            ### ClaudeCode configuration example
             ```json
             {
+              "mcpServers": {
+                "image-gen": {
+                  "url": "https://tomo2chin2-imagegenmcp.hf.space/mcp/"
                 }
+              }
             }
             ```
+            ### Proxy endpoints
+            - `POST /gradio_api/mcp/list_tools`
+            - `POST /gradio_api/mcp/call_tool`
+            ### Direct endpoints
+            - `POST /mcp/list_tools`
+            - `POST /mcp/call_tool`
+            - `GET /health`
             """)
         # Event handlers
     return demo
 # Main entry point
 if __name__ == "__main__":
     # API key check
             - Name: `GEMINI_API_KEY`
             - Value: your Gemini API key
             """)
         demo.launch()
     else:
         # Create the Gradio interface
         logger.info("Starting the image generation MCP server...")
         demo = create_gradio_interface()
+        # Mount Gradio on the FastAPI app
+        app = gr.mount_gradio_app(app, demo, path="/")
+        # Server startup configuration
+        config = uvicorn.Config(
+            app=app,
+            host="0.0.0.0",
+            port=7860,
+            log_level="info"
+        )
+        logger.info("Starting the combined FastAPI + Gradio server")
+        server = uvicorn.Server(config)
+        server.run()
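
Note on exercising the new routes: a client could post the same JSON shape that `mcp_call_tool` reads (`name`, `arguments.prompt`) and decode the `image_base64` field of the reply. The sketch below uses only the standard library. `build_call_tool_request` and `decode_call_tool_response` are hypothetical helper names, the base URL is the one from the configuration example in this diff, and the diff itself does not confirm that the Space serves these routes (DEVELOPMENT_LOG.md still lists the endpoint issue as under investigation).

```python
import base64
import json
import urllib.request
from typing import Optional, Tuple

BASE_URL = "https://tomo2chin2-imagegenmcp.hf.space"  # from the config example


def build_call_tool_request(
    prompt: str, base_url: str = BASE_URL
) -> urllib.request.Request:
    """Build (but do not send) a POST request for /mcp/call_tool."""
    payload = {"name": "generate_image", "arguments": {"prompt": prompt}}
    return urllib.request.Request(
        url=f"{base_url}/mcp/call_tool",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def decode_call_tool_response(body: bytes) -> Tuple[bool, str, Optional[bytes]]:
    """Parse the JSON reply into (success, message, PNG bytes or None)."""
    reply = json.loads(body)
    encoded = reply.get("image_base64")
    image = base64.b64decode(encoded) if encoded else None
    return bool(reply.get("success")), reply.get("message", ""), image


if __name__ == "__main__":
    req = build_call_tool_request("a lighthouse at dusk")
    print(req.get_method(), req.full_url)
    # Sending needs network access and a running Space, so it is left commented out:
    # with urllib.request.urlopen(req, timeout=120) as resp:
    #     ok, message, png = decode_call_tool_response(resp.read())
```

Keeping request construction separate from sending makes the payload shape easy to check locally before pointing it at the Space.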

requirements.txt CHANGED

@@ -1,5 +1,6 @@
-gradio==5.31.0
 google-genai
 pillow
 fastapi
 python-multipart

+gradio>=5.31.0,<5.32
 google-genai
 pillow
 fastapi
+uvicorn[standard]
 python-multipart
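
Note on the new pin: `gradio>=5.31.0,<5.32` accepts any 5.31.x patch release and rejects 5.32.0 and later, whereas the old `gradio==5.31.0` accepted exactly one version. The stdlib-only sketch below illustrates that range on plain numeric versions. `parse_release` and `in_pinned_range` are hypothetical helpers, not pip's resolver, and they ignore the pre-release and post-release suffixes that full PEP 440 comparison handles.

```python
from typing import Tuple


def parse_release(version: str) -> Tuple[int, ...]:
    """Turn '5.31.2' into (5, 31, 2), padding short versions with zeros."""
    release = tuple(int(piece) for piece in version.split("."))
    return release + (0,) * (3 - len(release))


def in_pinned_range(version: str) -> bool:
    """True when version satisfies >=5.31.0,<5.32 (numeric versions only)."""
    return (5, 31, 0) <= parse_release(version) < (5, 32, 0)


for candidate in ["5.30.9", "5.31.0", "5.31.7", "5.32.0"]:
    print(candidate, in_pinned_range(candidate))
```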