3morixd's picture
Upload README.md with huggingface_hub
3c8e62e verified
|
Raw
History Blame Contribute Delete
819 Bytes

A newer version of the Gradio SDK is available: 6.19.0

Upgrade
metadata
title: MCP Latency Estimator
emoji: 
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: apache-2.0
tags:
  - mcp
  - latency
  - mobile
  - benchmark
  - dispatchai

⚡ dispatchAI MCP Latency Estimator

MCP Server — estimate on-device inference latency for any mobile LLM.

Tools

  • estimate_latency(params, quant, hardware, prompt_tokens, generate_tokens) — Get tokens/sec, RAM, and total time
  • list_supported_hardware() — Browse supported hardware profiles

Usage in Claude Desktop

{
  "mcpServers": {
    "dispatchai-latency": {
      "url": "https://huggingface.co/spaces/dispatchAI/mcp-latency-estimator/mcp"
    }
  }
}

🚀 dispatchAI — Small. Mobile. Free. UAE-built.