Đỗ Hải Nam committed on
Commit ab12077 · 1 parent: 359123a
add: metrics and performance
README.md
CHANGED
|
@@ -9,7 +9,7 @@ license: mit
|
|
| 9 |
short_description: Multi-Agent Calculus Orchestration System
|
| 10 |
---
|
| 11 |
|
| 12 |
-
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
| 13 |
|
| 14 |
# Pochi 4.o: Multi-Agent Calculus Orchestration System
|
| 15 |
|
|
@@ -19,9 +19,36 @@ Pochi is a high-performance, asynchronous AI platform specialized in solving com
|
|
| 19 |
|
| 20 |
| Platform | URL |
|
| 21 |
| :--- | :--- |
|
| 22 |
-
| **Koyeb** | [Visit Pochi on Koyeb](https://organisational-etheline-hnam-13dc7109.koyeb.app) |
|
| 23 |
| **Hugging Face** | [Visit Pochi on Hugging Face](https://huggingface.co/spaces/baeGil/calculus-agent) |
|
| 24 |
|
|
| 25 |
## Highlight Features
|
| 26 |
|
| 27 |
- **Multi-Agent Orchestration**: Stateful DAG-based workflow using LangGraph for complex, multi-stage reasoning.
|
|
|
|
| 9 |
short_description: Multi-Agent Calculus Orchestration System
|
| 10 |
---
|
| 11 |
|
| 12 |
+
<!-- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference -->
|
| 13 |
|
| 14 |
# Pochi 4.o: Multi-Agent Calculus Orchestration System
|
| 15 |
|
|
|
|
| 19 |
|
| 20 |
| Platform | URL |
|
| 21 |
| :--- | :--- |
|
|
|
|
| 22 |
| **Hugging Face** | [Visit Pochi on Hugging Face](https://huggingface.co/spaces/baeGil/calculus-agent) |
|
| 23 |
|
| 24 |
+
## Project Achievements & Performance
|
| 25 |
+
|
| 26 |
+
Pochi's performance and reliability are continuously monitored via LangSmith. The figures below summarize usage, reliability, and latency from that monitoring.
|
| 27 |
+
|
| 28 |
+

|
| 29 |
+
|
| 30 |
+
### System Health & Usage
|
| 31 |
+
| Metric | Value | Description |
|
| 32 |
+
| :--- | :--- | :--- |
|
| 33 |
+
| **Total Runs** | 476 | Cumulative execution cycles. |
|
| 34 |
+
| **Total Tokens** | 1.86M | Aggregate token throughput across all agents. |
|
| 35 |
+
| **Median Tokens** | 2,846 | Median context size per solver request. |
|
| 36 |
+
| **Success Rate** | 99% | System resilience against API and execution errors. |
|
| 37 |
+
| **Streaming Adoption** | 99% | Percentage of responses delivered via SSE for real-time feedback. |
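
As a rough consistency check on the table above, the mean tokens per run can be derived from the two totals and compared with the reported median. This is a minimal sketch: it assumes "1.86M" means roughly 1,860,000 tokens, and the variable names are illustrative rather than taken from the Pochi codebase.

```python
# Figures copied from the System Health & Usage table.
# "1.86M" is a rounded value, so the derived mean is approximate.
total_runs = 476
total_tokens = 1_860_000  # assumption: "1.86M" read as 1,860,000
median_tokens = 2_846

# Mean tokens per run, and how far it sits above the median.
mean_tokens = total_tokens / total_runs
mean_to_median = mean_tokens / median_tokens

print(f"mean tokens per run ~ {mean_tokens:,.0f}")
print(f"mean / median ratio ~ {mean_to_median:.2f}")
```

A mean noticeably above the median would suggest a right-skewed distribution, where a minority of long requests account for a disproportionate share of tokens.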
|
| 38 |
+
|
| 39 |
+
### Latency Performance
|
| 40 |
+
> Latency varies significantly with task complexity (e.g., simple symbolic math vs. multi-image OCR with recursive code fixing).
|
| 41 |
+
|
| 42 |
+
| Stage | P50 (Median) | P99 (Tail) |
|
| 43 |
+
| :--- | :---: | :---: |
|
| 44 |
+
| **Time to First Token (TTFT)** | 0.53s | 5.30s |
|
| 45 |
+
| **End-to-End Latency** | 1.51s | 36.95s |
|
| 46 |
+
|
| 47 |
+
**Analysis**:
|
| 48 |
+
- **Responsiveness**: A P50 TTFT of **0.53s** means half of all responses begin streaming in about half a second, so users perceive a near-instant start, which is crucial for engagement.
|
| 49 |
+
- **Efficiency**: A P50 end-to-end latency of **1.51s** means half of all requests are fully resolved within about 1.5 seconds, which reflects the low overhead of the asynchronous multi-agent orchestration.
|
| 50 |
+
- **Complexity Buffer**: The P99 latency (**~37s**) reflects the most intensive "Self-Healing" loops, where the system may perform multiple recursive code fixes or deep vision analysis.
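
To make the P50 and P99 columns concrete, the sketch below computes both from a list of latency samples using the nearest-rank method. The README does not state how LangSmith derives its percentiles, so the method, the `percentile` helper, and the sample values are all assumptions for illustration, not Pochi's monitoring code or data.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample with at least
    p percent of all samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p / 100 * len(ordered))  # 1-based rank
    return ordered[rank - 1]


# Hypothetical end-to-end latencies in seconds (not real Pochi data).
latencies = [1.2, 1.4, 1.5, 1.6, 1.3, 2.1, 1.5, 36.9, 1.7, 1.4]

p50 = percentile(latencies, 50)
p99 = percentile(latencies, 99)
print(f"P50 = {p50:.2f}s, P99 = {p99:.2f}s")
```

With a sample like this, a single slow request dominates the P99 while leaving the P50 almost untouched, which is why the two columns in the table can differ by more than an order of magnitude.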
|
| 51 |
+
|
| 52 |
## Highlight Features
|
| 53 |
|
| 54 |
- **Multi-Agent Orchestration**: Stateful DAG-based workflow using LangGraph for complex, multi-stage reasoning.
|
frontend/public/favicon.jpeg
DELETED
|
frontend/public/favicon.png
DELETED
|
frontend/public/hnam.jpeg
DELETED
|
frontend/public/pochi.jpeg
DELETED
|
frontend/src/assets/electrocardiogram-svgrepo-com.svg
DELETED
frontend/public/favicon.ico → images/traces.png
RENAMED
|
File without changes
|