ForgeSight / hf_space /README.md
rasAli02's picture
docs: final hackathon submission polish and README update
72d96c1
---
title: ForgeSight
emoji: πŸ”
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 5.29.1
app_file: app.py
pinned: true
license: mit
short_description: "Multimodal QC Copilot on AMD MI300X + ROCm"
tags:
- amd
- rocm
- mi300x
- qwen
- vllm
- quality-control
- agents
---
# πŸ” ForgeSight β€” Multimodal QC Copilot on AMD Instinctβ„’ MI300X
ForgeSight is a production-ready **Agentic Quality Control (QC) Pipeline** designed for high-throughput manufacturing environments. Built exclusively for the **AMD + lablab.ai Developer Hackathon**, it leverages the massive 192GB VRAM of the **AMD Instinct MI300X** to run a state-of-the-art multimodal multi-agent workflow.
## πŸš€ Key Features
* **Multimodal Reasoning**: Uses **Qwen2-VL-7B** to "see" and understand complex assembly line defects in a single forward pass.
* **4-Agent Pipeline**: Chained reasoning workflow:
1. **Inspector** β€” Identifies surface defects, anomalies, and violations.
2. **Diagnostician** β€” Performs industry-literate root-cause analysis.
3. **Action** β€” Generates prioritized work orders and tool checklists.
4. **Reporter** β€” Summarizes findings into human-readable executive reports.
* **MI300X Optimized**: Served via **vLLM on ROCm**, utilizing continuous batching and paged attention for near-instant inference.
* **Audit-Ready**: Generates downloadable **PDF QC Audit Reports** for every inspection.
* **Persistent Data**: Integrated with **MongoDB Atlas** for long-term defect tracking and telemetry history.
## πŸ—οΈ Technical Architecture
```mermaid
graph TD
A[React Dashboard] --> B[FastAPI Gateway]
B --> C[Gradio Admin Console]
B --> D[4-Agent Pipeline]
D --> E[AMD MI300X Inference Server]
E --> F[vLLM / ROCm]
F --> G[Qwen2-VL-7B-Instruct]
B --> H[MongoDB Atlas]
B --> I[PDF Generator]
```
### Stack
- **Hardware**: AMD Instinct MI300X (192GB HBM3)
- **Software**: ROCm 6.2, PyTorch 2.4, vLLM
- **Frontend**: React 18, Tailwind CSS, Recharts
- **Backend**: FastAPI, Gradio, Python 3.10
## πŸ› οΈ Installation & Setup
1. **Clone the Repo**: `git clone https://github.com/rasali535/hans.git`
2. **Install Deps**: `pip install -r requirements.txt`
3. **Configure Environment**: Set `AMD_INFERENCE_URL` and `AMD_INFERENCE_TOKEN` in your `.env`.
4. **Launch**: `python hf_space/app.py`
## πŸ“Š Performance on AMD
The MI300X's 5.3 TB/s bandwidth allows ForgeSight to maintain **>2500 tokens/sec** throughput, enabling real-time visual inspection of high-speed manufacturing lines without the latency typical of cloud-based VLM APIs.
---
Built by **Hans** for the **AMD Developer Hackathon**.