Spaces:
Sleeping
Sleeping
Commit ·
fa19dc2
1
Parent(s): db06a63
Update README with OMR project description and Starry features
Browse files
README.md
CHANGED
|
@@ -9,17 +9,30 @@ license: bsd
|
|
| 9 |
app_port: 7860
|
| 10 |
---
|
| 11 |
|
| 12 |
-
# STARRY —
|
| 13 |
|
| 14 |
-
|
| 15 |
|
| 16 |
-
##
|
| 17 |
|
| 18 |
-
-
|
| 19 |
-
- Automatic
|
| 20 |
-
-
|
| 21 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
|
| 23 |
## Note
|
| 24 |
|
| 25 |
-
This is a lightweight deployment. ML prediction services (layout, gauge, mask, semantic, OCR) are not included in this Space.
|
|
|
|
| 9 |
app_port: 7860
|
| 10 |
---
|
| 11 |
|
| 12 |
+
# STARRY — Optical Music Recognition
|
| 13 |
|
| 14 |
+
Starry is a demo application for an end-to-end **Optical Music Recognition (OMR)** system. It transforms scanned sheet music images into structured digital music notation through a multi-stage ML pipeline, and provides an interactive platform for reviewing, editing, and managing the recognition results.
|
| 15 |
|
| 16 |
+
## What Makes Starry Different
|
| 17 |
|
| 18 |
+
- **Multi-stage ML pipeline** — Seven specialized models work in sequence: layout detection, staff gauge prediction, foreground/background mask separation, symbol semantic recognition, text location detection, OCR, and bracket recognition. Each model focuses on a specific subtask, enabling high overall accuracy.
|
| 19 |
+
- **Automatic regulation** — An ONNX-based solver analyzes recognized measures for timing consistency and automatically corrects quantization errors, reducing the need for manual intervention.
|
| 20 |
+
- **Measure-level quality tracking** — The system identifies and flags problematic measures for human review, with an annotation workflow that tracks corrections per-measure.
|
| 21 |
+
- **Real-time progress** — WebSocket streaming keeps the UI updated during long-running recognition and regulation tasks.
|
| 22 |
+
- **Score collection management** — Organize recognized scores into music sets with tagging and categorization, useful for building OMR datasets or curating repertoire.
|
| 23 |
+
|
| 24 |
+
## Architecture
|
| 25 |
+
|
| 26 |
+
```
|
| 27 |
+
Frontend (React)
|
| 28 |
+
↓ nginx (port 7860)
|
| 29 |
+
├─ /api/score → cluster-server (NestJS)
|
| 30 |
+
└─ /api/* → omr-service (Fastify)
|
| 31 |
+
├─ async task worker
|
| 32 |
+
├─ PostgreSQL
|
| 33 |
+
└─ ZMQ → Python ML services (×7)
|
| 34 |
+
```
|
| 35 |
|
| 36 |
## Note
|
| 37 |
|
| 38 |
+
This is a lightweight demo deployment on HuggingFace Spaces. The ML prediction services (layout, gauge, mask, semantic, text, OCR, brackets) are **not included** in this Space — it is intended for showcasing the review and editing workflow with pre-computed results.
|