MuseEye / README.md
NoExitPrior's picture
Update README with custom Muse:Eye documentation
2a9d63a
---
title: "Muse:Eye β€” On-device Multimodal XR Docent"
emoji: 🎨
colorFrom: blue
colorTo: purple
sdk: other
pipeline_tag: other
tags:
- multimodal
- XR
- on-device
- tflite
- mobileclip
- android
- unity
- RAG
- museum-tech
---
# Muse:Eye
**On-device Multimodal AI + XR Museum Docent for Android**
Muse:EyeλŠ” **카메라 기반 μž‘ν’ˆ 인식(MobileCLIP)**,
**μ˜¨λ””λ°”μ΄μŠ€ LLM(Gemma-3N)**,
**XR μΈν„°λž™μ…˜(Unity MR)**
을 κ²°ν•©ν•œ ν•˜μ΄λΈŒλ¦¬λ“œ AI λ„μŠ¨νŠΈ μ‹œμŠ€ν…œμž…λ‹ˆλ‹€.
μ˜€ν”„λΌμΈ ν™˜κ²½μ—μ„œλ„ λΉ λ₯Έ 이미지 검색,
메타데이터 기반 RAG,
성인/어린이/μ „λ¬Έκ°€ λͺ¨λ“œ μ„€λͺ… 제곡이 κ°€λŠ₯ν•©λ‹ˆλ‹€.
---
# ✨ Features
### 1) ⚑ On-device AI (No Internet Required)
- MobileCLIP (TFLite) 이미지 μž„λ² λ”©
- μž‘ν’ˆ μœ μ‚¬λ„ 검색 (FAISS β†’ custom `.bin` index)
- Gemma-3N μ˜¨λ””λ°”μ΄μŠ€ LLM
### 2) πŸ–Ό Artwork Recognition
- 카메라 인식 β†’ μ¦‰μ‹œ embedding μΆ”μΆœ
- RAG 기반 μž‘ν’ˆΒ·μž‘κ°€ μ„€λͺ… 생성
### 3) πŸ‘ΆπŸ§‘β€πŸ¦³ Three Docent Modes
- 어린이 λͺ¨λ“œ
- 성인 λͺ¨λ“œ
- μ „λ¬Έκ°€ λͺ¨λ“œ
### 4) πŸ₯½ XR Integrated
- Unity + Android Studio 톡합
- MR λͺ¨λ“œμ—μ„œ 3D UI 및 μŒμ„± μ•ˆλ‚΄ 제곡
---
# πŸ— System Architecture
Muse:EyeλŠ” 두 κ°€μ§€ μ‹€ν–‰ λͺ¨λ“œλ₯Ό μ§€μ›ν•©λ‹ˆλ‹€:
- **Mobile Mode (On-device AI)** β€” μ™„μ „ μ˜€ν”„λΌμΈ
- **XR/MR Mode (Cloud Multimodal AI)** β€” Unity 기반 MR + Cloud API
---
## πŸ“± A. Mobile Mode (On-device AI)
```plaintext
Mobile App (Android)
β”‚
β”œβ”€β”€ Camera Input
β”‚
β”œβ”€β”€ MobileCLIP (TFLite)
β”‚ └─ on-device embedding (512-d)
β”‚
β”œβ”€β”€ Embedding Index (.bin)
β”‚ └─ cosine similarity search (offline)
β”‚
β”œβ”€β”€ Local Metadata RAG
β”‚ └─ artwork metadata lookup
β”‚
β”œβ”€β”€ Gemma-3N On-device LLM
β”‚ └─ adult / child / expert explanation
β”‚
└── Android TTS
```
---
## πŸ₯½ B. XR/MR Mode (Cloud Multimodal AI)
```plaintext
Unity XR App (MR View)
β”‚
β”œβ”€β”€ MR Camera (RenderTexture)
β”‚
β”œβ”€β”€ JNI Bridge β†’ Android
β”‚ └─ frame bytes 전달
β”‚
β”œβ”€β”€ Cloud Multimodal API (Gemini)
β”‚ β”œβ”€ artwork analysis
β”‚ β”œβ”€ style / meaning extraction
β”‚ └─ multimodal reasoning
β”‚
β”œβ”€β”€ Cloud PromptManager
β”‚ β”œβ”€ 어린이 λͺ¨λ“œ
β”‚ β”œβ”€ 성인 λͺ¨λ“œ
β”‚ └─ μ „λ¬Έκ°€ λͺ¨λ“œ
β”‚
└── Unity 3D UI + Android TTS
β”œβ”€ floating info panel
β”œβ”€β”€ 3D guide elements
└── audio output
```