File size: 2,633 Bytes
2a9d63a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
---

title: "Muse:Eye β€” On-device Multimodal XR Docent"
emoji: 🎨
colorFrom: blue
colorTo: purple
sdk: other
pipeline_tag: other
tags:
  - multimodal
  - XR
  - on-device
  - tflite
  - mobileclip
  - android
  - unity
  - RAG
  - museum-tech
---


# Muse:Eye  
**On-device Multimodal AI + XR Museum Docent for Android**

Muse:EyeλŠ” **카메라 기반 μž‘ν’ˆ 인식(MobileCLIP)**,  
**μ˜¨λ””λ°”μ΄μŠ€ LLM(Gemma-3N)**,  
**XR μΈν„°λž™μ…˜(Unity MR)**  
을 κ²°ν•©ν•œ ν•˜μ΄λΈŒλ¦¬λ“œ AI λ„μŠ¨νŠΈ μ‹œμŠ€ν…œμž…λ‹ˆλ‹€.

μ˜€ν”„λΌμΈ ν™˜κ²½μ—μ„œλ„ λΉ λ₯Έ 이미지 검색,  
메타데이터 기반 RAG,  
성인/어린이/μ „λ¬Έκ°€ λͺ¨λ“œ μ„€λͺ… 제곡이 κ°€λŠ₯ν•©λ‹ˆλ‹€.

---

# ✨ Features

### 1) ⚑ On-device AI (No Internet Required)
- MobileCLIP (TFLite) 이미지 μž„λ² λ”©  
- μž‘ν’ˆ μœ μ‚¬λ„ 검색 (FAISS β†’ custom `.bin` index)  
- Gemma-3N μ˜¨λ””λ°”μ΄μŠ€ LLM  

### 2) πŸ–Ό Artwork Recognition
- 카메라 인식 β†’ μ¦‰μ‹œ embedding μΆ”μΆœ  
- RAG 기반 μž‘ν’ˆΒ·μž‘κ°€ μ„€λͺ… 생성  

### 3) πŸ‘ΆπŸ§‘β€πŸ¦³ Three Docent Modes
- 어린이 λͺ¨λ“œ  
- 성인 λͺ¨λ“œ  
- μ „λ¬Έκ°€ λͺ¨λ“œ  

### 4) πŸ₯½ XR Integrated
- Unity + Android Studio 톡합  
- MR λͺ¨λ“œμ—μ„œ 3D UI 및 μŒμ„± μ•ˆλ‚΄ 제곡  

---
# πŸ— System Architecture

Muse:EyeλŠ” 두 κ°€μ§€ μ‹€ν–‰ λͺ¨λ“œλ₯Ό μ§€μ›ν•©λ‹ˆλ‹€:

- **Mobile Mode (On-device AI)** β€” μ™„μ „ μ˜€ν”„λΌμΈ  
- **XR/MR Mode (Cloud Multimodal AI)** β€” Unity 기반 MR + Cloud API  

---

## πŸ“± A. Mobile Mode (On-device AI)

```plaintext

Mobile App (Android)

β”‚

β”œβ”€β”€ Camera Input

β”‚

β”œβ”€β”€ MobileCLIP (TFLite)

β”‚     └─ on-device embedding (512-d)

β”‚

β”œβ”€β”€ Embedding Index (.bin)

β”‚     └─ cosine similarity search (offline)

β”‚

β”œβ”€β”€ Local Metadata RAG

β”‚     └─ artwork metadata lookup

β”‚

β”œβ”€β”€ Gemma-3N On-device LLM

β”‚     └─ adult / child / expert explanation

β”‚

└── Android TTS

```

---

## πŸ₯½ B. XR/MR Mode (Cloud Multimodal AI)

```plaintext

Unity XR App (MR View)

β”‚

β”œβ”€β”€ MR Camera (RenderTexture)

β”‚

β”œβ”€β”€ JNI Bridge β†’ Android

β”‚     └─ frame bytes 전달

β”‚

β”œβ”€β”€ Cloud Multimodal API (Gemini)

β”‚     β”œβ”€ artwork analysis

β”‚     β”œβ”€ style / meaning extraction

β”‚     └─ multimodal reasoning

β”‚

β”œβ”€β”€ Cloud PromptManager

β”‚     β”œβ”€ 어린이 λͺ¨λ“œ

β”‚     β”œβ”€ 성인 λͺ¨λ“œ

β”‚     └─ μ „λ¬Έκ°€ λͺ¨λ“œ

β”‚

└── Unity 3D UI + Android TTS

      β”œβ”€ floating info panel

      β”œβ”€β”€ 3D guide elements

      └── audio output

```