File size: 1,041 Bytes
6152b4e
 
0ed01b4
6152b4e
0ed01b4
6152b4e
0ed01b4
6152b4e
0ed01b4
 
 
 
 
 
 
 
 
 
6152b4e
 
0ed01b4
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
title: ShutterMuse Video Demo
emoji: 📷
colorFrom: blue
colorTo: purple
sdk: static
app_file: index.html
pinned: false
models:
- ShutterMuse/ShutterMuse
datasets:
- ShutterMuse/CaptureGuide-Bench
tags:
- arxiv:2606.25763
- photography
- multimodal
- vision-language
- video-demo
---

# ShutterMuse Video Demo

This Space hosts the video demo for [ShutterMuse: Capture-Time Photography Guidance with MLLMs](https://huggingface.co/papers/2606.25763).

- **Paper:** https://arxiv.org/abs/2606.25763
- **Paper Page:** https://huggingface.co/papers/2606.25763
- **Project Page:** https://lijayuTnT.github.io/ShutterMuse/
- **GitHub Repository:** https://github.com/lijayuTnT/ShutterMuse
- **ShutterMuse Model:** https://huggingface.co/ShutterMuse/ShutterMuse
- **CaptureGuide-Bench:** https://huggingface.co/datasets/ShutterMuse/CaptureGuide-Bench

ShutterMuse is a unified multimodal large language model for capture-time photography guidance. It supports photographer-side composition recommendation and subject-side pose recommendation.