Spaces:
Runtime error
Runtime error
| itle: Multimodal | |
| emoji: ๐ | |
| colorFrom: yellow | |
| colorTo: indigo | |
| sdk: docker | |
| pinned: false | |
| license: mit | |
| # Video in, audio out | |
| This is a [Shiny for Python](https://shiny.posit.co/py/) app for easily interacting with GPT-4o via short webcam recordings. | |
| ## Installation | |
| ### ffmpeg | |
| You will need the `ffmpeg` utility installed. Either use the [official installers](https://ffmpeg.org/download.html), or `brew install ffmpeg` (for macOS brew users) or `choco install ffmpeg` (for Windows chocolatey users). | |
| ### OpenAI API key | |
| Create a file called `.env` in the root of the project and add the following line: | |
| ``` | |
| OPENAI_API_KEY=<your-api-key> | |
| ``` | |
| If you have an OpenAI account, you can generate an API key from [this page](https://platform.openai.com/api-keys). | |
| ### Python dependencies | |
| ``` | |
| pip install -r requirements.txt | |
| ``` | |
| ## Usage | |
| ``` | |
| shiny run app.py --port 0 --launch-browser | |
| ``` | |
| This will launch a browser window with a video preview. Press Record, speak your prompt, and press Stop. The video will be processed and the response will be read aloud. | |