Spaces:
Sleeping
Sleeping
| title: Pdf2audio | |
| emoji: 📚 | |
| colorFrom: yellow | |
| colorTo: pink | |
| sdk: gradio | |
| sdk_version: 4.44.0 | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| # PDF to Audio Converter | |
| This Gradio app converts PDFs into audio podcasts, lectures, summaries, and more. It uses OpenAI's GPT models for text generation and text-to-speech conversion. | |
| ## Features | |
| - Upload multiple PDF files | |
| - Choose from different instruction templates (podcast, lecture, summary, etc.) | |
| - Customize text generation and audio models | |
| - Select different voices for speakers | |
| ## How to Use | |
| 1. Upload one or more PDF files | |
| 2. Select the desired instruction template | |
| 3. Customize the instructions if needed | |
| 4. Click "Generate Audio" to create your audio content | |
| ## Note | |
| This app requires an OpenAI API key to function. | |
| ## Credits | |
| This project was inspired by and based on the code available at [https://github.com/knowsuchagency/pdf-to-podcast](https://github.com/knowsuchagency/pdf-to-podcast). | |
| ```bibtex | |
| @article{ghafarollahi2024sciagentsautomatingscientificdiscovery, | |
| title={SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning}, | |
| author={Alireza Ghafarollahi and Markus J. Buehler}, | |
| year={2024}, | |
| eprint={2409.05556}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.AI}, | |
| url={https://arxiv.org/abs/2409.05556}, | |
| } | |
| @article{buehler2024graphreasoning, | |
| author={Markus J. Buehler}, | |
| title={Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph Reasoning}, | |
| journal={Machine Learning: Science and Technology}, | |
| year={2024}, | |
| url={http://iopscience.iop.org/article/10.1088/2632-2153/ad7228}, | |
| } | |
| ``` | |