PDF2Audio

Sleeping

App Files Files Community

PDF2Audio / README.md

mjbuehler

Update README.md

15a2b0b verified over 1 year ago

preview code

raw

history blame

1.81 kB

	---
	title: Pdf2audio
	emoji: 📚
	colorFrom: yellow
	colorTo: pink
	sdk: gradio
	sdk_version: 4.44.0
	app_file: app.py
	pinned: false
	license: apache-2.0
	---

	# PDF to Audio Converter

	This Gradio app converts PDFs into audio podcasts, lectures, summaries, and more. It uses OpenAI's GPT models for text generation and text-to-speech conversion.

	## Features

	- Upload multiple PDF files
	- Choose from different instruction templates (podcast, lecture, summary, etc.)
	- Customize text generation and audio models
	- Select different voices for speakers

	## How to Use

	1. Upload one or more PDF files
	2. Select the desired instruction template
	3. Customize the instructions if needed
	4. Click "Generate Audio" to create your audio content

	## Note

	This app requires an OpenAI API key to function.

	## Credits

	This project was inspired by and based on the code available at [https://github.com/knowsuchagency/pdf-to-podcast](https://github.com/knowsuchagency/pdf-to-podcast) and [https://github.com/knowsuchagency/promptic](https://github.com/knowsuchagency/promptic).

	```bibtex
	@article{ghafarollahi2024sciagentsautomatingscientificdiscovery,
	title={SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning},
	author={Alireza Ghafarollahi and Markus J. Buehler},
	year={2024},
	eprint={2409.05556},
	archivePrefix={arXiv},
	primaryClass={cs.AI},
	url={https://arxiv.org/abs/2409.05556},
	}
	@article{buehler2024graphreasoning,
	title={Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph Reasoning},
	author={Markus J. Buehler},
	journal={Machine Learning: Science and Technology},
	year={2024},
	url={http://iopscience.iop.org/article/10.1088/2632-2153/ad7228},
	}
	```