Spaces:
Runtime error
Runtime error
title: Pdf2audio | |
emoji: 📚 | |
colorFrom: yellow | |
colorTo: pink | |
sdk: gradio | |
sdk_version: 4.44.0 | |
app_file: app.py | |
pinned: false | |
license: apache-2.0 | |
# PDF to Audio Converter | |
This Gradio app converts PDFs into audio podcasts, lectures, summaries, and more. It uses OpenAI's GPT models for text generation and text-to-speech conversion. | |
## Features | |
- Upload multiple PDF files | |
- Choose from different instruction templates (podcast, lecture, summary, etc.) | |
- Customize text generation and audio models | |
- Select different voices for speakers | |
## How to Use | |
1. Upload one or more PDF files | |
2. Select the desired instruction template | |
3. Customize the instructions if needed | |
4. Click "Generate Audio" to create your audio content | |
## Note | |
This app requires an OpenAI API key to function. | |
## Credits | |
References and citations below. Credits to the code base in ```https://github.com/knowsuchagency/pdf-to-podcast```. | |
```bibtex | |
@article{ghafarollahi2024sciagentsautomatingscientificdiscovery, | |
title={SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning}, | |
author={Alireza Ghafarollahi and Markus J. Buehler}, | |
year={2024}, | |
eprint={2409.05556}, | |
archivePrefix={arXiv}, | |
primaryClass={cs.AI}, | |
url={https://arxiv.org/abs/2409.05556}, | |
} | |
@article{buehler2024graphreasoning, | |
author={Markus J. Buehler}, | |
title={Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph Reasoning}, | |
journal={Machine Learning: Science and Technology}, | |
year={2024}, | |
url={http://iopscience.iop.org/article/10.1088/2632-2153/ad7228}, | |
} | |
``` | |