Spaces:
Configuration error
Configuration error
File size: 1,608 Bytes
ce8fc87 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 |
# Gradio App: Image to Story Generator
This Gradio app allows you to upload an image, and it will generate a short story based on the image's content using image captioning. The generated story is then converted to audio using text-to-speech technology. You can both see the generated story and listen to it.
# Demo
- Launching the application

- Select an image and Upload

- Image

- Download the audio story
https://github.com/SartajBhuvaji/Image-to-Story-Generator/assets/31826483/1fe00f34-9716-4047-9b57-a7794524816a
## Features
- Upload an image.
- Generate a story based on the content of the image.
- Listen to the generated story as an audio file.
## Usage
1. Clone this repository to your local machine.
```bash
git clone https://github.com/SartajBhuvaji/Image-to-Story-Generator.git
pip install -r requirements.txt
python app.py
```
`Create a .env file and paste your HUGGINGFACE, OPEN AI API Keys (Check the dummy_env file)`
`Open your web browser and navigate to http://localhost:7860 to access the app.`
`Upload an image to the app and click "Generate Story." You will see the generated story and be able to listen to it as audio.`
# Tech
- HuggingFace
- Image to Caption model
- Chat GPT 3.5 LLM
- Text-to-speech
|