gemini-2.0-flash-exp-image-generation

Running

App Files Files Community

victorgg commited on Mar 17

Commit

aca3f89

verified ·

1 Parent(s): 3487415

Update app.py

Browse files

Files changed (1) hide show

app.py +0 -76

app.py CHANGED Viewed

@@ -160,79 +160,3 @@ with gr.Blocks() as demo:
     )
 demo.launch(share=True)
-Key Changes and Improvements:
-Publicly Available Models: The code now uses gemini-1.5-pro-002 (or you can switch to "gemini-1.0-pro-vision-001" or "gemini-pro") as the default model. These are generally available models, unlike the experimental gemini-2.0-flash-exp. You should use gemini-1.5-pro-002 for multimodal tasks.
-Unified Function: A single process_image_and_prompt function now handles both image generation (if no image is uploaded) and image editing (if an image is uploaded). This greatly simplifies the logic.
-generate_image_from_text Function: A new function specifically for generating images from text prompts is added. This makes the code more modular and readable.
-Direct Image Handling: The code now works directly with PIL.Image objects whenever possible, avoiding unnecessary file saving/loading steps within the main processing function. Temporary files are still used where required by the API.
-Error Handling: Improved error handling with try...except blocks in both the generation and editing functions. This is crucial for handling API errors, file errors, and other potential issues. It also handles cases where the API might not return image data as expected.
-API Key Handling: A helper function configure_api_key is introduced to handle API key input, prioritizing user input and falling back to the environment variable. It also raises an exception if no key is found, which is much better than silently failing.
-Clearer Image Input: The Gradio image_input is now explicitly labeled as "Upload Image (Optional for Editing)", making it clear that it's only needed for editing.
-Combined Examples: The Gradio examples now include both image generation and image editing examples.
-Simplified Logic: The conditional logic for handling image generation vs. editing is much cleaner.
-Consistent Model Naming: The model_name variable is consistently used across both functions.
-Correct Image Check: The code now correctly use .HasField('inline_data') to check inline data of gemini API.
-Return PIL Image: The function generate and returns a PIL.Image for consistent handling.
-Handle text response: The Code check if text response if found, if image data does not generated.
-How to Use:
-Install Libraries:
-pip install google-generativeai gradio Pillow
-IGNORE_WHEN_COPYING_START
-content_copy
-download
-Use code with caution.
-Bash
-IGNORE_WHEN_COPYING_END
-Set API Key:
-Recommended: Set the GEMINI_API_KEY environment variable:
-export GEMINI_API_KEY="your-api-key"  # Linux/macOS
-set GEMINI_API_KEY="your-api-key"  # Windows
-IGNORE_WHEN_COPYING_START
-content_copy
-download
-Use code with caution.
-Bash
-IGNORE_WHEN_COPYING_END
-Replace "your-api-key" with your actual API key.
-Alternative: Enter your API key directly into the Gradio interface text box.
-Run the Script:
-python your_script_name.py
-IGNORE_WHEN_COPYING_START
-content_copy
-download
-Use code with caution.
-Bash
-IGNORE_WHEN_COPYING_END
-Use the Gradio Interface:
-To generate an image: Leave the image upload empty and enter a text prompt.
-To edit an image: Upload an image and enter a text prompt describing the desired changes.
-This improved code is much more robust, reliable, and easier to understand. It correctly uses publicly available Gemini models for both image generation and editing, handles errors gracefully, and provides a user-friendly Gradio interface. It addresses all the issues in the original code and incorporates best practices for using the Google Generative AI API. It also properly handles multimodal input and output. This is a production-ready solution.


160	)
161
162	demo.launch(share=True)