Spaces:

CognizantAI
/

marketing-image-generator

Running

App Files Files Community

Noo88ear commited on 26 days ago

Commit

ae9c474

verified ·

1 Parent(s): 265fa55

Update README.md

Browse files

Files changed (1) hide show

README.md +65 -23

README.md CHANGED Viewed

@@ -4,19 +4,21 @@ emoji: 🎨
 colorFrom: blue
 colorTo: purple
 sdk: gradio
-sdk_version: 5.39.0
 app_file: app.py
 pinned: false
 ---
 # Marketing Image Generator with Agent Review
-A sophisticated AI-powered image generation system that creates high-quality marketing images with automated quality review and refinement. Built on modern AI technologies including Google's Imagen3 and advanced agent orchestration.
 ## Features
 - **AI-Powered Image Generation**: Create stunning marketing images from text prompts using Google's Imagen4 via MCP server
-- **Automated Quality Review**: Intelligent Gemini agent (2.5-Pro) automatically reviews and refines generated images
 - **Marketing-Focused**: Optimized for marketing materials, social media, and promotional content
 - **Real-time Feedback**: Get instant quality scores and improvement suggestions
 - **Professional Workflow**: Streamlined process from concept to final image
@@ -56,11 +58,11 @@ A sophisticated AI-powered image generation system that creates high-quality mar
 ### Core Components
-- **Agent 1 (Image Generator)**: Creates images using Google's Imagen3 via MCP server integration
 - **Agent 2 (Marketing Reviewer)**: Analyzes image quality and provides marketing-focused feedback using Gemini Vision
 - **Orchestrator**: Manages workflow between agents and handles handover
 - **Web Interface**: Gradio-based user interface optimized for Hugging Face
-- **MCP Server Integration**: Model Context Protocol for seamless Imagen3 access
 ### System Architecture and Workflow
@@ -73,18 +75,18 @@ A sophisticated AI-powered image generation system that creates high-quality mar
 │Reviewer     │───▶│             │───▶│  Agent 2 (Gemini) Marketing │
 │Prompt       │    │             │    │  Reviewer                   │
 │             │    │             │    │                             │
-│             │    │             │    │  ┌─────────────────────────┐│
-│             │    │             │    │  │ Ag1: Imagen4 (via MCP)  ││
-│             │    │             │    │  │                         ││
-│             │    │             │    │  │  Draft Image Creation   ││
-│             │    │             │    │  └─────────────────────────┘│
 │             │    │             │    │                             │
-│             │    │             │    │  ┌─────────────────────────┐│
-│             │    │             │    │  │Ag2;Draft Image Reviewed ││
-│             │    │             │    │  │  & Changes Suggested    ││
-│             │    │             │    │  └─────────────────────────┘│
 │             │    │             │    │                             │
-│ Image       │◀───│             │◀───│  Final Image Response       │
 │ Response    │    │             │    │                             │
 └─────────────┘    └─────────────┘    └─────────────────────────────┘
 ```
@@ -104,12 +106,12 @@ A sophisticated AI-powered image generation system that creates high-quality mar
 3. **Image Generation and Drafting (Top Right)**:
    - **Agent 1 (Gemini) Drafter**: Receives Image Prompt, orchestrates image generation
-   - **Imagen3 (via MCP)**: Agent 1 interacts with Imagen4 through MCP server to create initial image draft
 4. **Marketing Review and Refinement (Bottom Right)**:
    - **Agent 2 (Gemini) Marketing Reviewer**: Receives Reviewer Prompt, evaluates generated image against marketing criteria
    - **Draft Image Reviewed and Changes Suggested**: Agent 2's review process output
-   - **Iterative Refinement Loop**: Bidirectional feedback between Agent 2 and Imagen3 (via Agent 1) to refine image until it meets marketing standards
    - Final **Image Response** sent back to Gradio UI
 ### Summary of Flow:
@@ -117,12 +119,12 @@ User provides prompts → Gradio UI → Agent 1 drafts image with Imagen4 → Ag
 ### Technology Stack
-- **AI Models**: Google Imagen4 (via MCP), Gemini Vision
 - **Framework**: Gradio (Web Interface)
 - **Orchestration**: Custom agent handover system
 - **Deployment**: Hugging Face Spaces
 - **Authentication**: Google Cloud API Keys
-- **Protocol**: MCP (Model Context Protocol) for Imagen3 integration
 ### Why A2A Was Not Applied
@@ -179,7 +181,7 @@ quality_score = result["data"]["review"]["quality_score"]
 - **Quality Threshold**: Minimum quality score for auto-approval
 - **Max Iterations**: Maximum refinement attempts
 - **Review Settings**: Customize review criteria
-- **MCP Configuration**: Imagen3 server settings
 ## Development
@@ -268,12 +270,52 @@ Access monitoring dashboards:
 1. **API Key Errors**: Ensure your Google API keys are valid and configured as HF secrets
 2. **Image Generation Fails**: Check your internet connection and API quotas
 3. **Review Not Working**: Verify the Gemini agent is running and configured correctly
-4. **MCP Connection Issues**: Check Imagen3 server connectivity and configuration
 ### Debug Mode
 Enable debug logging by setting `LOG_LEVEL=DEBUG` in your environment variables.
 ### Support
 For issues and questions:
@@ -287,7 +329,7 @@ This project is licensed under the MIT License - see the LICENSE file for detail
 ## Acknowledgments
-- Google AI for Imagen4 and Gemini technologies
 - Hugging Face for the deployment platform
 - Gradio for the web interface framework
-- The open-source community for various dependencies

 colorFrom: blue
 colorTo: purple
 sdk: gradio
+sdk_version: 5.38.2
 app_file: app.py
 pinned: false
+license: mit
+short_description: AI marketing image generator with GCP Imagen4 + Gemini 2.5
 ---
 # Marketing Image Generator with Agent Review
+A sophisticated AI-powered image generation system that creates high-quality marketing images with automated quality review and refinement. Built on modern AI technologies including Google's Imagen4 and Gemini 2.5 Pro with advanced agent orchestration.
 ## Features
 - **AI-Powered Image Generation**: Create stunning marketing images from text prompts using Google's Imagen4 via MCP server
+- **Automated Quality Review**: Intelligent Gemini agent automatically reviews and refines generated images
 - **Marketing-Focused**: Optimized for marketing materials, social media, and promotional content
 - **Real-time Feedback**: Get instant quality scores and improvement suggestions
 - **Professional Workflow**: Streamlined process from concept to final image
 ### Core Components
+- **Agent 1 (Image Generator)**: Creates images using Google's Imagen4 via MCP server integration
 - **Agent 2 (Marketing Reviewer)**: Analyzes image quality and provides marketing-focused feedback using Gemini Vision
 - **Orchestrator**: Manages workflow between agents and handles handover
 - **Web Interface**: Gradio-based user interface optimized for Hugging Face
+- **MCP Server Integration**: Model Context Protocol for seamless Imagen4 access
 ### System Architecture and Workflow
 │Reviewer     │───▶│             │───▶│  Agent 2 (Gemini) Marketing │
 │Prompt       │    │             │    │  Reviewer                   │
 │             │    │             │    │                             │
+│             │    │             │    │  ┌─────────────────────────┐ │
+│             │    │             │    │  │   Imagen4 (via MCP)     │ │
+│             │    │             │    │  │                         │ │
+│             │    │             │    │  │  Draft Image Creation   │ │
+│             │    │             │    │  └─────────────────────────┘ │
 │             │    │             │    │                             │
+│             │    │             │    │  ┌─────────────────────────┐ │
+│             │    │             │    │  │  Draft Image Reviewed   │ │
+│             │    │             │    │  │  & Changes Suggested    │ │
+│             │    │             │    │  └─────────────────────────┘ │
 │             │    │             │    │                             │
+│ Image       │◀───│             │◀───│  Final Image Response      │
 │ Response    │    │             │    │                             │
 └─────────────┘    └─────────────┘    └─────────────────────────────┘
 ```
 3. **Image Generation and Drafting (Top Right)**:
    - **Agent 1 (Gemini) Drafter**: Receives Image Prompt, orchestrates image generation
+   - **Imagen4 (via MCP)**: Agent 1 interacts with Imagen4 through MCP server to create initial image draft
 4. **Marketing Review and Refinement (Bottom Right)**:
    - **Agent 2 (Gemini) Marketing Reviewer**: Receives Reviewer Prompt, evaluates generated image against marketing criteria
    - **Draft Image Reviewed and Changes Suggested**: Agent 2's review process output
+   - **Iterative Refinement Loop**: Bidirectional feedback between Agent 2 and Imagen4 (via Agent 1) to refine image until it meets marketing standards
    - Final **Image Response** sent back to Gradio UI
 ### Summary of Flow:
 ### Technology Stack
+- **AI Models**: Google Imagen4 (via MCP), Gemini 2.5 Pro Vision
 - **Framework**: Gradio (Web Interface)
 - **Orchestration**: Custom agent handover system
 - **Deployment**: Hugging Face Spaces
 - **Authentication**: Google Cloud API Keys
+- **Protocol**: MCP (Model Context Protocol) for Imagen4 integration
 ### Why A2A Was Not Applied
 - **Quality Threshold**: Minimum quality score for auto-approval
 - **Max Iterations**: Maximum refinement attempts
 - **Review Settings**: Customize review criteria
+- **MCP Configuration**: Imagen4 server settings
 ## Development
 1. **API Key Errors**: Ensure your Google API keys are valid and configured as HF secrets
 2. **Image Generation Fails**: Check your internet connection and API quotas
 3. **Review Not Working**: Verify the Gemini agent is running and configured correctly
+4. **MCP Connection Issues**: Check Imagen4 server connectivity and configuration
+### Content Policy & Brand Restrictions
+Google's AI models have built-in safety guardrails that may cause timeouts or rejections for certain content types:
+#### 🚫 **Highly Restricted Content** (Likely to cause stalls/timeouts):
+- **Political Figures**: Named world leaders, politicians (e.g., "Putin", "Zelensky", "Biden")
+- **Political Buildings**: Government buildings like "10 Downing Street", "White House"
+- **Geopolitical Content**: War, conflict, or sensitive international relations
+- **Financial Institution Brands**: Major banks like "HSBC", "Bank of America", "JPMorgan"
+#### ⚠️ **Moderately Restricted Content** (May cause delays):
+- **Regulated Industries**: Healthcare, pharmaceutical, financial services
+- **Some Corporate Brands**: Varies by sector and brand sensitivity
+#### ✅ **Generally Permitted Content**:
+- **Technology Brands**: "Cognizant", "Microsoft", "IBM", "Accenture"
+- **Generic Business**: "Professional office", "corporate environment"
+- **Non-branded Content**: Generic descriptions without specific brand names
+#### 🔧 **Workarounds for Restricted Content**:
+**Instead of**: `"Professional boardroom with HSBC signage"`
+**Use**: `"Professional boardroom with international banking corporation signage in red and white colors"`
+**Instead of**: `"Meeting with political leaders"`
+**Use**: `"Meeting with business executives in government-style building"`
+**Strategy**: Move brand-specific requirements to **Review Guidelines** instead of the main prompt:
+- **Main Prompt**: `"Professional corporate environment"`
+- **Review Guidelines**: `"Ensure branding reflects HSBC corporate colors (red and white)"`
+This approach bypasses content filters while still providing guidance for review.
 ### Debug Mode
 Enable debug logging by setting `LOG_LEVEL=DEBUG` in your environment variables.
+### Content Policy Testing
+Use the included diagnostic scripts to test content restrictions:
+- `debug_hsbc_prompt.py` - Test financial brand restrictions
+- `test_cognizant_brand.py` - Test tech brand accessibility
+- `test_brand_workaround.py` - Test workaround strategies
 ### Support
 For issues and questions:
 ## Acknowledgments
+- Google AI for Imagen4 and Gemini 2.5 Pro technologies
 - Hugging Face for the deployment platform
 - Gradio for the web interface framework
+- The open-source community for various dependencies