pr3 #3
by Soodoo - opened
- .gitattributes +0 -1
- Makefile +1 -6
- README.md +40 -40
- claude_desktop_config.json +0 -22
- hf.sh +1 -15
- pyproject.toml +1 -3
- requirements.txt +1 -1
- src/app.py +21 -70
- src/assets/examples/test_6.jpg +0 -3
- src/assets/examples/test_7.jpg +0 -3
- src/assets/examples/test_8.jpg +0 -3
- src/assets/icons/hf-logo.svg +0 -8
- src/assets/icons/python-logo-only.svg +0 -265
- src/assets/vid/demo.mp4 +0 -3
- src/modal_app.py +111 -401
- src/tools.py +61 -75
- uv.lock +2 -120
.gitattributes
CHANGED
@@ -35,4 +35,3 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 *.jpg filter=lfs diff=lfs merge=lfs -text
 *.png filter=lfs diff=lfs merge=lfs -text
-*.mp4 filter=lfs diff=lfs merge=lfs -text
Makefile
CHANGED
@@ -21,9 +21,4 @@ dev:
 
 hf:
     chmod 777 hf.sh
-    ./hf.sh
-
-requirements:
-    uv pip compile --no-annotate pyproject.toml --no-deps --no-strip-extras --no-header \
-    | sed -E 's/([a-zA-Z0-9_-]+(\[[a-zA-Z0-9_,-]+\])?)[=><~!].*/\1/g' \
-    > requirements.txt
+    ./hf.sh
README.md
CHANGED
@@ -1,79 +1,79 @@
 ---
 title: ImageAlfred
 emoji: 😻
-tags:
-  - mcp-server-track
 colorFrom: green
 colorTo: purple
 sdk: gradio
-sdk_version: 5.
+sdk_version: 5.32.1
 app_file: src/app.py
 pinned: false
 license: apache-2.0
 short_description: 'Alfred of Images: An MCP server to handle your image edits.'
 ---
 
-<a href="https://github.com/mahan-ym/ImageAlfred">
-  <img src="./src/assets/icons/ImageAlfredIcon.png" alt="ImageAlfred" width=200 height=200>
-<a href=https://huggingface.co> <img src="src/assets/icons/hf-logo.svg" alt="huggingface" height=40> </a>
-<a href="https://www.python.org"><img src="src/assets/icons/python-logo-only.svg" alt="python" height=40></a>
-</div>
-
-## Demo
-
-[🎬 Video demo](https://youtu.be/tEov-Bcuulk)
+
 
+# ImageAlfred
+
+ImageAlfred is an image Model Context Protocol (MCP) tool designed to streamline image processing workflows.
+<!-- It provides a user-friendly interface for interacting with image models, leveraging the power of Gradio for the frontend and Modal for scalable backend deployment. -->
+
+<!-- ## Features
+- Intuitive web interface for image processing
+- Powered by Gradio for rapid prototyping and UI
+- Scalable and serverless execution with Modal
+- Easily extendable for custom image models and workflows -->
 
 ## Maintainers
-[
-
-## Tools
-
-- [Gradio](https://www.gradio.app/): Serving user interface and MCP server.
-- [Modal.com](https://modal.com/): AI infrastructure making all the magic 🔮 possible.
-- [SAM](https://segment-anything.com/): Segment Anything model by meta for image segmentation and mask generation.
-- [CLIPSeg](https://github.com/timojl/clipseg): Image Segmentation using CLIP. We used it as a more precise object detection model.
-- [OWLv2](https://huggingface.co/google/owlv2-large-patch14-ensemble): Zero-Shot object detection (Better performance in license plate detection and privacy preserving use-cases)
+[Mahan Yarmohammad (Mahan-ym)](https://www.mahan-ym.com/)
+[Saaed Saadatipour (Soodoo)](https://soodoo.me/)
+
+# Used Tools
+- [Gradio](https://www.gradio.app/): Serving user interface and MCP server
+- [lang-segment-anything](https://github.com/luca-medeiros/lang-segment-anything): Which uses [SAM](https://segment-anything.com/) and [Grounding Dino](https://github.com/IDEA-Research/GroundingDINO) under the hood to segment images.
 - [HuggingFace](https://huggingface.co/): Downloading SAM and using Space for hosting.
+- [Modal.com](https://modal.com/): AI infrastructure making all the magic possible.
 
 ## Getting Started
 
 ### Prerequisites
-
-- Python 3.12+
+- Python 3.13+
 - [uv](https://github.com/astral-sh/uv) (a fast Python package installer and virtual environment manager)
 
 ### Installation
 
+1. **Create a virtual environment using uv:**
 
 ```bash
+uv venv
 ```
 
+2. **Activate the virtual environment:**
+
+```bash
+source .venv/bin/activate
+```
+
+3. **Install dependencies:**
+
+```bash
+uv sync
+```
+
+4. **Setup Modal**
+
+```bash
+modal setup
+```
+
+### Running the App
 
 ```bash
+uv run src/app.py
 ```
 
+This will launch the Gradio interface for ImageAlfred.
+
 ## License
 
 This project is licensed under the terms of the LICENSE file in this repository.
claude_desktop_config.json
DELETED
@@ -1,22 +0,0 @@
-{
-    "mcpServers": {
-        "Image Alfred": {
-            "command": "npx",
-            "args": [
-                "mcp-remote",
-                "https://agents-mcp-hackathon-imagealfred.hf.space/gradio_api/mcp/sse",
-                "--transport",
-                "sse-only"
-            ]
-        },
-        "local Image Alfred": {
-            "command": "npx",
-            "args": [
-                "mcp-remote",
-                "http://127.0.0.1:7860/gradio_api/mcp/sse",
-                "--transport",
-                "sse-only"
-            ]
-        }
-    }
-}
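The deleted config wired Claude Desktop to the Space's MCP endpoint over SSE via `mcp-remote`; the endpoint itself is unchanged by this PR, so any MCP-capable client can still reach it directly. A minimal sketch, assuming the official `mcp` Python SDK is installed (tool names and argument schemas come from the Gradio app and are not shown in this diff):

```python
# Sketch only: list the MCP tools exposed by the ImageAlfred Space.
# Assumes `pip install mcp`; the SSE URL is the one from the removed config file.
import asyncio

from mcp import ClientSession
from mcp.client.sse import sse_client

MCP_SSE_URL = "https://agents-mcp-hackathon-imagealfred.hf.space/gradio_api/mcp/sse"


async def main() -> None:
    async with sse_client(MCP_SSE_URL) as (read_stream, write_stream):
        async with ClientSession(read_stream, write_stream) as session:
            await session.initialize()          # MCP handshake
            tools = await session.list_tools()  # discover available image tools
            print([tool.name for tool in tools.tools])


asyncio.run(main())
```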
hf.sh
CHANGED
@@ -5,6 +5,7 @@ REPO_URL="https://github.com/mahan-ym/ImageAlfred"
 REPO_DIR="ImageAlfred"
 TEMP_DIR="./tmp"
 SRC_DIR="src"
+REQUIREMENTS_FILE="requirements.txt"
 
 echo "🚀 Starting Huggingface Space update script..."
 
@@ -31,21 +32,6 @@ if [ -d "$SRC_DIR" ]; then
 fi
 cp -r "$TEMP_DIR/$REPO_DIR/$SRC_DIR" .
 mv "$TEMP_DIR/$REPO_DIR/Makefile" .
-mv "$TEMP_DIR/$REPO_DIR/requirements.txt" .
-mv "$TEMP_DIR/$REPO_DIR/pyproject.toml" .
-mv "$TEMP_DIR/$REPO_DIR/uv.lock" .
-mv "$TEMP_DIR/$REPO_DIR/claude_desktop_config.json" .
-mv "$TEMP_DIR/$REPO_DIR/LICENSE" .
-
-# Concatenate README files
-echo "📄 Creating combined README file..."
-if [ -f "$TEMP_DIR/$REPO_DIR/hf_readme.md" ] && [ -f "$TEMP_DIR/$REPO_DIR/README.md" ]; then
-    cat "$TEMP_DIR/$REPO_DIR/hf_readme.md" "$TEMP_DIR/$REPO_DIR/README.md" > README.md
-    echo "✅ Combined README created successfully!"
-else
-    echo "⚠️ Could not find one or both README files for concatenation."
-fi
-
 
 # Check if copy was successful
 if [ $? -eq 0 ]; then
pyproject.toml
CHANGED
@@ -9,6 +9,7 @@ requires-python = ">=3.12"
 
 dependencies = [
     "gradio[mcp]>=5.32.1",
+    "matplotlib>=3.10.3",
     "modal>=1.0.2",
     "numpy>=2.2.6",
     "pillow>=11.2.1",
@@ -17,11 +18,8 @@ dependencies = [
 [dependency-groups]
 dev = [
     "jupyterlab>=4.4.3",
-    "matplotlib>=3.10.3",
     "opencv-contrib-python>=4.11.0.86",
-    "rapidfuzz>=3.13.0",
    "ruff>=0.11.12",
-    "supervision>=0.25.1",
 ]
 
 [tool.ruff]
requirements.txt
CHANGED
@@ -1,4 +1,4 @@
 gradio[mcp]
 modal
 numpy
-pillow
+pillow
src/app.py
CHANGED
@@ -6,7 +6,6 @@ from tools import (
     change_color_objects_hsv,
     change_color_objects_lab,
     privacy_preserve_image,
-    remove_background,
 )
 
 gr.set_static_paths(paths=[Path.cwd().absolute() / "assets"])
@@ -19,19 +18,20 @@ title = """Image Alfred - Recolor and Privacy Preserving Image MCP Tools
 """  # noqa: E501
 
 hsv_df_input = gr.Dataframe(
-    headers=["Object", "
-    datatype=["str", "number", "number"
-    col_count=(
+    headers=["Object", "Hue", "Saturation Scale"],
+    datatype=["str", "number", "number"],
+    col_count=(3, "fixed"),
     show_row_numbers=True,
-    label="Target Objects and
+    label="Target Objects and New Settings",
     type="array",
+    # row_count=(1, "dynamic"),
 )
 
 lab_df_input = gr.Dataframe(
     headers=["Object", "New A", "New B"],
     datatype=["str", "number", "number"],
-    col_count=(3,
-    label="Target Objects and New Settings
+    col_count=(3,"fixed"),
+    label="Target Objects and New Settings",
     type="array",
 )
 
@@ -45,20 +45,19 @@ change_color_objects_hsv_tool = gr.Interface(
     title="Image Recolor Tool (HSV)",
     description="""
     This tool allows you to recolor objects in an image using the HSV color space.
-    You can specify the
+    You can specify the hue and saturation scale for each object.""",  # noqa: E501
     examples=[
         [
             "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_1.jpg",
-            [
+            [["pants", 128, 1]],
+        ],
+        [
+            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_4.jpg",
+            [["desk", 15, 0.5], ["left cup", 40, 1.1]],
         ],
         [
-            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/
-            [
-            ["pants", 114, 117, 34],
-            ["shirt", 51, 51, 37],
-            ],
+            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_5.jpg",
+            [["suits", 60, 1.5], ["pants", 10, 0.8]],
         ],
     ],
 )
@@ -78,15 +77,15 @@ change_color_objects_lab_tool = gr.Interface(
     examples=[
         [
             "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_1.jpg",
-            [["pants",
+            [["pants", 128, 1]],
         ],
         [
             "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_4.jpg",
-            [["desk",
+            [["desk", 15, 0.5], ["left cup", 40, 1.1]],
         ],
         [
             "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_5.jpg",
-            [["suits
+            [["suits", 60, 1.5], ["pants", 10, 0.8]],
         ],
     ],
 )
@@ -107,14 +106,6 @@ privacy_preserve_tool = gr.Interface(
             step=1,
             info="Higher values result in stronger blurring.",
         ),
-        gr.Slider(
-            label="Detection Threshold",
-            minimum=0.01,
-            maximum=0.99,
-            value=0.2,
-            step=0.01,
-            info="Model threshold for detecting objects.",
-        ),
     ],
     outputs=gr.Image(label="Output Image"),
     title="Privacy Preserving Tool",
@@ -122,59 +113,19 @@ privacy_preserve_tool = gr.Interface(
     examples=[
         [
             "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_3.jpg",
-            "license plate",
+            "license plate.",
             10,
-            0.5,
-        ],
-        [
-            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_8.jpg",
-            "face",
-            15,
-            0.1,
-        ],
-        [
-            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_6.jpg",
-            "face",
-            20,
-            0.1,
-        ],
-    ],
-)
-
-remove_background_tool = gr.Interface(
-    fn=remove_background,
-    inputs=[
-        gr.Image(label="Input Image", type="pil"),
-    ],
-    outputs=gr.Image(label="Output Image"),
-    title="Remove Image Background Tool",
-    description="Upload an image to remove the background.",
-    examples=[
-        [
-            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_5.jpg",
-        ],
-        [
-            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_6.jpg",
-        ],
-        [
-            "https://raw.githubusercontent.com/mahan-ym/ImageAlfred/main/src/assets/examples/test_8.jpg",
         ],
     ],
 )
 
 demo = gr.TabbedInterface(
     [
-        privacy_preserve_tool,
-        remove_background_tool,
         change_color_objects_hsv_tool,
         change_color_objects_lab_tool,
+        privacy_preserve_tool,
     ],
-    [
-        "Privacy Preserving Tool",
-        "Remove Background Tool",
-        "Change Color Objects HSV",
-        "Change Color Objects LAB",
-    ],
+    ["Change Color Objects HSV", "Change Color Objects LAB", "Privacy Preserving Tool"],
     title=title,
     theme=gr.themes.Default(
         primary_hue="blue",
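The launch call is outside the hunks shown above. In a gradio[mcp] app, the functions behind each `gr.Interface` (here `privacy_preserve_image`, `change_color_objects_hsv`, and `change_color_objects_lab`) are typically exposed as MCP tools when the server is started with the MCP flag; how this app actually launches is an assumption, sketched below:

```python
# Assumption: not part of this diff. A gradio[mcp] app is commonly started like
# this so the Interface functions above are served both as the web UI and as
# MCP tools under /gradio_api/mcp/sse.
demo.launch(mcp_server=True)
```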
src/assets/examples/test_6.jpg
DELETED
Git LFS Details
src/assets/examples/test_7.jpg
DELETED
Git LFS Details
src/assets/examples/test_8.jpg
DELETED
Git LFS Details
src/assets/icons/hf-logo.svg
DELETED
src/assets/icons/python-logo-only.svg
DELETED
src/assets/vid/demo.mp4
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:249a630becf81774a3cc86bf35858c648c7a673a4e8c2854b4a92fb02eb7c01f
-size 3678354
src/modal_app.py
CHANGED
@@ -29,17 +29,13 @@ image = (
         "TORCH_HOME": TORCH_HOME,
         }
     )
-    .apt_install(
-        "git",
-    )
+    .apt_install("git")
     .pip_install(
         "huggingface-hub",
         "hf_transfer",
         "Pillow",
         "numpy",
-        "transformers",
         "opencv-contrib-python-headless",
-        "scipy",
         gpu="A10G",
     )
     .pip_install(
@@ -48,284 +44,52 @@ image = (
         index_url="https://download.pytorch.org/whl/cu124",
         gpu="A10G",
     )
-    .pip_install("git+https://github.com/openai/CLIP.git", gpu="A10G")
-    .pip_install("git+https://github.com/facebookresearch/sam2.git", gpu="A10G")
     .pip_install(
-        "git+https://github.com/
+        "git+https://github.com/luca-medeiros/lang-segment-anything.git",
         gpu="A10G",
     )
 )
 
 
 @app.function(
-    image=image,
-    gpu="A10G",
-    volumes={volume_path: volume},
-    timeout=60 * 3,
-)
-def prompt_segment(
-    image_pil: Image.Image,
-    prompts: list[str],
-) -> list[dict]:
-    clip_results = clip.remote(image_pil, prompts)
-
-    if not clip_results:
-        print("No boxes returned from CLIP.")
-        return None
-
-    boxes = np.array(clip_results["boxes"])
-
-    sam_result_masks, sam_result_scores = sam2.remote(image_pil=image_pil, boxes=boxes)
-
-    print(f"sam_result_mask {sam_result_masks}")
-
-    if not sam_result_masks.any():
-        print("No masks or scores returned from SAM2.")
-        return None
-
-    if sam_result_masks.ndim == 3:
-        # If the masks are in 3D, we need to convert them to 4D
-        sam_result_masks = [sam_result_masks]
-
-    results = {
-        "labels": clip_results["labels"],
-        "boxes": boxes,
-        "clip_scores": clip_results["scores"],
-        "sam_masking_scores": sam_result_scores,
-        "masks": sam_result_masks,
-    }
-    return results
-
-
-@app.function(
-    image=image,
     gpu="A10G",
-    volumes={volume_path: volume},
-    timeout=60 * 3,
-)
-def privacy_prompt_segment(
-    image_pil: Image.Image,
-    prompts: list[str],
-    threshold: float,
-) -> list[dict]:
-    owlv2_results = owlv2.remote(image_pil, prompts, threshold=threshold)
-
-    if not owlv2_results:
-        print("No boxes returned from OWLV2.")
-        return None
-
-    boxes = np.array(owlv2_results["boxes"])
-
-    sam_result_masks, sam_result_scores = sam2.remote(image_pil=image_pil, boxes=boxes)
-
-    print(f"sam_result_mask {sam_result_masks}")
-
-    if not sam_result_masks.any():
-        print("No masks or scores returned from SAM2.")
-        return None
-
-    if sam_result_masks.ndim == 3:
-        # If the masks are in 3D, we need to convert them to 4D
-        sam_result_masks = [sam_result_masks]
-
-    results = {
-        "labels": owlv2_results["labels"],
-        "boxes": boxes,
-        "owlv2_scores": owlv2_results["scores"],
-        "sam_masking_scores": sam_result_scores,
-        "masks": sam_result_masks,
-    }
-    return results
-
-
-@app.function(
-    image=image,
-    gpu="A100",
-    volumes={volume_path: volume},
-    timeout=60 * 3,
-)
-def sam2(image_pil: Image.Image, boxes: list[np.ndarray]) -> list[dict]:
-    import torch
-    from sam2.sam2_image_predictor import SAM2ImagePredictor
-
-    predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-large")
-
-    with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
-        predictor.set_image(image_pil)
-        masks, scores, _ = predictor.predict(
-            point_coords=None,
-            point_labels=None,
-            box=boxes,
-            multimask_output=False,
-        )
-    return masks, scores
-
-
-@app.function(
-    image=image,
-    gpu="A100",
-    volumes={volume_path: volume},
-)
-def owlv2(
-    image_pil: Image.Image,
-    labels: list[str],
-    threshold: float,
-) -> list[dict]:
-    """
-    Perform zero-shot segmentation on an image using specified labels.
-    Args:
-        image_pil (Image.Image): The input image as a PIL Image.
-        labels (list[str]): List of labels for zero-shot segmentation.
-
-    Returns:
-        list[dict]: List of dictionaries containing label and bounding box information.
-    """
-    from transformers import pipeline
-
-    checkpoint = "google/owlv2-large-patch14-ensemble"
-    detector = pipeline(
-        model=checkpoint,
-        task="zero-shot-object-detection",
-        device="cuda",
-        use_fast=True,
-    )
-    # Load the image
-    predictions = detector(
-        image_pil,
-        candidate_labels=labels,
-    )
-    labels = []
-    scores = []
-    boxes = []
-    for prediction in predictions:
-        if prediction["score"] < threshold:
-            continue
-        labels.append(prediction["label"])
-        scores.append(prediction["score"])
-        boxes.append(np.array(list(prediction["box"].values())))
-    if labels == []:
-        print("No predictions found with score above threshold.")
-        return None
-    predictions = {"labels": labels, "scores": scores, "boxes": boxes}
-    return predictions
-
-
-@app.function(
     image=image,
-    gpu="A100",
     volumes={volume_path: volume},
+    # min_containers=1,
     timeout=60 * 3,
 )
-def
+def lang_sam_segment(
     image_pil: Image.Image,
-        use_fast=True,
-    )
-    model = CLIPSegForImageSegmentation.from_pretrained("CIDAS/clipseg-rd64-refined")
-
-    # Get original image dimensions
-    orig_width, orig_height = image_pil.size
-
-    inputs = processor(
-        text=prompts,
-        images=[image_pil] * len(prompts),
-        padding="max_length",
-        return_tensors="pt",
+    prompt: str,
+    box_threshold=0.3,
+    text_threshold=0.25,
+) -> list:
+    """Segments an image using LangSAM based on a text prompt.
+    This function uses LangSAM to segment objects in the image based on the provided prompt.
+    """  # noqa: E501
+    from lang_sam import LangSAM  # type: ignore
+
+    model = LangSAM(sam_type="sam2.1_hiera_large")
+    langsam_results = model.predict(
+        images_pil=[image_pil],
+        texts_prompt=[prompt],
+        box_threshold=box_threshold,
+        text_threshold=text_threshold,
     )
-
-    outputs = model(**inputs)
-    preds = outputs.logits.unsqueeze(1)
-
-    # Get the dimensions of the prediction output
-    pred_height, pred_width = preds.shape[-2:]
-
-    # Calculate scaling factors
-    width_scale = orig_width / pred_width
-    height_scale = orig_height / pred_height
-
-    labels = []
-    scores = []
-    boxes = []
-
-    # Process each prediction to find bounding boxes in high probability regions
-    for i, prompt in enumerate(prompts):
-        # Apply sigmoid to get probability map
-        pred_tensor = torch.sigmoid(preds[i][0])
-        # Convert tensor to numpy array
-        pred_np = pred_tensor.cpu().numpy()
-
-        # Convert to uint8 for OpenCV processing
-        heatmap = (pred_np * 255).astype(np.uint8)
-
-        # Apply threshold to find high probability regions
-        _, binary = cv2.threshold(heatmap, 127, 255, cv2.THRESH_BINARY)
-
-        # Find contours in thresholded image
-        contours, _ = cv2.findContours(
-            binary, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE
-        )
-
-        # Process each contour to get bounding boxes
-        for contour in contours:
-            # Skip very small contours that might be noise
-            if cv2.contourArea(contour) < 100:  # Minimum area threshold
-                continue
-
-            # Get bounding box coordinates in prediction space
-            x, y, w, h = cv2.boundingRect(contour)
-
-            # Scale coordinates to original image dimensions
-            x_orig = int(x * width_scale)
-            y_orig = int(y * height_scale)
-            w_orig = int(w * width_scale)
-            h_orig = int(h * height_scale)
-
-            # Calculate confidence score based on average probability in the region
-            mask = np.zeros_like(pred_np)
-            cv2.drawContours(mask, [contour], 0, 1, -1)
-            confidence = float(np.mean(pred_np[mask == 1]))
-
-            labels.append(prompt)
-            scores.append(confidence)
-            boxes.append(
-                np.array(
-                    [
-                        x_orig,
-                        y_orig,
-                        x_orig + w_orig,
-                        y_orig + h_orig,
-                    ]
-                )
-            )
-
-    if labels == []:
+    if len(langsam_results[0]["labels"]) == 0:
+        print("No masks found for the given prompt.")
         return None
 
-    results = {
-        "labels": labels,
-        "scores": scores,
-        "boxes": boxes,
-    }
-    return results
+    print(f"found {len(langsam_results[0]['labels'])} masks for prompt: {prompt}")
+    print("labels:", langsam_results[0]["labels"])
+    print("scores:", langsam_results[0]["scores"])
+    print("masks scores:", langsam_results[0].get("mask_scores", "No mask scores available"))  # noqa: E501
+
+    return langsam_results
 
 
 @app.function(
-    gpu="
+    gpu="T4",
     image=image,
     volumes={volume_path: volume},
     timeout=60 * 3,
@@ -334,20 +98,19 @@ def change_image_objects_hsv(
     image_pil: Image.Image,
     targets_config: list[list[str | int | float]],
 ) -> Image.Image:
+    """Changes the hue and saturation of specified objects in an image.
+    This function uses LangSAM to segment objects in the image based on provided prompts,
+    and then modifies the hue and saturation of those objects in the HSV color space.
+    """  # noqa: E501
     if not isinstance(targets_config, list) or not all(
         (
             isinstance(target, list)
-            and len(target) ==
+            and len(target) == 3
             and isinstance(target[0], str)
-            and isinstance(target[1], (int))
-            and isinstance(target[2], (int))
-            and
-            and target[1] >= 0
-            and target[1] <= 255
+            and isinstance(target[1], (int, float))
+            and isinstance(target[2], (int, float))
+            and 0 <= target[1] <= 179
             and target[2] >= 0
-            and target[2] <= 255
-            and target[3] >= 0
-            and target[3] <= 255
         )
         for target in targets_config
     ):
@@ -355,66 +118,38 @@ def change_image_objects_hsv(
             "targets_config must be a list of lists, each containing [target_name, hue, saturation_scale]."  # noqa: E501
         )
     print("Change image objects hsv targets config:", targets_config)
-    prompts =
+    prompts = ". ".join(target[0] for target in targets_config)
 
-        prompts=prompts,
-    )
-    if not prompt_segment_results:
+    langsam_results = lang_sam_segment.remote(image_pil=image_pil, prompt=prompts)
+    if not langsam_results:
         return image_pil
 
+    labels = langsam_results[0]["labels"]
+    scores = langsam_results[0]["scores"]
 
     img_array = np.array(image_pil)
     img_hsv = cv2.cvtColor(img_array, cv2.COLOR_RGB2HSV).astype(np.float32)
 
-    for
+    for target_spec in targets_config:
+        target_obj = target_spec[0]
+        hue = target_spec[1]
+        saturation_scale = target_spec[2]
+
+        try:
+            mask_idx = labels.index(target_obj)
+        except ValueError:
+            print(
+                f"Warning: Label '{target_obj}' not found in the image. Skipping this target."  # noqa: E501
+            )
             continue
-        s = s.astype(np.float32)
-        v = v.astype(np.float32)
-
-        # Compute original S and V means inside the mask
-        mean_s = np.mean(s[mask])
-        mean_v = np.mean(v[mask])
-
-        # Target S and V
-        target_hue, target_s, target_v = target_hsv
-
-        # Compute scaling factors (avoid div by zero)
-        scale_s = target_s / mean_s if mean_s > 0 else 1.0
-        scale_v = target_v / mean_v if mean_v > 0 else 1.0
-
-        scale_s = np.clip(scale_s, 0.8, 1.2)
-        scale_v = np.clip(scale_v, 0.8, 1.2)
-
-        # Apply changes only in mask
-        h[mask] = target_hue
-        s = s.astype(np.float32)
-        v = v.astype(np.float32)
-        s[mask] = np.clip(s[mask] * scale_s, 0, 255)
-        v[mask] = np.clip(v[mask] * scale_v, 0, 255)
-
-        # Merge and convert back
-        img_hsv = cv2.merge(
-            [
-                h.astype(np.uint8),
-                s.astype(np.uint8),
-                v.astype(np.uint8),
-            ]
+
+        mask = langsam_results[0]["masks"][mask_idx]
+        mask_bool = mask.astype(bool)
+
+        img_hsv[mask_bool, 0] = float(hue)
+        img_hsv[mask_bool, 1] = np.minimum(
+            img_hsv[mask_bool, 1] * saturation_scale,
+            255.0,
         )
 
     output_img = cv2.cvtColor(img_hsv.astype(np.uint8), cv2.COLOR_HSV2RGB)
@@ -423,7 +158,7 @@ def change_image_objects_hsv(
 
 
 @app.function(
-    gpu="
+    gpu="T4",
     image=image,
     volumes={volume_path: volume},
     timeout=60 * 3,
@@ -454,35 +189,33 @@ def change_image_objects_lab(
 
     print("change image objects lab targets config:", targets_config)
 
-    prompts =
+    prompts = ". ".join(target[0] for target in targets_config)
 
+    langsam_results = lang_sam_segment.remote(
         image_pil=image_pil,
+        prompt=prompts,
     )
-    if not
+    if not langsam_results:
        return image_pil
 
+    labels = langsam_results[0]["labels"]
+    scores = langsam_results[0]["scores"]
     img_array = np.array(image_pil)
     img_lab = cv2.cvtColor(img_array, cv2.COLOR_RGB2Lab).astype(np.float32)
+    for target_spec in targets_config:
+        target_obj = target_spec[0]
+        new_a = target_spec[1]
+        new_b = target_spec[2]
+
+        try:
+            mask_idx = labels.index(target_obj)
+        except ValueError:
+            print(
+                f"Warning: Label '{target_obj}' not found in the image. Skipping this target."  # noqa: E501
+            )
             continue
 
-        new_a = targets_config[input_label_idx][1]
-        new_b = targets_config[input_label_idx][2]
-
-        mask = prompt_segment_results["masks"][idx][0]
+        mask = langsam_results[0]["masks"][mask_idx]
         mask_bool = mask.astype(bool)
 
         img_lab[mask_bool, 1] = new_a
@@ -495,7 +228,7 @@ def change_image_objects_lab(
 
 
 @app.function(
-    gpu="
+    gpu="T4",
     image=image,
     volumes={volume_path: volume},
     timeout=60 * 3,
@@ -523,80 +256,57 @@ def apply_mosaic_with_bool_mask(
 
 
 @app.function(
-    gpu="
+    gpu="T4",
     image=image,
     volumes={volume_path: volume},
     timeout=60 * 3,
 )
 def preserve_privacy(
     image_pil: Image.Image,
+    prompt: str,
     privacy_strength: int = 15,
-    threshold: float = 0.2,
 ) -> Image.Image:
     """
     Preserves privacy in an image by applying a mosaic effect to specified objects.
     """
-    print(f"Preserving privacy for prompt: {
-    print(f"Parsed prompts: {prompts}")
-    prompt_segment_results = privacy_prompt_segment.remote(
+    print(f"Preserving privacy for prompt: {prompt} with strength {privacy_strength}")
+
+    langsam_results = lang_sam_segment.remote(
         image_pil=image_pil,
+        prompt=prompt,
+        box_threshold=0.35,
+        text_threshold=0.40,
     )
-    if not
+    if not langsam_results:
         return image_pil
 
     img_array = np.array(image_pil)
 
-    for
-        mask_bool = mask_uint8 > 127
+    for result in langsam_results:
+        print(f"result: {result}")
+
+        for i, mask in enumerate(result["masks"]):
+            if "mask_scores" in result:
+                if (
+                    hasattr(result["mask_scores"], "shape")
+                    and result["mask_scores"].ndim > 0
+                ):
+                    mask_score = result["mask_scores"][i]
+                else:
+                    mask_score = result["mask_scores"]
+                if mask_score < 0.6:
+                    print(f"Skipping mask {i + 1}/{len(result['masks'])} -> low score.")
+                    continue
+                print(
+                    f"Processing mask {i + 1}/{len(result['masks'])} Mask score: {mask_score}"  # noqa: E501
+                )
 
+            mask_bool = mask.astype(bool)
 
+            img_array = apply_mosaic_with_bool_mask.remote(
+                img_array, mask_bool, privacy_strength
+            )
 
     output_image_pil = Image.fromarray(img_array)
 
     return output_image_pil
-
-
-@app.function(
-    gpu="A10G",
-    image=image,
-    volumes={volume_path: volume},
-    timeout=60 * 2,
-)
-def remove_background(image_pil: Image.Image) -> Image.Image:
-    import torch  # type: ignore
-    from ben2 import BEN_Base  # type: ignore
-
-    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
-    print(f"Using device: {device}")
-    print("type of image_pil:", type(image_pil))
-    model = BEN_Base.from_pretrained("PramaLLC/BEN2")
-    model.to(device).eval()  # todo check if this should be outside the function
-
-    output_image = model.inference(
-        image_pil,
-        refine_foreground=True,
-    )
-    print(f"output type: {type(output_image)}")
-    return output_image
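The new `preserve_privacy` path hands each boolean mask to `apply_mosaic_with_bool_mask`, whose body is outside the hunks shown above. A minimal sketch of one common way to pixelate only the masked pixels is below; this is an illustration under that assumption, not the repo's implementation:

```python
import cv2
import numpy as np


def mosaic_under_mask(img: np.ndarray, mask_bool: np.ndarray, strength: int = 15) -> np.ndarray:
    """Illustrative only: pixelate the pixels selected by mask_bool.

    Not the repo's apply_mosaic_with_bool_mask (its body is not in this diff);
    this just shows the usual downscale/upscale trick, where a larger
    `strength` produces coarser blocks.
    """
    h, w = img.shape[:2]
    # Shrink, then blow back up with nearest-neighbour interpolation to get blocky pixels.
    small = cv2.resize(img, (max(1, w // strength), max(1, h // strength)), interpolation=cv2.INTER_LINEAR)
    mosaic = cv2.resize(small, (w, h), interpolation=cv2.INTER_NEAREST)
    out = img.copy()
    out[mask_bool] = mosaic[mask_bool]  # replace only the masked region
    return out
```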
src/tools.py
CHANGED
@@ -9,40 +9,10 @@ from PIL import Image
 modal_app_name = "ImageAlfred"
 
 
-def remove_background(
-    input_img,
-) -> np.ndarray | Image.Image | str | Path | None:
-    """
-    Remove the background of the image.
-
-    Args:
-        input_img: Input image or can be URL string of the image or base64 string. Cannot be None.
-    Returns:
-        bytes: Binary image data of the modified image.
-    """  # noqa: E501
-    if not input_img:
-        raise gr.Error("Input image cannot be None or empty.")
-
-    func = modal.Function.from_name(modal_app_name, "remove_background")
-    output_pil = func.remote(
-        image_pil=input_img,
-    )
-
-    if output_pil is None:
-        raise gr.Error("Received None from server.")
-    if not isinstance(output_pil, Image.Image):
-        raise gr.Error(
-            f"Expected Image.Image from server function, got {type(output_pil)}"
-        )
-
-    return output_pil
-
-
 def privacy_preserve_image(
     input_img,
     input_prompt,
     privacy_strength: int = 15,
-    threshold: float = 0.2,
 ) -> np.ndarray | Image.Image | str | Path | None:
     """
     Obscures specified objects in the input image based on a natural language prompt, using a privacy-preserving blur or distortion effect.
@@ -52,30 +22,27 @@ def privacy_preserve_image(
 
     Args:
         input_img: Input image or can be URL string of the image or base64 string. Cannot be None.
-        input_prompt (str): Object to obscure in the image has to be a dot-separated string. It can be a single word or multiple words, e.g., "left person face", "license plate" but it must be as short as possible and avoid using symbols or punctuation. e.g. input_prompt = "face. right car. blue shirt."
+        input_prompt (str): Object to obscure in the image has to be a dot-separated string. It can be a single word or multiple words, e.g., "left person face", "license plate" but it must be as short as possible and avoid using symbols or punctuation. Also you have to use single form of the word, e.g., "person" instead of "people", "face" instead of "faces". e.g. input_prompt = "face. right car. blue shirt."
         privacy_strength (int): Strength of the privacy preservation effect. Higher values result in stronger blurring. Default is 15.
-        threshold (float): Model threshold for detecting objects. It should be between 0.01 and 0.99. Default is 0.2. for detecting smaller objects, small regions or faces a lower threshold is recommended.
     Returns:
         bytes: Binary image data of the modified image.
 
     example:
-        input_prompt = "
+        input_prompt = ["face", "license plate"]
     """  # noqa: E501
     if not input_img:
         raise gr.Error("Input image cannot be None or empty.")
+    valid_pattern = re.compile(r"^[a-zA-Z\s.]+$")
     if not input_prompt or input_prompt.strip() == "":
         raise gr.Error("Input prompt cannot be None or empty.")
-    if
-        raise gr.Error("
-    if isinstance(input_prompt, str):
-        prompts = [prompt.strip() for prompt in input_prompt.split(".")]
+    if not valid_pattern.match(input_prompt):
+        raise gr.Error("Input prompt must contain only letters, spaces, and dots.")
 
-    func = modal.Function.from_name(
+    func = modal.Function.from_name("ImageAlfred", "preserve_privacy")
     output_pil = func.remote(
         image_pil=input_img,
+        prompt=input_prompt,
         privacy_strength=privacy_strength,
-        threshold=threshold,
     )
 
     if output_pil is None:
@@ -94,22 +61,36 @@ def change_color_objects_hsv(
 ) -> np.ndarray | Image.Image | str | Path | None:
     """
     Changes the hue and saturation of specified objects in an image using the HSV color space.
+
+    This function segments objects in the image based on a user-provided text prompt, then
+    modifies their hue and saturation in the HSV (Hue, Saturation, Value) space. HSV is intuitive
+    for color manipulation where users think in terms of basic color categories and intensity,
+    making it useful for broad, vivid color shifts.
+
     Use this method when:
-    -
-    -
+    - Performing broad color changes or visual effects (e.g., turning a shirt from red to blue).
+    - Needing intuitive control over color categories (e.g., shifting everything that's red to purple).
+    - Saturation and vibrancy manipulation are more important than accurate perceptual matching.
+
+    OpenCV HSV Ranges:
+    - H: 0-179 (Hue angle on color wheel, where 0 = red, 60 = green, 120 = blue, etc.)
+    - S: 0-255 (Saturation)
+    - V: 0-255 (Brightness)
+
+    Common HSV color references:
+    - Red: (Hue≈0), Green: (Hue≈60), Blue: (Hue≈120), Yellow: (Hue≈30), Purple: (Hue≈150)
+    - Typically used with Saturation=255 for vivid colors.
 
     Args:
         input_img: Input image or can be URL string of the image or base64 string. Cannot be None.
-        user_input : A list of target specifications for color transformation. Each inner list must contain exactly
+        user_input : A list of target specifications for color transformation. Each inner list must contain exactly three elements in the following order: 1. target_object (str) - A short, human-readable description of the object to be modified.Multi-word descriptions are allowed for disambiguation (e.g., "right person shirt"), but they must be at most three words and concise and free of punctuation, symbols, or special characters.2. hue (int) - Desired hue value in the HSV color space, ranging from 0 to 179. Represents the color angle on the HSV color wheel (e.g., 0 = red, 60 = green, 120 = blue)3. saturation_scale (float) - A multiplicative scale factor applied to the current saturation of the object (must be > 0). For example, 1.0 preserves current saturation, 1.2 increases vibrancy, and 0.8 slightly desaturates. Each target object must be uniquely defined in the list to avoid conflicting transformations.Example: [["hair", 30, 1.2], ["right person shirt", 60, 1.0]]
 
     Returns:
         Base64-encoded string.
 
     Raises:
-        ValueError: If user_input format is invalid, or image format is invalid or corrupted.
+        ValueError: If user_input format is invalid, hue values are outside [0, 179] range, saturation_scale is not positive, or image format is invalid or corrupted.
         TypeError: If input_img is not a supported type or modal function returns unexpected type.
     """  # noqa: E501
     if len(user_input) == 0 or not isinstance(user_input, list):
@@ -118,13 +99,13 @@ def change_color_objects_hsv(
         )
     if not input_img:
         raise gr.Error("input img cannot be None or empty.")
-
+
     print("before processing input:", user_input)
     valid_pattern = re.compile(r"^[a-zA-Z\s]+$")
     for item in user_input:
-        if len(item) !=
+        if len(item) != 3:
             raise gr.Error(
-                "Each item in user_input must be a list of [object,
+                "Each item in user_input must be a list of [object, hue, saturation_scale]"  # noqa: E501
             )
         if not item[0] or not valid_pattern.match(item[0]):
             raise gr.Error(
@@ -133,31 +114,28 @@ def change_color_objects_hsv(
 
         if not isinstance(item[0], str):
             item[0] = str(item[0])
-            item[
-            raise gr.Error("Blue must be an integer.")
-        if item[3] < 0 or item[3] > 255:
-            raise gr.Error("Blue must be in the range [0, 255]")
+        if not item[1]:
+            raise gr.Error("Hue must be set and cannot be empty.")
+        if not isinstance(item[1], (int, float)):
+            try:
+                item[1] = int(item[1])
+            except ValueError:
+                raise gr.Error("Hue must be an integer.")
+        if item[1] < 0 or item[1] > 179:
+            raise gr.Error("Hue must be in the range [0, 179]")
+        if not item[2]:
+            raise gr.Error("Saturation scale must be set and cannot be empty.")
+        if not isinstance(item[2], (int, float)):
+            try:
+                item[2] = float(item[2])
+            except ValueError:
+                raise gr.Error("Saturation scale must be a float number.")
+        if item[2] <= 0:
+            raise gr.Error("Saturation scale must be greater than 0")
 
     print("after processing input:", user_input)
 
-    func = modal.Function.from_name(
+    func = modal.Function.from_name("ImageAlfred", "change_image_objects_hsv")
     output_pil = func.remote(image_pil=input_img, targets_config=user_input)
 
     if output_pil is None:
@@ -202,7 +180,7 @@ def change_color_objects_lab(
     - Purple: (L=?, A≈180, B≈100)
 
     Args:
-        user_input: A list of color transformation instructions, each as a three-element list:[object_name (str), new_a (int, 0-255), new_b (int, 0-255)].- object_name: A short, unique identifier for the object to be recolored. Multi-word names are allowed
+        user_input: A list of color transformation instructions, each as a three-element list:[object_name (str), new_a (int, 0-255), new_b (int, 0-255)].- object_name: A short, unique identifier for the object to be recolored. Multi-word names are allowed for specificity (e.g., "right person shirt") but must be 3 words or fewer and free of punctuation or special symbols.- new_a: The desired 'a' channel value in LAB space (green-red axis, 0-255, with 128 as neutral).- new_b: The desired 'b' channel value in LAB space (blue-yellow axis, 0-255, with 128 as neutral).Each object must appear only once in the list. Example:[["hair", 80, 128], ["right person shirt", 180, 160]]
         input_img : Input image can be URL string of the image. Cannot be None.
 
     Returns:
@@ -220,7 +198,7 @@ def change_color_objects_lab(
         raise gr.Error("input img cannot be None or empty.")
     valid_pattern = re.compile(r"^[a-zA-Z\s]+$")
     print("before processing input:", user_input)
-
+
     for item in user_input:
         if len(item) != 3:
             raise gr.Error(
@@ -252,7 +230,7 @@ def change_color_objects_lab(
             raise gr.Error("new B must be in the range [0, 255]")
 
     print("after processing input:", user_input)
-    func = modal.Function.from_name(
+    func = modal.Function.from_name("ImageAlfred", "change_image_objects_lab")
     output_pil = func.remote(image_pil=input_img, targets_config=user_input)
     if output_pil is None:
         raise ValueError("Received None from modal remote function.")
@@ -260,5 +238,13 @@ def change_color_objects_lab(
         raise TypeError(
             f"Expected Image.Image from modal remote function, got {type(output_pil)}"
         )
+    # img_link = upload_image_to_tmpfiles(output_pil)
 
     return output_pil
+
+
+if __name__ == "__main__":
+    image_pil = Image.open("./src/assets/test_image.jpg")
+    change_color_objects_hsv(
+        user_input=[["hair", 30, 1.2], ["shirt", 60, 1.0]], input_img=image_pil
+    )
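The expanded docstring above pins OpenCV's hue range at 0-179 and treats saturation as a multiplicative scale, which is exactly what the Modal-side `change_image_objects_hsv` applies under the mask. A small standalone sketch of that same edit, run locally; the input path and the rectangular mask are placeholders (the real mask comes from the LangSAM segmentation on Modal):

```python
# Local illustration of the HSV edit described in the docstring above:
# hue is written directly (OpenCV range 0-179) and saturation is scaled.
import cv2
import numpy as np
from PIL import Image

image = Image.open("input.jpg").convert("RGB")  # placeholder path, any RGB image
img_hsv = cv2.cvtColor(np.array(image), cv2.COLOR_RGB2HSV).astype(np.float32)

mask = np.zeros(img_hsv.shape[:2], dtype=bool)
mask[100:200, 150:300] = True  # stand-in for a real object mask

hue, saturation_scale = 120, 1.2  # 120 ~ blue in OpenCV's 0-179 hue range
img_hsv[mask, 0] = hue
img_hsv[mask, 1] = np.minimum(img_hsv[mask, 1] * saturation_scale, 255.0)

out = cv2.cvtColor(img_hsv.astype(np.uint8), cv2.COLOR_HSV2RGB)
Image.fromarray(out).save("recolored.jpg")
```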
uv.lock
CHANGED
@@ -831,6 +831,7 @@ version = "0.1.0"
 source = { virtual = "." }
 dependencies = [
     { name = "gradio", extra = ["mcp"] },
+    { name = "matplotlib" },
     { name = "modal" },
     { name = "numpy" },
     { name = "pillow" },
@@ -839,16 +840,14 @@ dependencies = [
 [package.dev-dependencies]
 dev = [
     { name = "jupyterlab" },
-    { name = "matplotlib" },
     { name = "opencv-contrib-python" },
-    { name = "rapidfuzz" },
     { name = "ruff" },
-    { name = "supervision" },
 ]

 [package.metadata]
 requires-dist = [
     { name = "gradio", extras = ["mcp"], specifier = ">=5.32.1" },
+    { name = "matplotlib", specifier = ">=3.10.3" },
     { name = "modal", specifier = ">=1.0.2" },
     { name = "numpy", specifier = ">=2.2.6" },
     { name = "pillow", specifier = ">=11.2.1" },
@@ -857,11 +856,8 @@ requires-dist = [
 [package.metadata.requires-dev]
 dev = [
     { name = "jupyterlab", specifier = ">=4.4.3" },
-    { name = "matplotlib", specifier = ">=3.10.3" },
     { name = "opencv-contrib-python", specifier = ">=4.11.0.86" },
-    { name = "rapidfuzz", specifier = ">=3.13.0" },
     { name = "ruff", specifier = ">=0.11.12" },
-    { name = "supervision", specifier = ">=0.25.1" },
 ]

 [[package]]
@@ -1572,23 +1568,6 @@ wheels = [
|
|
1572 |
{ url = "https://files.pythonhosted.org/packages/0d/c6/146487546adc4726f0be591a65b466973feaa58cc3db711087e802e940fb/opencv_contrib_python-4.11.0.86-cp37-abi3-win_amd64.whl", hash = "sha256:654758a9ae8ca9a75fca7b64b19163636534f0eedffe1e14c3d7218988625c8d", size = 46185163, upload-time = "2025-01-16T13:52:39.745Z" },
|
1573 |
]
|
1574 |
|
1575 |
-
[[package]]
|
1576 |
-
name = "opencv-python"
|
1577 |
-
version = "4.11.0.86"
|
1578 |
-
source = { registry = "https://pypi.org/simple" }
|
1579 |
-
dependencies = [
|
1580 |
-
{ name = "numpy" },
|
1581 |
-
]
|
1582 |
-
sdist = { url = "https://files.pythonhosted.org/packages/17/06/68c27a523103dad5837dc5b87e71285280c4f098c60e4fe8a8db6486ab09/opencv-python-4.11.0.86.tar.gz", hash = "sha256:03d60ccae62304860d232272e4a4fda93c39d595780cb40b161b310244b736a4", size = 95171956, upload-time = "2025-01-16T13:52:24.737Z" }
|
1583 |
-
wheels = [
|
1584 |
-
{ url = "https://files.pythonhosted.org/packages/05/4d/53b30a2a3ac1f75f65a59eb29cf2ee7207ce64867db47036ad61743d5a23/opencv_python-4.11.0.86-cp37-abi3-macosx_13_0_arm64.whl", hash = "sha256:432f67c223f1dc2824f5e73cdfcd9db0efc8710647d4e813012195dc9122a52a", size = 37326322, upload-time = "2025-01-16T13:52:25.887Z" },
|
1585 |
-
{ url = "https://files.pythonhosted.org/packages/3b/84/0a67490741867eacdfa37bc18df96e08a9d579583b419010d7f3da8ff503/opencv_python-4.11.0.86-cp37-abi3-macosx_13_0_x86_64.whl", hash = "sha256:9d05ef13d23fe97f575153558653e2d6e87103995d54e6a35db3f282fe1f9c66", size = 56723197, upload-time = "2025-01-16T13:55:21.222Z" },
|
1586 |
-
{ url = "https://files.pythonhosted.org/packages/f3/bd/29c126788da65c1fb2b5fb621b7fed0ed5f9122aa22a0868c5e2c15c6d23/opencv_python-4.11.0.86-cp37-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl", hash = "sha256:1b92ae2c8852208817e6776ba1ea0d6b1e0a1b5431e971a2a0ddd2a8cc398202", size = 42230439, upload-time = "2025-01-16T13:51:35.822Z" },
|
1587 |
-
{ url = "https://files.pythonhosted.org/packages/2c/8b/90eb44a40476fa0e71e05a0283947cfd74a5d36121a11d926ad6f3193cc4/opencv_python-4.11.0.86-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl", hash = "sha256:6b02611523803495003bd87362db3e1d2a0454a6a63025dc6658a9830570aa0d", size = 62986597, upload-time = "2025-01-16T13:52:08.836Z" },
|
1588 |
-
{ url = "https://files.pythonhosted.org/packages/fb/d7/1d5941a9dde095468b288d989ff6539dd69cd429dbf1b9e839013d21b6f0/opencv_python-4.11.0.86-cp37-abi3-win32.whl", hash = "sha256:810549cb2a4aedaa84ad9a1c92fbfdfc14090e2749cedf2c1589ad8359aa169b", size = 29384337, upload-time = "2025-01-16T13:52:13.549Z" },
|
1589 |
-
{ url = "https://files.pythonhosted.org/packages/a4/7d/f1c30a92854540bf789e9cd5dde7ef49bbe63f855b85a2e6b3db8135c591/opencv_python-4.11.0.86-cp37-abi3-win_amd64.whl", hash = "sha256:085ad9b77c18853ea66283e98affefe2de8cc4c1f43eda4c100cf9b2721142ec", size = 39488044, upload-time = "2025-01-16T13:52:21.928Z" },
|
1590 |
-
]
|
1591 |
-
|
1592 |
[[package]]
|
1593 |
name = "orjson"
|
1594 |
version = "3.10.18"
|
@@ -2130,44 +2109,6 @@ wheels = [
|
|
2130 |
{ url = "https://files.pythonhosted.org/packages/05/4c/bf3cad0d64c3214ac881299c4562b815f05d503bccc513e3fd4fdc6f67e4/pyzmq-26.4.0-cp313-cp313t-musllinux_1_1_x86_64.whl", hash = "sha256:26a2a7451606b87f67cdeca2c2789d86f605da08b4bd616b1a9981605ca3a364", size = 1395540, upload-time = "2025-04-04T12:04:30.562Z" },
|
2131 |
]
|
2132 |
|
2133 |
-
[[package]]
|
2134 |
-
name = "rapidfuzz"
|
2135 |
-
version = "3.13.0"
|
2136 |
-
source = { registry = "https://pypi.org/simple" }
|
2137 |
-
sdist = { url = "https://files.pythonhosted.org/packages/ed/f6/6895abc3a3d056b9698da3199b04c0e56226d530ae44a470edabf8b664f0/rapidfuzz-3.13.0.tar.gz", hash = "sha256:d2eaf3839e52cbcc0accbe9817a67b4b0fcf70aaeb229cfddc1c28061f9ce5d8", size = 57904226, upload-time = "2025-04-03T20:38:51.226Z" }
|
2138 |
-
wheels = [
|
2139 |
-
{ url = "https://files.pythonhosted.org/packages/13/4b/a326f57a4efed8f5505b25102797a58e37ee11d94afd9d9422cb7c76117e/rapidfuzz-3.13.0-cp312-cp312-macosx_10_13_x86_64.whl", hash = "sha256:4a1a6a906ba62f2556372282b1ef37b26bca67e3d2ea957277cfcefc6275cca7", size = 1989501, upload-time = "2025-04-03T20:36:13.43Z" },
|
2140 |
-
{ url = "https://files.pythonhosted.org/packages/b7/53/1f7eb7ee83a06c400089ec7cb841cbd581c2edd7a4b21eb2f31030b88daa/rapidfuzz-3.13.0-cp312-cp312-macosx_11_0_arm64.whl", hash = "sha256:2fd0975e015b05c79a97f38883a11236f5a24cca83aa992bd2558ceaa5652b26", size = 1445379, upload-time = "2025-04-03T20:36:16.439Z" },
|
2141 |
-
{ url = "https://files.pythonhosted.org/packages/07/09/de8069a4599cc8e6d194e5fa1782c561151dea7d5e2741767137e2a8c1f0/rapidfuzz-3.13.0-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl", hash = "sha256:5d4e13593d298c50c4f94ce453f757b4b398af3fa0fd2fde693c3e51195b7f69", size = 1405986, upload-time = "2025-04-03T20:36:18.447Z" },
|
2142 |
-
{ url = "https://files.pythonhosted.org/packages/5d/77/d9a90b39c16eca20d70fec4ca377fbe9ea4c0d358c6e4736ab0e0e78aaf6/rapidfuzz-3.13.0-cp312-cp312-manylinux_2_17_i686.manylinux2014_i686.whl", hash = "sha256:ed6f416bda1c9133000009d84d9409823eb2358df0950231cc936e4bf784eb97", size = 5310809, upload-time = "2025-04-03T20:36:20.324Z" },
|
2143 |
-
{ url = "https://files.pythonhosted.org/packages/1e/7d/14da291b0d0f22262d19522afaf63bccf39fc027c981233fb2137a57b71f/rapidfuzz-3.13.0-cp312-cp312-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl", hash = "sha256:1dc82b6ed01acb536b94a43996a94471a218f4d89f3fdd9185ab496de4b2a981", size = 1629394, upload-time = "2025-04-03T20:36:22.256Z" },
|
2144 |
-
{ url = "https://files.pythonhosted.org/packages/b7/e4/79ed7e4fa58f37c0f8b7c0a62361f7089b221fe85738ae2dbcfb815e985a/rapidfuzz-3.13.0-cp312-cp312-manylinux_2_17_s390x.manylinux2014_s390x.whl", hash = "sha256:e9d824de871daa6e443b39ff495a884931970d567eb0dfa213d234337343835f", size = 1600544, upload-time = "2025-04-03T20:36:24.207Z" },
|
2145 |
-
{ url = "https://files.pythonhosted.org/packages/4e/20/e62b4d13ba851b0f36370060025de50a264d625f6b4c32899085ed51f980/rapidfuzz-3.13.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl", hash = "sha256:2d18228a2390375cf45726ce1af9d36ff3dc1f11dce9775eae1f1b13ac6ec50f", size = 3052796, upload-time = "2025-04-03T20:36:26.279Z" },
|
2146 |
-
{ url = "https://files.pythonhosted.org/packages/cd/8d/55fdf4387dec10aa177fe3df8dbb0d5022224d95f48664a21d6b62a5299d/rapidfuzz-3.13.0-cp312-cp312-musllinux_1_2_aarch64.whl", hash = "sha256:9f5fe634c9482ec5d4a6692afb8c45d370ae86755e5f57aa6c50bfe4ca2bdd87", size = 2464016, upload-time = "2025-04-03T20:36:28.525Z" },
|
2147 |
-
{ url = "https://files.pythonhosted.org/packages/9b/be/0872f6a56c0f473165d3b47d4170fa75263dc5f46985755aa9bf2bbcdea1/rapidfuzz-3.13.0-cp312-cp312-musllinux_1_2_i686.whl", hash = "sha256:694eb531889f71022b2be86f625a4209c4049e74be9ca836919b9e395d5e33b3", size = 7556725, upload-time = "2025-04-03T20:36:30.629Z" },
|
2148 |
-
{ url = "https://files.pythonhosted.org/packages/5d/f3/6c0750e484d885a14840c7a150926f425d524982aca989cdda0bb3bdfa57/rapidfuzz-3.13.0-cp312-cp312-musllinux_1_2_ppc64le.whl", hash = "sha256:11b47b40650e06147dee5e51a9c9ad73bb7b86968b6f7d30e503b9f8dd1292db", size = 2859052, upload-time = "2025-04-03T20:36:32.836Z" },
|
2149 |
-
{ url = "https://files.pythonhosted.org/packages/6f/98/5a3a14701b5eb330f444f7883c9840b43fb29c575e292e09c90a270a6e07/rapidfuzz-3.13.0-cp312-cp312-musllinux_1_2_s390x.whl", hash = "sha256:98b8107ff14f5af0243f27d236bcc6e1ef8e7e3b3c25df114e91e3a99572da73", size = 3390219, upload-time = "2025-04-03T20:36:35.062Z" },
|
2150 |
-
{ url = "https://files.pythonhosted.org/packages/e9/7d/f4642eaaeb474b19974332f2a58471803448be843033e5740965775760a5/rapidfuzz-3.13.0-cp312-cp312-musllinux_1_2_x86_64.whl", hash = "sha256:b836f486dba0aceb2551e838ff3f514a38ee72b015364f739e526d720fdb823a", size = 4377924, upload-time = "2025-04-03T20:36:37.363Z" },
|
2151 |
-
{ url = "https://files.pythonhosted.org/packages/8e/83/fa33f61796731891c3e045d0cbca4436a5c436a170e7f04d42c2423652c3/rapidfuzz-3.13.0-cp312-cp312-win32.whl", hash = "sha256:4671ee300d1818d7bdfd8fa0608580d7778ba701817216f0c17fb29e6b972514", size = 1823915, upload-time = "2025-04-03T20:36:39.451Z" },
|
2152 |
-
{ url = "https://files.pythonhosted.org/packages/03/25/5ee7ab6841ca668567d0897905eebc79c76f6297b73bf05957be887e9c74/rapidfuzz-3.13.0-cp312-cp312-win_amd64.whl", hash = "sha256:6e2065f68fb1d0bf65adc289c1bdc45ba7e464e406b319d67bb54441a1b9da9e", size = 1616985, upload-time = "2025-04-03T20:36:41.631Z" },
|
2153 |
-
{ url = "https://files.pythonhosted.org/packages/76/5e/3f0fb88db396cb692aefd631e4805854e02120a2382723b90dcae720bcc6/rapidfuzz-3.13.0-cp312-cp312-win_arm64.whl", hash = "sha256:65cc97c2fc2c2fe23586599686f3b1ceeedeca8e598cfcc1b7e56dc8ca7e2aa7", size = 860116, upload-time = "2025-04-03T20:36:43.915Z" },
|
2154 |
-
{ url = "https://files.pythonhosted.org/packages/0a/76/606e71e4227790750f1646f3c5c873e18d6cfeb6f9a77b2b8c4dec8f0f66/rapidfuzz-3.13.0-cp313-cp313-macosx_10_13_x86_64.whl", hash = "sha256:09e908064d3684c541d312bd4c7b05acb99a2c764f6231bd507d4b4b65226c23", size = 1982282, upload-time = "2025-04-03T20:36:46.149Z" },
|
2155 |
-
{ url = "https://files.pythonhosted.org/packages/0a/f5/d0b48c6b902607a59fd5932a54e3518dae8223814db8349b0176e6e9444b/rapidfuzz-3.13.0-cp313-cp313-macosx_11_0_arm64.whl", hash = "sha256:57c390336cb50d5d3bfb0cfe1467478a15733703af61f6dffb14b1cd312a6fae", size = 1439274, upload-time = "2025-04-03T20:36:48.323Z" },
|
2156 |
-
{ url = "https://files.pythonhosted.org/packages/59/cf/c3ac8c80d8ced6c1f99b5d9674d397ce5d0e9d0939d788d67c010e19c65f/rapidfuzz-3.13.0-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl", hash = "sha256:0da54aa8547b3c2c188db3d1c7eb4d1bb6dd80baa8cdaeaec3d1da3346ec9caa", size = 1399854, upload-time = "2025-04-03T20:36:50.294Z" },
|
2157 |
-
{ url = "https://files.pythonhosted.org/packages/09/5d/ca8698e452b349c8313faf07bfa84e7d1c2d2edf7ccc67bcfc49bee1259a/rapidfuzz-3.13.0-cp313-cp313-manylinux_2_17_i686.manylinux2014_i686.whl", hash = "sha256:df8e8c21e67afb9d7fbe18f42c6111fe155e801ab103c81109a61312927cc611", size = 5308962, upload-time = "2025-04-03T20:36:52.421Z" },
|
2158 |
-
{ url = "https://files.pythonhosted.org/packages/66/0a/bebada332854e78e68f3d6c05226b23faca79d71362509dbcf7b002e33b7/rapidfuzz-3.13.0-cp313-cp313-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl", hash = "sha256:461fd13250a2adf8e90ca9a0e1e166515cbcaa5e9c3b1f37545cbbeff9e77f6b", size = 1625016, upload-time = "2025-04-03T20:36:54.639Z" },
|
2159 |
-
{ url = "https://files.pythonhosted.org/packages/de/0c/9e58d4887b86d7121d1c519f7050d1be5eb189d8a8075f5417df6492b4f5/rapidfuzz-3.13.0-cp313-cp313-manylinux_2_17_s390x.manylinux2014_s390x.whl", hash = "sha256:c2b3dd5d206a12deca16870acc0d6e5036abeb70e3cad6549c294eff15591527", size = 1600414, upload-time = "2025-04-03T20:36:56.669Z" },
|
2160 |
-
{ url = "https://files.pythonhosted.org/packages/9b/df/6096bc669c1311568840bdcbb5a893edc972d1c8d2b4b4325c21d54da5b1/rapidfuzz-3.13.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl", hash = "sha256:1343d745fbf4688e412d8f398c6e6d6f269db99a54456873f232ba2e7aeb4939", size = 3053179, upload-time = "2025-04-03T20:36:59.366Z" },
|
2161 |
-
{ url = "https://files.pythonhosted.org/packages/f9/46/5179c583b75fce3e65a5cd79a3561bd19abd54518cb7c483a89b284bf2b9/rapidfuzz-3.13.0-cp313-cp313-musllinux_1_2_aarch64.whl", hash = "sha256:b1b065f370d54551dcc785c6f9eeb5bd517ae14c983d2784c064b3aa525896df", size = 2456856, upload-time = "2025-04-03T20:37:01.708Z" },
|
2162 |
-
{ url = "https://files.pythonhosted.org/packages/6b/64/e9804212e3286d027ac35bbb66603c9456c2bce23f823b67d2f5cabc05c1/rapidfuzz-3.13.0-cp313-cp313-musllinux_1_2_i686.whl", hash = "sha256:11b125d8edd67e767b2295eac6eb9afe0b1cdc82ea3d4b9257da4b8e06077798", size = 7567107, upload-time = "2025-04-03T20:37:04.521Z" },
|
2163 |
-
{ url = "https://files.pythonhosted.org/packages/8a/f2/7d69e7bf4daec62769b11757ffc31f69afb3ce248947aadbb109fefd9f65/rapidfuzz-3.13.0-cp313-cp313-musllinux_1_2_ppc64le.whl", hash = "sha256:c33f9c841630b2bb7e69a3fb5c84a854075bb812c47620978bddc591f764da3d", size = 2854192, upload-time = "2025-04-03T20:37:06.905Z" },
|
2164 |
-
{ url = "https://files.pythonhosted.org/packages/05/21/ab4ad7d7d0f653e6fe2e4ccf11d0245092bef94cdff587a21e534e57bda8/rapidfuzz-3.13.0-cp313-cp313-musllinux_1_2_s390x.whl", hash = "sha256:ae4574cb66cf1e85d32bb7e9ec45af5409c5b3970b7ceb8dea90168024127566", size = 3398876, upload-time = "2025-04-03T20:37:09.692Z" },
|
2165 |
-
{ url = "https://files.pythonhosted.org/packages/0f/a8/45bba94c2489cb1ee0130dcb46e1df4fa2c2b25269e21ffd15240a80322b/rapidfuzz-3.13.0-cp313-cp313-musllinux_1_2_x86_64.whl", hash = "sha256:e05752418b24bbd411841b256344c26f57da1148c5509e34ea39c7eb5099ab72", size = 4377077, upload-time = "2025-04-03T20:37:11.929Z" },
|
2166 |
-
{ url = "https://files.pythonhosted.org/packages/0c/f3/5e0c6ae452cbb74e5436d3445467447e8c32f3021f48f93f15934b8cffc2/rapidfuzz-3.13.0-cp313-cp313-win32.whl", hash = "sha256:0e1d08cb884805a543f2de1f6744069495ef527e279e05370dd7c83416af83f8", size = 1822066, upload-time = "2025-04-03T20:37:14.425Z" },
|
2167 |
-
{ url = "https://files.pythonhosted.org/packages/96/e3/a98c25c4f74051df4dcf2f393176b8663bfd93c7afc6692c84e96de147a2/rapidfuzz-3.13.0-cp313-cp313-win_amd64.whl", hash = "sha256:9a7c6232be5f809cd39da30ee5d24e6cadd919831e6020ec6c2391f4c3bc9264", size = 1615100, upload-time = "2025-04-03T20:37:16.611Z" },
|
2168 |
-
{ url = "https://files.pythonhosted.org/packages/60/b1/05cd5e697c00cd46d7791915f571b38c8531f714832eff2c5e34537c49ee/rapidfuzz-3.13.0-cp313-cp313-win_arm64.whl", hash = "sha256:3f32f15bacd1838c929b35c84b43618481e1b3d7a61b5ed2db0291b70ae88b53", size = 858976, upload-time = "2025-04-03T20:37:19.336Z" },
|
2169 |
-
]
|
2170 |
-
|
2171 |
[[package]]
|
2172 |
name = "referencing"
|
2173 |
version = "0.36.2"
|
@@ -2317,44 +2258,6 @@ wheels = [
|
|
2317 |
{ url = "https://files.pythonhosted.org/packages/4d/c0/1108ad9f01567f66b3154063605b350b69c3c9366732e09e45f9fd0d1deb/safehttpx-0.1.6-py3-none-any.whl", hash = "sha256:407cff0b410b071623087c63dd2080c3b44dc076888d8c5823c00d1e58cb381c", size = 8692, upload-time = "2024-12-02T18:44:08.555Z" },
|
2318 |
]
|
2319 |
|
2320 |
-
[[package]]
|
2321 |
-
name = "scipy"
|
2322 |
-
version = "1.15.3"
|
2323 |
-
source = { registry = "https://pypi.org/simple" }
|
2324 |
-
dependencies = [
|
2325 |
-
{ name = "numpy" },
|
2326 |
-
]
|
2327 |
-
sdist = { url = "https://files.pythonhosted.org/packages/0f/37/6964b830433e654ec7485e45a00fc9a27cf868d622838f6b6d9c5ec0d532/scipy-1.15.3.tar.gz", hash = "sha256:eae3cf522bc7df64b42cad3925c876e1b0b6c35c1337c93e12c0f366f55b0eaf", size = 59419214, upload-time = "2025-05-08T16:13:05.955Z" }
|
2328 |
-
wheels = [
|
2329 |
-
{ url = "https://files.pythonhosted.org/packages/37/4b/683aa044c4162e10ed7a7ea30527f2cbd92e6999c10a8ed8edb253836e9c/scipy-1.15.3-cp312-cp312-macosx_10_13_x86_64.whl", hash = "sha256:6ac6310fdbfb7aa6612408bd2f07295bcbd3fda00d2d702178434751fe48e019", size = 38766735, upload-time = "2025-05-08T16:06:06.471Z" },
|
2330 |
-
{ url = "https://files.pythonhosted.org/packages/7b/7e/f30be3d03de07f25dc0ec926d1681fed5c732d759ac8f51079708c79e680/scipy-1.15.3-cp312-cp312-macosx_12_0_arm64.whl", hash = "sha256:185cd3d6d05ca4b44a8f1595af87f9c372bb6acf9c808e99aa3e9aa03bd98cf6", size = 30173284, upload-time = "2025-05-08T16:06:11.686Z" },
|
2331 |
-
{ url = "https://files.pythonhosted.org/packages/07/9c/0ddb0d0abdabe0d181c1793db51f02cd59e4901da6f9f7848e1f96759f0d/scipy-1.15.3-cp312-cp312-macosx_14_0_arm64.whl", hash = "sha256:05dc6abcd105e1a29f95eada46d4a3f251743cfd7d3ae8ddb4088047f24ea477", size = 22446958, upload-time = "2025-05-08T16:06:15.97Z" },
|
2332 |
-
{ url = "https://files.pythonhosted.org/packages/af/43/0bce905a965f36c58ff80d8bea33f1f9351b05fad4beaad4eae34699b7a1/scipy-1.15.3-cp312-cp312-macosx_14_0_x86_64.whl", hash = "sha256:06efcba926324df1696931a57a176c80848ccd67ce6ad020c810736bfd58eb1c", size = 25242454, upload-time = "2025-05-08T16:06:20.394Z" },
|
2333 |
-
{ url = "https://files.pythonhosted.org/packages/56/30/a6f08f84ee5b7b28b4c597aca4cbe545535c39fe911845a96414700b64ba/scipy-1.15.3-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl", hash = "sha256:c05045d8b9bfd807ee1b9f38761993297b10b245f012b11b13b91ba8945f7e45", size = 35210199, upload-time = "2025-05-08T16:06:26.159Z" },
|
2334 |
-
{ url = "https://files.pythonhosted.org/packages/0b/1f/03f52c282437a168ee2c7c14a1a0d0781a9a4a8962d84ac05c06b4c5b555/scipy-1.15.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl", hash = "sha256:271e3713e645149ea5ea3e97b57fdab61ce61333f97cfae392c28ba786f9bb49", size = 37309455, upload-time = "2025-05-08T16:06:32.778Z" },
|
2335 |
-
{ url = "https://files.pythonhosted.org/packages/89/b1/fbb53137f42c4bf630b1ffdfc2151a62d1d1b903b249f030d2b1c0280af8/scipy-1.15.3-cp312-cp312-musllinux_1_2_aarch64.whl", hash = "sha256:6cfd56fc1a8e53f6e89ba3a7a7251f7396412d655bca2aa5611c8ec9a6784a1e", size = 36885140, upload-time = "2025-05-08T16:06:39.249Z" },
|
2336 |
-
{ url = "https://files.pythonhosted.org/packages/2e/2e/025e39e339f5090df1ff266d021892694dbb7e63568edcfe43f892fa381d/scipy-1.15.3-cp312-cp312-musllinux_1_2_x86_64.whl", hash = "sha256:0ff17c0bb1cb32952c09217d8d1eed9b53d1463e5f1dd6052c7857f83127d539", size = 39710549, upload-time = "2025-05-08T16:06:45.729Z" },
|
2337 |
-
{ url = "https://files.pythonhosted.org/packages/e6/eb/3bf6ea8ab7f1503dca3a10df2e4b9c3f6b3316df07f6c0ded94b281c7101/scipy-1.15.3-cp312-cp312-win_amd64.whl", hash = "sha256:52092bc0472cfd17df49ff17e70624345efece4e1a12b23783a1ac59a1b728ed", size = 40966184, upload-time = "2025-05-08T16:06:52.623Z" },
|
2338 |
-
{ url = "https://files.pythonhosted.org/packages/73/18/ec27848c9baae6e0d6573eda6e01a602e5649ee72c27c3a8aad673ebecfd/scipy-1.15.3-cp313-cp313-macosx_10_13_x86_64.whl", hash = "sha256:2c620736bcc334782e24d173c0fdbb7590a0a436d2fdf39310a8902505008759", size = 38728256, upload-time = "2025-05-08T16:06:58.696Z" },
|
2339 |
-
{ url = "https://files.pythonhosted.org/packages/74/cd/1aef2184948728b4b6e21267d53b3339762c285a46a274ebb7863c9e4742/scipy-1.15.3-cp313-cp313-macosx_12_0_arm64.whl", hash = "sha256:7e11270a000969409d37ed399585ee530b9ef6aa99d50c019de4cb01e8e54e62", size = 30109540, upload-time = "2025-05-08T16:07:04.209Z" },
|
2340 |
-
{ url = "https://files.pythonhosted.org/packages/5b/d8/59e452c0a255ec352bd0a833537a3bc1bfb679944c4938ab375b0a6b3a3e/scipy-1.15.3-cp313-cp313-macosx_14_0_arm64.whl", hash = "sha256:8c9ed3ba2c8a2ce098163a9bdb26f891746d02136995df25227a20e71c396ebb", size = 22383115, upload-time = "2025-05-08T16:07:08.998Z" },
|
2341 |
-
{ url = "https://files.pythonhosted.org/packages/08/f5/456f56bbbfccf696263b47095291040655e3cbaf05d063bdc7c7517f32ac/scipy-1.15.3-cp313-cp313-macosx_14_0_x86_64.whl", hash = "sha256:0bdd905264c0c9cfa74a4772cdb2070171790381a5c4d312c973382fc6eaf730", size = 25163884, upload-time = "2025-05-08T16:07:14.091Z" },
|
2342 |
-
{ url = "https://files.pythonhosted.org/packages/a2/66/a9618b6a435a0f0c0b8a6d0a2efb32d4ec5a85f023c2b79d39512040355b/scipy-1.15.3-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl", hash = "sha256:79167bba085c31f38603e11a267d862957cbb3ce018d8b38f79ac043bc92d825", size = 35174018, upload-time = "2025-05-08T16:07:19.427Z" },
|
2343 |
-
{ url = "https://files.pythonhosted.org/packages/b5/09/c5b6734a50ad4882432b6bb7c02baf757f5b2f256041da5df242e2d7e6b6/scipy-1.15.3-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl", hash = "sha256:c9deabd6d547aee2c9a81dee6cc96c6d7e9a9b1953f74850c179f91fdc729cb7", size = 37269716, upload-time = "2025-05-08T16:07:25.712Z" },
|
2344 |
-
{ url = "https://files.pythonhosted.org/packages/77/0a/eac00ff741f23bcabd352731ed9b8995a0a60ef57f5fd788d611d43d69a1/scipy-1.15.3-cp313-cp313-musllinux_1_2_aarch64.whl", hash = "sha256:dde4fc32993071ac0c7dd2d82569e544f0bdaff66269cb475e0f369adad13f11", size = 36872342, upload-time = "2025-05-08T16:07:31.468Z" },
|
2345 |
-
{ url = "https://files.pythonhosted.org/packages/fe/54/4379be86dd74b6ad81551689107360d9a3e18f24d20767a2d5b9253a3f0a/scipy-1.15.3-cp313-cp313-musllinux_1_2_x86_64.whl", hash = "sha256:f77f853d584e72e874d87357ad70f44b437331507d1c311457bed8ed2b956126", size = 39670869, upload-time = "2025-05-08T16:07:38.002Z" },
|
2346 |
-
{ url = "https://files.pythonhosted.org/packages/87/2e/892ad2862ba54f084ffe8cc4a22667eaf9c2bcec6d2bff1d15713c6c0703/scipy-1.15.3-cp313-cp313-win_amd64.whl", hash = "sha256:b90ab29d0c37ec9bf55424c064312930ca5f4bde15ee8619ee44e69319aab163", size = 40988851, upload-time = "2025-05-08T16:08:33.671Z" },
|
2347 |
-
{ url = "https://files.pythonhosted.org/packages/1b/e9/7a879c137f7e55b30d75d90ce3eb468197646bc7b443ac036ae3fe109055/scipy-1.15.3-cp313-cp313t-macosx_10_13_x86_64.whl", hash = "sha256:3ac07623267feb3ae308487c260ac684b32ea35fd81e12845039952f558047b8", size = 38863011, upload-time = "2025-05-08T16:07:44.039Z" },
|
2348 |
-
{ url = "https://files.pythonhosted.org/packages/51/d1/226a806bbd69f62ce5ef5f3ffadc35286e9fbc802f606a07eb83bf2359de/scipy-1.15.3-cp313-cp313t-macosx_12_0_arm64.whl", hash = "sha256:6487aa99c2a3d509a5227d9a5e889ff05830a06b2ce08ec30df6d79db5fcd5c5", size = 30266407, upload-time = "2025-05-08T16:07:49.891Z" },
|
2349 |
-
{ url = "https://files.pythonhosted.org/packages/e5/9b/f32d1d6093ab9eeabbd839b0f7619c62e46cc4b7b6dbf05b6e615bbd4400/scipy-1.15.3-cp313-cp313t-macosx_14_0_arm64.whl", hash = "sha256:50f9e62461c95d933d5c5ef4a1f2ebf9a2b4e83b0db374cb3f1de104d935922e", size = 22540030, upload-time = "2025-05-08T16:07:54.121Z" },
|
2350 |
-
{ url = "https://files.pythonhosted.org/packages/e7/29/c278f699b095c1a884f29fda126340fcc201461ee8bfea5c8bdb1c7c958b/scipy-1.15.3-cp313-cp313t-macosx_14_0_x86_64.whl", hash = "sha256:14ed70039d182f411ffc74789a16df3835e05dc469b898233a245cdfd7f162cb", size = 25218709, upload-time = "2025-05-08T16:07:58.506Z" },
|
2351 |
-
{ url = "https://files.pythonhosted.org/packages/24/18/9e5374b617aba742a990581373cd6b68a2945d65cc588482749ef2e64467/scipy-1.15.3-cp313-cp313t-manylinux_2_17_aarch64.manylinux2014_aarch64.whl", hash = "sha256:0a769105537aa07a69468a0eefcd121be52006db61cdd8cac8a0e68980bbb723", size = 34809045, upload-time = "2025-05-08T16:08:03.929Z" },
|
2352 |
-
{ url = "https://files.pythonhosted.org/packages/e1/fe/9c4361e7ba2927074360856db6135ef4904d505e9b3afbbcb073c4008328/scipy-1.15.3-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl", hash = "sha256:9db984639887e3dffb3928d118145ffe40eff2fa40cb241a306ec57c219ebbbb", size = 36703062, upload-time = "2025-05-08T16:08:09.558Z" },
|
2353 |
-
{ url = "https://files.pythonhosted.org/packages/b7/8e/038ccfe29d272b30086b25a4960f757f97122cb2ec42e62b460d02fe98e9/scipy-1.15.3-cp313-cp313t-musllinux_1_2_aarch64.whl", hash = "sha256:40e54d5c7e7ebf1aa596c374c49fa3135f04648a0caabcb66c52884b943f02b4", size = 36393132, upload-time = "2025-05-08T16:08:15.34Z" },
|
2354 |
-
{ url = "https://files.pythonhosted.org/packages/10/7e/5c12285452970be5bdbe8352c619250b97ebf7917d7a9a9e96b8a8140f17/scipy-1.15.3-cp313-cp313t-musllinux_1_2_x86_64.whl", hash = "sha256:5e721fed53187e71d0ccf382b6bf977644c533e506c4d33c3fb24de89f5c3ed5", size = 38979503, upload-time = "2025-05-08T16:08:21.513Z" },
|
2355 |
-
{ url = "https://files.pythonhosted.org/packages/81/06/0a5e5349474e1cbc5757975b21bd4fad0e72ebf138c5592f191646154e06/scipy-1.15.3-cp313-cp313t-win_amd64.whl", hash = "sha256:76ad1fb5f8752eabf0fa02e4cc0336b4e8f021e2d5f061ed37d6d264db35e3ca", size = 40308097, upload-time = "2025-05-08T16:08:27.627Z" },
|
2356 |
-
]
|
2357 |
-
|
2358 |
[[package]]
|
2359 |
name = "semantic-version"
|
2360 |
version = "2.10.0"
|
@@ -2468,27 +2371,6 @@ wheels = [
|
|
2468 |
{ url = "https://files.pythonhosted.org/packages/8b/0c/9d30a4ebeb6db2b25a841afbb80f6ef9a854fc3b41be131d249a977b4959/starlette-0.46.2-py3-none-any.whl", hash = "sha256:595633ce89f8ffa71a015caed34a5b2dc1c0cdb3f0f1fbd1e69339cf2abeec35", size = 72037, upload-time = "2025-04-13T13:56:16.21Z" },
|
2469 |
]
|
2470 |
|
2471 |
-
[[package]]
|
2472 |
-
name = "supervision"
|
2473 |
-
version = "0.25.1"
|
2474 |
-
source = { registry = "https://pypi.org/simple" }
|
2475 |
-
dependencies = [
|
2476 |
-
{ name = "contourpy" },
|
2477 |
-
{ name = "defusedxml" },
|
2478 |
-
{ name = "matplotlib" },
|
2479 |
-
{ name = "numpy" },
|
2480 |
-
{ name = "opencv-python" },
|
2481 |
-
{ name = "pillow" },
|
2482 |
-
{ name = "pyyaml" },
|
2483 |
-
{ name = "requests" },
|
2484 |
-
{ name = "scipy" },
|
2485 |
-
{ name = "tqdm" },
|
2486 |
-
]
|
2487 |
-
sdist = { url = "https://files.pythonhosted.org/packages/4c/87/3daaa3aec1766f93d4c07d33f933a5ded0a6243a099b6b399b6268053bfe/supervision-0.25.1.tar.gz", hash = "sha256:61781b4abe4fa6ff95c58af6aec7dd3451a78e7e6a797e9ea2787f93771dd031", size = 146657, upload-time = "2024-12-13T13:12:10.64Z" }
|
2488 |
-
wheels = [
|
2489 |
-
{ url = "https://files.pythonhosted.org/packages/c1/24/d3bcad7ece751166ed308c6deb7e7d02a62a7f5a6e01e61ff2787c538fb0/supervision-0.25.1-py3-none-any.whl", hash = "sha256:ebc015c22983bc64563beda75f5f529e465e4020b318da07948ce03148307a72", size = 181480, upload-time = "2024-12-13T13:12:08.1Z" },
|
2490 |
-
]
|
2491 |
-
|
2492 |
[[package]]
|
2493 |
name = "synchronicity"
|
2494 |
version = "0.9.12"
|
|