Update README.md
README.md
CHANGED
@@ -21,12 +21,17 @@ Computer Vision models like object detection or image segmentation models are ta
|
|
21 |
The idea of the agentic demo is to provide a powerful LLM with access to expert vision models like object detection or image segmentation models.
|
22 |
The agent can fulfill precise perception tasks on any object present in the image : detection, location, classification, masking, counting, etc.
|
23 |
|
24 |
-
##
|
25 |
|
26 |
-
In this preliminary app, the agent is a CodeAgent (provided by the smolagents framework) provided with access to a set of tools :
|
27 |
-
- Any object detection and image segmentation models available of HuggingFace
|
28 |
-
- Image processing functions
|
29 |
-
- Image annotation functions
|
30 |
|
31 |
To complete a user request
|
32 |
|
|
|
21 |
The idea of the agentic demo is to provide a powerful LLM with access to expert vision models like object detection or image segmentation models.
|
22 |
The agent can fulfill precise perception tasks on any object present in the image : detection, location, classification, masking, counting, etc.
|
23 |
|
24 |
+
## Overview
|
25 |
+
|
26 |
+
In this preliminary app, the agent is a CodeAgent provided by the smolagents framework.
|
27 |
+
Its interface consists of a chat interface with examples and a gallery that displays the agent's work.
|
28 |
+
The agent is provided with a set of tools :
|
29 |
+
- Task model retriever : a RAG tool which, given a task (object-detection or image-segmentation) and a query (e.g. car), returns a list of models, each with its model id and the list of classes it can detect or segment. The list is based on a curated dataset of all the models available on the HuggingFace Hub, returns the mo
|
30 |
+
- Computer vision models : Any object detection or image segmentation model available on HuggingFace
|
31 |
+
- Image processing functions : Resizing, cropping, ...
|
32 |
+
- Image annotation functions : Label, bounding box and mask annotators
|
33 |
+
|
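The task model retriever in the tool list above can be illustrated with a small sketch. This is not the Space's actual implementation: the README describes a RAG tool over a curated dataset, whereas the sketch below swaps in plain exact-match filtering over an in-memory catalog. The names `ModelEntry`, `CATALOG`, `retrieve_models` and the `example-org/...` model ids are all hypothetical and exist only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One row of a hypothetical curated model catalog."""
    model_id: str
    task: str
    classes: tuple[str, ...]


# Hypothetical catalog: these ids are placeholders, not real Hub models.
CATALOG = (
    ModelEntry("example-org/street-detector", "object-detection",
               ("car", "truck", "person")),
    ModelEntry("example-org/pet-detector", "object-detection",
               ("cat", "dog")),
    ModelEntry("example-org/road-segmenter", "image-segmentation",
               ("road", "car", "sky")),
)

# The two tasks named in the README.
SUPPORTED_TASKS = ("object-detection", "image-segmentation")


def retrieve_models(task: str, query: str, catalog=CATALOG) -> list[dict]:
    """Return models for `task` whose class list contains `query`.

    Assumption: exact, case-insensitive class matching stands in for
    the retrieval step that a real RAG tool would perform.
    """
    if task not in SUPPORTED_TASKS:
        raise ValueError(f"unsupported task: {task!r}")
    wanted = query.strip().lower()
    return [
        {"model_id": entry.model_id, "classes": list(entry.classes)}
        for entry in catalog
        if entry.task == task
        and wanted in (name.lower() for name in entry.classes)
    ]


if __name__ == "__main__":
    for match in retrieve_models("object-detection", "car"):
        print(match["model_id"], match["classes"])
```

Returning the class list alongside each model id mirrors the README's description, since the agent needs both to pick a model that covers the object the user asked about.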
34 |
|
35 |
|
36 |
To complete a user request
|
37 |
|