stevenbucaille committed on
Commit
8091e8c
·
verified ·
1 Parent(s): bda8e49

Update README.md

  1. README.md +10 -5
README.md CHANGED
@@ -21,12 +21,17 @@ Computer Vision models like object detection or image segmentation models are ta
21
  The idea of the agentic demo is to provide a powerful LLM with access to expert vision models like object detection or image segmentation models.
22
  The agent can fulfill precise perception tasks on any object present in the image : detection, localization, classification, masking, counting, etc.
23
 
24
- ##
 
 
 
 
 
 
 
 
 
25
 
26
- In this preliminary app, the agent is a CodeAgent (provided by the smolagents framework) provided with access to a set of tools :
27
- - Any object detection and image segmentation models available of HuggingFace
28
- - Image processing functions
29
- - Image annotation functions
30
 
31
  To complete a user request
32
 
 
21
  The idea of the agentic demo is to provide a powerful LLM with access to expert vision models like object detection or image segmentation models.
22
  The agent can fulfill precise perception tasks on any object present in the image : detection, localization, classification, masking, counting, etc.
23
 
24
+ ## Overview
25
+
26
+ In this preliminary app, the agent is a CodeAgent provided by the smolagents framework.
27
+ Its interface consists of a chat panel with examples and a gallery that displays the agent's work.
28
+ The agent is provided with a set of tools :
29
+ - Task model retriever : a RAG tool which, given a task (object-detection or image-segmentation) and a query (e.g. car), returns a list of models, each with its model id and the list of classes it can detect or segment. The list is based on a curated dataset of all the models available on the HuggingFace Hub, returns the mo
30
+ - Computer vision models : any object detection or image segmentation model available on HuggingFace
31
+ - Image processing functions : Resizing, cropping, ...
32
+ - Image annotation functions : Label, bounding box and mask annotators
33
+
34
 
 
 
 
 
35
 
36
  To complete a user request
37