Spaces: Agents-MCP-Hackathon/ScouterAI
stevenbucaille committed 11 days ago

Commit 8091e8c · verified · 1 Parent(s): bda8e49

Update README.md

Files changed (1)

README.md CHANGED

@@ -21,12 +21,17 @@ Computer Vision models like object detection or image segmentation models are ta
 The idea of the agentic demo is to give a powerful LLM access to expert vision models such as object detection or image segmentation models.
 The agent can fulfill precise perception tasks on any object present in the image: detection, location, classification, masking, counting, etc.
-##
-In this preliminary app, the agent is a CodeAgent (provided by the smolagents framework) provided with access to a set of tools :
-- Any object detection and image segmentation models available of HuggingFace
-- Image processing functions
-- Image annotation functions
+## Overview
+In this preliminary app, the agent is a CodeAgent provided by the smolagents framework.
+Its interface consists of a chat panel with examples and a gallery that displays the agent's work.
+The agent is provided with a set of tools:
+- Task model retriever: a RAG tool which, given a task (object-detection or image-segmentation) and a query (e.g. car), returns a list of models, each with its model id and the list of classes it can detect or segment. The list is based on a curated dataset of all the models available on the HuggingFace Hub, returns the mo
+- Computer vision models: any object detection or image segmentation model available on HuggingFace
+- Image processing functions: resizing, cropping, ...
+- Image annotation functions: label, bounding box and mask annotators
 To complete a user request
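
Review note: the contract of the task model retriever described in the added lines (a task plus a query in, a list of model ids with their classes out) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: `ModelEntry`, `CURATED_MODELS`, `retrieve_models` and the `example-org/...` model ids are hypothetical and do not come from the ScouterAI code, and a plain substring match over class names stands in for the actual RAG retrieval, whose details the README does not give.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One row of a hypothetical curated model dataset."""
    model_id: str
    task: str
    classes: tuple[str, ...]


# Hypothetical stand-in for the curated dataset mentioned in the README.
# The model ids are placeholders, not real Hub repositories.
CURATED_MODELS = [
    ModelEntry("example-org/street-detector", "object-detection",
               ("car", "truck", "person")),
    ModelEntry("example-org/animal-detector", "object-detection",
               ("cat", "dog")),
    ModelEntry("example-org/road-segmenter", "image-segmentation",
               ("road", "car", "sky")),
]

# The two tasks named in the README.
SUPPORTED_TASKS = ("object-detection", "image-segmentation")


def retrieve_models(task, query, models=CURATED_MODELS):
    """Return models for `task` that have a class name containing `query`.

    Assumption: case-insensitive substring matching replaces the real
    retrieval step, which is not specified in the README.
    """
    if task not in SUPPORTED_TASKS:
        raise ValueError(f"unsupported task: {task!r}")
    needle = query.strip().lower()
    if not needle:
        return []
    return [
        {"model_id": m.model_id, "classes": list(m.classes)}
        for m in models
        if m.task == task and any(needle in c.lower() for c in m.classes)
    ]


if __name__ == "__main__":
    for match in retrieve_models("object-detection", "car"):
        print(match["model_id"], match["classes"])
```

In the app itself this logic would presumably be wrapped as a tool and handed to the smolagents CodeAgent; that wiring is left out here so the sketch runs with the standard library only.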