Spaces:

Agents-MCP-Hackathon
/

ScouterAI

Running

App Files Files Community

stevenbucaille commited on 11 days ago

Commit

bda8e49

verified ·

1 Parent(s): 72a26fd

Update README.md

Browse files

Files changed (1) hide show

README.md +29 -2

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
-title: VisionEnhancedAgent
-emoji: 🚀
 colorFrom: green
 colorTo: gray
 sdk: gradio
@@ -11,3 +11,30 @@ license: apache-2.0
 tag: agent-demo-track
 ---

 ---
+title: ScouterAI
+emoji: 👓
 colorFrom: green
 colorTo: gray
 sdk: gradio
 tag: agent-demo-track
 ---
+# ScouterAI - The Vision enhanced Agent
+Welcome to ScouterAI, my [Agents - MCP Hackathon](https://huggingface.co/Agents-MCP-Hackathon) submission.
+This app falls under the track 3 : Agentic Demo.
+The goal of the app is to demonstrate the capabilities of agentic llm's combined with more "traditional" deep learning computer vision.
+LLM's (and VLM's) are great models when it comes to interacting with the user and understanding its queries but are not (yet) capable of a precise perception of the images presented to them.
+Computer Vision models like object detection or image segmentation models are tailored models to accomplish these tasks but require some engineering to wrap them and be user ready.
+The idea of the agentic demo is to provide powerful LLM with access to expert vision models like object detection or image segmentation models.
+The agent can fulfill precise perception task on any object present in the image : detection, location, classification, masking, counting, etc...
+##
+In this preliminary app, the agent is a CodeAgent (provided by the smolagents framework) provided with access to a set of tools :
+- Any object detection and image segmentation models available of HuggingFace
+- Image processing functions
+- Image annotation functions
+To complete a user request
+## Use-cases
+## Stack
+Agent framework : smolagents
+LLM : Anthropic
+Compute : Modal