File size: 2,129 Bytes

---
license: apache-2.0
tags:
- Computer
- computervision
---

# Uses

This LLM is trained on data generated by my code for the yolov8 model. [Github code](https://github.com/bauerhartmut/yolov8-Computervision)
The model is capable of briefly describing what the yolov8 model can detect and can also execute a command (/click). 
When the command is triggered, a dictionary is generated containing the key data of the object to be clicked.

# Testing
You can test the model by giving it this informations:

```json
{
    "Object": [
        {
            "index": "window_0",
            "label": "window",
            "property": "toplayer",
            "coords": [
                189.06007385253906,
                79.33326721191406,
                1156.018798828125,
                750.1478271484375
            ],
            "textes": 24,
            "interactions": [
                {
                    "label": "close_window",
                    "interaction_type": 1,
                    "coords": [
                        1114.04541015625,
                        84.65348815917969,
                        1149.1778564453125,
                        113.41248321533203
                    ]
                },
                {
                    "label": "maximize",
                    "interaction_type": 1,
                    "coords": [
                        1067.0111083984375,
                        84.82215118408203,
                        1099.86328125,
                        112.69491577148438
                    ]
                },
                {
                    "label": "minize_window",
                    "interaction_type": 1,
                    "coords": [
                        1024.7701416015625,
                        85.06327819824219,
                        1053.4327392578125,
                        111.52396392822266
                    ]
                }
            ]
        }
    ]
}
```

You can give the model this informations and a prompt like "Was siehst du" or "Kannst du das Fenster schließen".

The Model is at the moment only trained on german.