lamiaaEl's picture
Update README.md
f8e2a20 verified
---
pipeline_tag: object-detection
tags:
- code
---
# language-levels-yolov10m
This repository contains the fine-tuning script and weights of YOLOv10m on languge levels dataset.
### The language levels dataset :
- The dataset contains about 50000 images (some corrupt images were removed).
### The training :
- We fine-tuned YOLOv10m with this configuration :
Model: yolov10m.pt
Epochs: 10
Batch: 12
device: [0,1] (GPU T4 x 2)
size : {'width': 799, 'height': 151}
- Training time : Wall time: 3h 24min 42s
### The results :
##### Confusion matrix :
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/fLDLyFvBAeR8fxmcFUlvH.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/FXMHDRp_UBx3glHj-mwUx.png)
##### Training loss :
Best Training Box loss: 2.0493 , on epoch: 10
Best Validation Box loss: 1.7033 , on epoch: 10
==================================================
Best Training Cls loss: 1.1808 , on epoch: 10
Best Validation Cls loss: 0.84118 , on epoch: 10
==================================================
Best Training DFL loss: 1.7253 , on epoch: 10
Best Validation DFL loss: 1.6617 , on epoch: 10
##### Precision, F1 and recall :
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/HSCdXds_wtmbmD57to8g2.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/XfPWbopRLdTXrnvv7e64W.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/uTe0h65KUaLh_iv0kRS46.png)