---
pipeline_tag: object-detection
tags:
- code
---
# language-levels-yolov10m
This repository contains the fine-tuning script and the weights of YOLOv10m fine-tuned on the language levels dataset.

### The language levels dataset :
- The dataset contains about 50000 images (some corrupt images were removed).

### The training :
- We fine-tuned YOLOv10m with this configuration:
  - Model: `yolov10m.pt`
  - Epochs: 10
  - Batch: 12
  - Device: `[0,1]` (GPU T4 x 2)
  - Size: `{'width': 799, 'height': 151}`

- Training time (wall time): 3h 24min 42s
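
The configuration above could be expressed roughly as follows. This is a minimal sketch, not the repository's actual script: it assumes the `ultralytics` package (a version that ships YOLOv10 support), a hypothetical dataset file `data.yaml`, and a square training size obtained by rounding the 799-pixel width up to a multiple of the stride (32). The original run may have handled the 799 x 151 size differently.

```python
import math

STRIDE = 32  # assumption: training size is kept a multiple of the model's max stride


def round_up_to_stride(pixels: int, stride: int = STRIDE) -> int:
    """Round a pixel count up to the next multiple of `stride`."""
    return math.ceil(pixels / stride) * stride


# Values taken from the configuration listed above, except `imgsz`,
# which is an assumption derived from the 799-pixel width.
TRAIN_ARGS = {
    "epochs": 10,
    "batch": 12,
    "device": [0, 1],  # two T4 GPUs
    "imgsz": round_up_to_stride(799),
}


def fine_tune(data_yaml: str):
    """Fine-tune yolov10m.pt on the dataset described by `data_yaml` (hypothetical path)."""
    # Imported lazily so the configuration can be inspected without ultralytics installed.
    from ultralytics import YOLO

    model = YOLO("yolov10m.pt")
    return model.train(data=data_yaml, **TRAIN_ARGS)
```

Calling `fine_tune("data.yaml")` would start the run; with `batch=12` spread over two devices, each GPU processes 6 images per step.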

### The results :

##### Confusion matrix :


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/fLDLyFvBAeR8fxmcFUlvH.png)


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/FXMHDRp_UBx3glHj-mwUx.png)

##### Training loss :

| Loss | Best training value (epoch) | Best validation value (epoch) |
|------|-----------------------------|-------------------------------|
| Box  | 2.0493 (10)                 | 1.7033 (10)                   |
| Cls  | 1.1808 (10)                 | 0.84118 (10)                  |
| DFL  | 1.7253 (10)                 | 1.6617 (10)                   |
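
Best-loss figures like the ones above can be recovered from a per-epoch training log. The sketch below assumes a CSV log in the style of the `results.csv` that Ultralytics writes, with an `epoch` column and loss columns such as `train/box_loss` and `val/box_loss`; the file path and column names are assumptions and may differ between versions.

```python
import csv


def load_results(path: str) -> list[dict[str, str]]:
    """Read a per-epoch CSV log, stripping padding around headers and values."""
    with open(path, newline="") as f:
        return [
            {key.strip(): value.strip() for key, value in row.items()}
            for row in csv.DictReader(f)
        ]


def best_loss(rows: list[dict[str, str]], column: str) -> tuple[float, int]:
    """Return (lowest loss, epoch at which it occurred) for one loss column."""
    best = min(rows, key=lambda row: float(row[column]))
    return float(best[column]), int(float(best["epoch"]))
```

For example, `best_loss(load_results("results.csv"), "val/box_loss")` would return the lowest validation box loss and its epoch. Every best value above falls on the final epoch (10), which means the losses were still decreasing when training stopped.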

##### Precision, F1 and recall :


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/HSCdXds_wtmbmD57to8g2.png)    

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/XfPWbopRLdTXrnvv7e64W.png)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6682bf7c0b72be1367b0a69b/uTe0h65KUaLh_iv0kRS46.png)