---
language:
- eng
license: cc0-1.0
tags:
- multilabel-image-classification
- multilabel
- generated_from_trainer
base_model: drone-DinoVdeau-produttoria-probabilities-large-2024_11_06-batch-size16_freeze_probs
model-index:
- name: drone-DinoVdeau-produttoria-probabilities-large-2024_11_06-batch-size16_freeze_probs
  results: []
---

drone-DinoVdeau-produttoria-probabilities is a fine-tuned version of [drone-DinoVdeau-produttoria-probabilities-large-2024_11_06-batch-size16_freeze_probs](https://huggingface.co/drone-DinoVdeau-produttoria-probabilities-large-2024_11_06-batch-size16_freeze_probs). It achieves the following results on the test set:

- Loss: 0.3261
- F1 Micro: 0.8621
- F1 Macro: 0.8264
- Accuracy: 0.1682
- RMSE: 0.2445
- MAE: 0.1621
- R2: 0.4057

| Class | F1 per class |
|----------|-------|
| Acropore_branched | 0.8063 |
| Acropore_digitised | 0.7335 |
| Acropore_tabular | 0.6247 |
| Algae | 0.9859 |
| Dead_coral | 0.8424 |
| Fish | 0.7464 |
| Millepore | 0.6453 |
| No_acropore_encrusting | 0.7292 |
| No_acropore_massive | 0.8681 |
| No_acropore_sub_massive | 0.8092 |
| Rock | 0.9925 |
| Rubble | 0.9693 |
| Sand | 0.9904 |

---

# Model description
drone-DinoVdeau-produttoria-probabilities is a model built on top of the drone-DinoVdeau-produttoria-probabilities-large-2024_11_06-batch-size16_freeze_probs model for underwater multilabel image classification. The classification head is a combination of linear, ReLU, batch normalization, and dropout layers.

The source code for training the model can be found in this [Git repository](https://github.com/SeatizenDOI/DinoVdeau).

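As a rough illustration of the kind of classification head described above, here is a minimal PyTorch sketch. It is not the code from the training repository: the function name `build_head`, the layer order, the input size (1024), the hidden size (512), and the dropout rate (0.1) are all assumptions chosen for the example. Only the layer types and the 13 output classes come from this card.

```python
import torch
from torch import nn


def build_head(in_features: int = 1024, hidden: int = 512,
               num_labels: int = 13, p_drop: float = 0.1) -> nn.Sequential:
    """Hypothetical head: linear -> batch norm -> ReLU -> dropout -> linear.

    All sizes and the layer order are illustrative assumptions.
    """
    return nn.Sequential(
        nn.Linear(in_features, hidden),
        nn.BatchNorm1d(hidden),
        nn.ReLU(),
        nn.Dropout(p_drop),
        nn.Linear(hidden, num_labels),
    )


head = build_head()
head.eval()  # use running batch-norm statistics and disable dropout

# Random stand-in for the embeddings produced by the frozen encoder.
features = torch.randn(4, 1024)
with torch.no_grad():
    # Sigmoid gives one independent score per label (multilabel setting).
    probs = torch.sigmoid(head(features))
```

Each row of `probs` holds 13 scores in the range 0 to 1, one per class listed in the table above. With a frozen encoder, only the parameters of a head like this would be updated during training.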
- **Developed by:** [lombardata](https://huggingface.co/lombardata), credits to [César Leblanc](https://huggingface.co/CesarLeblanc) and [Victor Illien](https://huggingface.co/groderg)

---

# Intended uses & limitations
You can use the raw model to classify diverse marine species, encompassing coral morphotype classes taken from the Global Coral Reef Monitoring Network (GCRMN), habitat classes and seagrass species.

---

# Training and evaluation data
Details on the estimated number of images for each class are given in the following table:

| Class                   |   train |   test |   val |   Total |
|:------------------------|--------:|-------:|------:|--------:|
| Acropore_branched       |    2028 |    684 |   686 |    3398 |
| Acropore_digitised      |    2006 |    735 |   717 |    3458 |
| Acropore_tabular        |    1237 |    461 |   451 |    2149 |
| Algae                   |   11086 |   3671 |  3675 |   18432 |
| Dead_coral              |    6354 |   2161 |  2147 |   10662 |
| Fish                    |    4032 |   1430 |  1430 |    6892 |
| Millepore               |    1943 |    783 |   772 |    3498 |
| No_acropore_encrusting  |    2663 |    986 |   957 |    4606 |
| No_acropore_massive     |    6897 |   2375 |  2375 |   11647 |
| No_acropore_sub_massive |    5416 |   1988 |  1958 |    9362 |
| Rock                    |   11164 |   3726 |  3725 |   18615 |
| Rubble                  |   10687 |   3570 |  3572 |   17829 |
| Sand                    |   11151 |   3726 |  3723 |   18600 |

---

# Training procedure

## Training hyperparameters

The following hyperparameters were used during training:

- **Number of Epochs**: 45.0
- **Learning Rate**: 0.001
- **Train Batch Size**: 16
- **Eval Batch Size**: 16
- **Optimizer**: Adam
- **LR Scheduler Type**: ReduceLROnPlateau with a patience of 5 epochs and a factor of 0.1
- **Freeze Encoder**: Yes
- **Data Augmentation**: Yes

## Data Augmentation
Data were augmented using the following transformations:

Train Transforms
- **PreProcess**: No additional parameters
- **Resize**: probability=1.00
- **RandomHorizontalFlip**: probability=0.25
- **RandomVerticalFlip**: probability=0.25
- **ColorJiggle**: probability=0.25
- **RandomPerspective**: probability=0.25
- **Normalize**: probability=1.00

Val Transforms
- **PreProcess**: No additional parameters
- **Resize**: probability=1.00
- **Normalize**: probability=1.00

## Training results
Epoch | Validation Loss | MAE | RMSE | R2 | Learning Rate
--- | --- | --- | --- | --- | ---
0 | N/A | 0.0000 | 0.0000 | 0.0000 | 0.001
1 | 0.36246591806411743 | 0.1880 | 0.2669 | 0.2744 | 0.001
2 | 0.3457428216934204 | 0.1685 | 0.2560 | 0.3367 | 0.001
3 | 0.3518487811088562 | 0.1747 | 0.2597 | 0.3157 | 0.001
4 | 0.3507988750934601 | 0.1751 | 0.2563 | 0.3345 | 0.001
5 | 0.3436409533023834 | 0.1696 | 0.2546 | 0.3371 | 0.001
6 | 0.35096481442451477 | 0.1767 | 0.2598 | 0.3175 | 0.001
7 | 0.3412320613861084 | 0.1750 | 0.2538 | 0.3471 | 0.001
8 | 0.3456409275531769 | 0.1678 | 0.2561 | 0.3435 | 0.001
9 | 0.3425351679325104 | 0.1741 | 0.2545 | 0.3409 | 0.001
10 | 0.33964109420776367 | 0.1711 | 0.2525 | 0.3583 | 0.001
11 | 0.34479108452796936 | 0.1721 | 0.2542 | 0.3498 | 0.001
12 | 0.3415849804878235 | 0.1767 | 0.2527 | 0.3577 | 0.001
13 | 0.33990854024887085 | 0.1677 | 0.2527 | 0.3523 | 0.001
14 | 0.34520208835601807 | 0.1746 | 0.2540 | 0.3443 | 0.001
15 | 0.34849879145622253 | 0.1801 | 0.2568 | 0.3333 | 0.001
16 | 0.34347954392433167 | 0.1718 | 0.2537 | 0.3473 | 0.001
17 | 0.341246634721756 | 0.1711 | 0.2508 | 0.3633 | 0.0001
18 | 0.3398562967777252 | 0.1708 | 0.2507 | 0.3649 | 0.0001
19 | 0.3332718312740326 | 0.1675 | 0.2483 | 0.3775 | 0.0001
20 | 0.333162784576416 | 0.1688 | 0.2478 | 0.3810 | 0.0001
21 | 0.3324449062347412 | 0.1673 | 0.2476 | 0.3810 | 0.0001
22 | 0.3320053517818451 | 0.1671 | 0.2472 | 0.3836 | 0.0001
23 | 0.3301050662994385 | 0.1658 | 0.2461 | 0.3890 | 0.0001
24 | 0.3298528492450714 | 0.1648 | 0.2458 | 0.3899 | 0.0001
25 | 0.32962867617607117 | 0.1641 | 0.2458 | 0.3903 | 0.0001
26 | 0.32889437675476074 | 0.1632 | 0.2454 | 0.3926 | 0.0001
27 | 0.33042922616004944 | 0.1674 | 0.2461 | 0.3891 | 0.0001
28 | 0.32880541682243347 | 0.1645 | 0.2451 | 0.3955 | 0.0001
29 | 0.3293789327144623 | 0.1656 | 0.2451 | 0.3961 | 0.0001
30 | 0.33135533332824707 | 0.1684 | 0.2464 | 0.3914 | 0.0001
31 | 0.32911789417266846 | 0.1608 | 0.2457 | 0.3904 | 0.0001
32 | 0.3289436399936676 | 0.1631 | 0.2453 | 0.3959 | 0.0001
33 | 0.3271527588367462 | 0.1628 | 0.2444 | 0.3972 | 0.0001
34 | 0.32699429988861084 | 0.1621 | 0.2443 | 0.3976 | 0.0001
35 | 0.32638314366340637 | 0.1615 | 0.2439 | 0.3987 | 0.0001
36 | 0.3293066918849945 | 0.1656 | 0.2455 | 0.3946 | 0.0001
37 | 0.3271186649799347 | 0.1597 | 0.2442 | 0.3996 | 0.0001
38 | 0.32695677876472473 | 0.1613 | 0.2437 | 0.4022 | 0.0001
39 | 0.33263665437698364 | 0.1575 | 0.2438 | 0.4007 | 0.0001
40 | 0.33278176188468933 | 0.1651 | 0.2442 | 0.4003 | 0.0001
41 | 0.33069443702697754 | 0.1627 | 0.2435 | 0.4031 | 0.0001
42 | 0.3310275375843048 | 0.1641 | 0.2436 | 0.4030 | 1e-05
43 | 0.32956016063690186 | 0.1603 | 0.2429 | 0.4052 | 1e-05
44 | 0.33022987842559814 | 0.1625 | 0.2432 | 0.4038 | 1e-05
45 | 0.3266430199146271 | 0.1617 | 0.2430 | 0.4047 | 1e-05

---

# Framework Versions

- **Transformers**: 4.41.0
- **Pytorch**: 2.5.0+cu124
- **Datasets**: 3.0.2
- **Tokenizers**: 0.19.1
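
The Learning Rate column of the training results can be related to the ReduceLROnPlateau settings listed under the training hyperparameters (patience 5, factor 0.1). The following pure-Python sketch replays the plateau rule on the validation losses from the table, rounded to six decimals. The helper name `simulate_reduce_on_plateau` is hypothetical, and the relative improvement threshold of 1e-4 is an assumption: it is PyTorch's default for this scheduler, but the card does not state which value was used in training.

```python
def simulate_reduce_on_plateau(losses, lr=1e-3, factor=0.1,
                               patience=5, threshold=1e-4):
    """Replay a 'min'-mode plateau rule; return (epoch, lr) for each reduction.

    The returned epoch is the first epoch trained with the reduced rate.
    threshold=1e-4 (relative) is an assumed value, not taken from the card.
    """
    best = float("inf")
    bad_epochs = 0
    reductions = []
    for epoch, loss in enumerate(losses, start=1):
        if loss < best * (1.0 - threshold):
            best = loss          # significant improvement: reset the counter
            bad_epochs = 0
        else:
            bad_epochs += 1
        if bad_epochs > patience:
            lr *= factor         # plateau detected: shrink the learning rate
            bad_epochs = 0
            reductions.append((epoch + 1, lr))
    return reductions


# Validation losses for epochs 1 to 45, copied from the table and rounded.
val_losses = [
    0.362466, 0.345743, 0.351849, 0.350799, 0.343641, 0.350965, 0.341232,
    0.345641, 0.342535, 0.339641, 0.344791, 0.341585, 0.339909, 0.345202,
    0.348499, 0.343480, 0.341247, 0.339856, 0.333272, 0.333163, 0.332445,
    0.332005, 0.330105, 0.329853, 0.329629, 0.328894, 0.330429, 0.328805,
    0.329379, 0.331355, 0.329118, 0.328944, 0.327153, 0.326994, 0.326383,
    0.329307, 0.327119, 0.326957, 0.332637, 0.332782, 0.330694, 0.331028,
    0.329560, 0.330230, 0.326643,
]

reductions = simulate_reduce_on_plateau(val_losses)
reduction_epochs = [epoch for epoch, _ in reductions]
print(reduction_epochs)
```

Under these assumptions the replay reports reductions taking effect at epochs 17 and 42, which matches the points where the Learning Rate column changes to 0.0001 and 1e-05.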