Evaluation metrics
Browse files
README.md
CHANGED
@@ -2,19 +2,35 @@
|
|
2 |
license: cc-by-nc-4.0
|
3 |
base_model:
|
4 |
- Organika/sdxl-detector
|
|
|
|
|
|
|
5 |
---
|
6 |
# AI-image-detector
|
7 |
The purpose of this model is to classify images as AI generated or Real.
|
8 |
-
|
9 |
This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description.
|
10 |
Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...).
|
11 |
Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece.
|
12 |
The final dataset consisted of more than 50k images.
|
13 |
-
|
14 |
The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models.
|
15 |
Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models.
|
16 |
|
17 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
|
19 |
|
20 |
-
|
|
|
|
2 |
license: cc-by-nc-4.0
|
3 |
base_model:
|
4 |
- Organika/sdxl-detector
|
5 |
+
library_name: transformers
|
6 |
+
tags:
|
7 |
+
- image-classification
|
8 |
---
|
9 |
# AI-image-detector
|
10 |
The purpose of this model is to classify images as AI generated or Real.
|
11 |
+
### Dataset
|
12 |
This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description.
|
13 |
Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...).
|
14 |
Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece.
|
15 |
The final dataset consisted of more than 50k images.
|
16 |
+
### Testing
|
17 |
The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models.
|
18 |
Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models.
|
19 |
|
20 |
+
|
21 |
+
Test split evaluation
|
22 |
+
| Accuracy | Precision | Recall | F1 |
|
23 |
+
|:-------------:|:---------------:|:--------:|:--------:|
|
24 |
+
| 0.9818 | 0.9829 | 0.9810 | 0.9819 |
|
25 |
+
|
26 |
+
Out of domain evaluation
|
27 |
+
| Generative Model Family | Accuracy |
|
28 |
+
|:-------------:|:---------------:|
|
29 |
+
| DALL-E | 0.9076 |
|
30 |
+
| FluxAi | 0.8333 |
|
31 |
+
| Imagen | 0.7563 |
|
32 |
+
| StableDiffusion | 0.8754 |
|
33 |
|
34 |
|
35 |
+
### License
|
36 |
+
The data used to fine-tune this model was scraped from image dedicated subreddits, some of which may be copyrighted. For this reason, this model should be considered appropriate only for non-commercial use only.
|