Krrer commited on
Commit
a062bdf
·
verified ·
1 Parent(s): 4dbfff3

Evaluation metrics

Browse files
Files changed (1) hide show
  1. README.md +20 -4
README.md CHANGED
@@ -2,19 +2,35 @@
2
  license: cc-by-nc-4.0
3
  base_model:
4
  - Organika/sdxl-detector
 
 
 
5
  ---
6
  # AI-image-detector
7
  The purpose of this model is to classify images as AI generated or Real.
8
-
9
  This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description.
10
  Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...).
11
  Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece.
12
  The final dataset consisted of more than 50k images.
13
-
14
  The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models.
15
  Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models.
16
 
17
- The data used to fine-tune this model was scraped from image dedicated subreddits, some of which may be copyrighted. For this reason, this model should be considered appropriate only for non-comercial use only.
 
 
 
 
 
 
 
 
 
 
 
 
18
 
19
 
20
-
 
 
2
  license: cc-by-nc-4.0
3
  base_model:
4
  - Organika/sdxl-detector
5
+ library_name: transformers
6
+ tags:
7
+ - image-classification
8
  ---
9
  # AI-image-detector
10
  The purpose of this model is to classify images as AI generated or Real.
11
+ ### Dataset
12
  This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description.
13
  Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...).
14
  Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece.
15
  The final dataset consisted of more than 50k images.
16
+ ### Testing
17
  The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models.
18
  Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models.
19
 
20
+
21
+ Test split evaluation
22
+ | Accuracy | Precision | Recall | F1 |
23
+ |:-------------:|:---------------:|:--------:|:--------:|
24
+ | 0.9818 | 0.9829 | 0.9810 | 0.9819 |
25
+
26
+ Out of domain evaluation
27
+ | Generative Model Family | Accuracy |
28
+ |:-------------:|:---------------:|
29
+ | DALL-E | 0.9076 |
30
+ | FluxAi | 0.8333 |
31
+ | Imagen | 0.7563 |
32
+ | StableDiffusion | 0.8754 |
33
 
34
 
35
+ ### License
36
+ The data used to fine-tune this model was scraped from image dedicated subreddits, some of which may be copyrighted. For this reason, this model should be considered appropriate only for non-commercial use only.