Smogy/SMOGY-Ai-images-detector

Image Classification

Krrer committed on Dec 2, 2024

Commit a062bdf · verified · 1 Parent(s): 4dbfff3

Evaluation metrics

Files changed (1)

README.md +20 -4

@@ -2,19 +2,35 @@
 license: cc-by-nc-4.0
 base_model:
 - Organika/sdxl-detector
 ---
 # AI-image-detector
 The purpose of this model is to classify images as AI generated or Real.
 This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description.
 Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...).
 Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece.
 The final dataset consisted of more than 50k images.
 The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models.
 Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models.
-The data used to fine-tune this model was scraped from image dedicated subreddits, some of which may be copyrighted. For this reason, this model should be considered appropriate only for non-comercial use only.

 license: cc-by-nc-4.0
 base_model:
 - Organika/sdxl-detector
+library_name: transformers
+tags:
+- image-classification
 ---
 # AI-image-detector
 The purpose of this model is to classify images as AI generated or Real.
+### Dataset
 This model was created by fine-tuning the [Organika/sdxl-detector] on dataset of AI generated and real images from reddit, kaggle and real art from public domain with their text description.
 Dataset was balanced to have similar number of real and generated images in each class (e.g. art, photos ...).
 Art images from public domain were paired with generated equivalent created from their text descriptions with style transfer (sdxl with ip-adapter) from original piece.
 The final dataset consisted of more than 50k images.
+### Testing
 The testing dataset consisted of 20% split of our base dataset and images outside the training domain from specific popular (as of 2024) image generation models.
 Finetuning vastly improved performance over Organika/sdxl-detector during testing, especially on images created by newer models.
+Test split evaluation
+| Accuracy | Precision | Recall | F1     |
+|:--------:|:---------:|:------:|:------:|
+| 0.9818   | 0.9829    | 0.9810 | 0.9819 |
+Out-of-domain evaluation
+| Generative Model Family | Accuracy |
+|:-----------------------:|:--------:|
+| DALL-E                  | 0.9076   |
+| FluxAi                  | 0.8333   |
+| Imagen                  | 0.7563   |
+| StableDiffusion         | 0.8754   |
+### License
+The data used to fine-tune this model was scraped from image-dedicated subreddits, and some of it may be copyrighted. For this reason, this model should be considered appropriate for non-commercial use only.
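
As a quick consistency check on the test split row added in this commit, F1 can be recomputed from the reported precision and recall. The sketch below is illustrative only: the helper name `f1_from_precision_recall` is hypothetical and not part of the model repository, and it assumes the reported F1 is the standard harmonic mean of precision and recall for a single positive class, which the README does not state explicitly.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the usual F1 score)."""
    if precision + recall == 0:
        # Both metrics are zero, so F1 is defined as zero to avoid dividing by zero.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the "Test split evaluation" table in the diff.
reported_precision = 0.9829
reported_recall = 0.9810
reported_f1 = 0.9819

f1 = f1_from_precision_recall(reported_precision, reported_recall)

# The inputs are rounded to four decimals, so compare with a small tolerance.
consistent = abs(f1 - reported_f1) < 1e-4
print(f"recomputed F1 = {f1:.4f}, consistent with table: {consistent}")
```

Under that assumption the recomputed value agrees with the table to within rounding. Accuracy cannot be cross-checked the same way, because the README does not report the confusion matrix or the class counts.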