license: cc-by-nc-sa-4.0 | |
pipeline_tag: image-to-image | |
tags: | |
- pytorch | |
- super-resolution | |
**Name:** 4x-UltraSharp (config and presets included!) | |
**License:** CC BY-NC-SA 4.0 | |
**Link:** <https://mega.nz/folder/qZRBmaIY#nIG8KyWFcGNTuMX_XNbJ_g> | |
**Model Architecture:** ESRGAN | |
**Scale:** 4 | |
**Purpose:** Any, it's universal. This model performs best on JPEG compressed images. | |
**Iterations:** 150k | |
**batch_size:** 4-8 | |
**HR_size:** 128 | |
**Epoch:** ~480 | |
**Dataset:** So many. I used: RAW images shot by myself, SignatureEdits, AdobeMIT-5K, DIV2K, TLOK from brucethemoose, some rock/stone images from ALSA, and many images provided by @esrgan (thanks!) | |
**Dataset_size:** uh, ignore this. anywhere between 2k and 8k full size images throughout training | |
**OTF Training** Yes (custom augmentation presets) | |
**Pretrained_Model_G:** 4x-UniScale-Balanced | |
**Description:** This is my best model yet! It generates lots and lots of detail and leaves a nice texture on images. It works on most images, whether compressed or not. It does work best on JPEG compression though, as that's mostly what it was trained on. It has the ability to restore highly compressed images as well! | |
The model was trained with KernelGAN (thanks musl for supplying the blur kernels), noise patches, custom augmentation presets (are in with the model), and the losses: pixel, feature, cx, ssim, lpips, and fft. Mixup was used for a while, but abandedoned due to stability issues. | |
Gradient Clipping helped immensely with model stability throughout training. | |
Big thanks to musl for giving advice on how to further improve the model! |