UltraSharp / README.md
OzzyGT's picture
OzzyGT HF staff
Add tags for easy search and to be seen as a linked model in spaces
3cc0ae5 verified
|
raw
history blame
1.61 kB
metadata
license: cc-by-nc-sa-4.0
pipeline_tag: image-to-image
tags:
  - pytorch
  - super-resolution

Name: 4x-UltraSharp (config and presets included!)

License: CC BY-NC-SA 4.0

Link: https://mega.nz/folder/qZRBmaIY#nIG8KyWFcGNTuMX_XNbJ_g

Model Architecture: ESRGAN

Scale: 4

Purpose: Any, it's universal. This model performs best on JPEG compressed images.

Iterations: 150k

batch_size: 4-8

HR_size: 128

Epoch: ~480

Dataset: So many. I used: RAW images shot by myself, SignatureEdits, AdobeMIT-5K, DIV2K, TLOK from brucethemoose, some rock/stone images from ALSA, and many images provided by @esrgan (thanks!)

Dataset_size: uh, ignore this. anywhere between 2k and 8k full size images throughout training

OTF Training Yes (custom augmentation presets)

Pretrained_Model_G: 4x-UniScale-Balanced

Description: This is my best model yet! It generates lots and lots of detail and leaves a nice texture on images. It works on most images, whether compressed or not. It does work best on JPEG compression though, as that's mostly what it was trained on. It has the ability to restore highly compressed images as well!

The model was trained with KernelGAN (thanks musl for supplying the blur kernels), noise patches, custom augmentation presets (are in with the model), and the losses: pixel, feature, cx, ssim, lpips, and fft. Mixup was used for a while, but abandedoned due to stability issues.

Gradient Clipping helped immensely with model stability throughout training.

Big thanks to musl for giving advice on how to further improve the model!