sczhou committed
Commit 00fc5a8 · 1 Parent(s): 5b3ad16

support video input.

Files changed (2):
  1. README.md +16 -3
  2. inference_codeformer.py +41 -9
README.md CHANGED
@@ -23,6 +23,7 @@ S-Lab, Nanyang Technological University
 
 **[<font color=#d1585d>News</font>]**: :whale: *Due to copyright issues, we have to delay the release of the training code (expected by the end of this year). Please star and stay tuned for our future updates!*
 ### Update
+- **2022.10.05**: Support video input `--test_path [YOUR_VIDEO.mp4]`. Try it to enhance your videos! :clapper:
 - **2022.09.14**: Integrated to :hugs: [Hugging Face](https://huggingface.co/spaces). Try out the online demo! [![Hugging Face](https://img.shields.io/badge/Demo-%F0%9F%A4%97%20Hugging%20Face-blue)](https://huggingface.co/spaces/sczhou/CodeFormer)
 - **2022.09.09**: Integrated to :rocket: [Replicate](https://replicate.com/explore). Try out the online demo! [![Replicate](https://img.shields.io/badge/Demo-%F0%9F%9A%80%20Replicate-blue)](https://replicate.com/sczhou/codeformer)
 - **2022.09.04**: Add face upsampling `--face_upsample` for high-resolution AI-created face enhancement.
@@ -94,18 +95,30 @@ You can put the testing images in the `inputs/TestWhole` folder. If you would li
 
 
 #### Testing on Face Restoration:
-[Note] When comparing our model in your paper, please run the following command with `--has_aligned` (for cropped and aligned faces), as the whole-image command involves a face-background fusion step that may damage hair texture at the boundary, leading to an unfair comparison.
+[Note] If you want to compare CodeFormer in your paper, please run the following command with `--has_aligned` (for cropped and aligned faces), as the whole-image command involves a face-background fusion step that may damage hair texture at the boundary, leading to an unfair comparison.
+
+👨🏻 Face Restoration (cropped and aligned faces)
 ```
 # For cropped and aligned faces
 python inference_codeformer.py --w 0.5 --has_aligned --test_path [input folder]
 ```
+
+:framed_picture: Whole Image Enhancement
 ```
-# For the whole images
+# For whole images
 # Add '--bg_upsampler realesrgan' to enhance the background regions with Real-ESRGAN
 # Add '--face_upsample' to further upsample the restored faces with Real-ESRGAN
-python inference_codeformer.py --w 0.7 --test_path [input folder/image path]
+python inference_codeformer.py --w 1.0 --test_path [input folder/image path]
+```
+
+:clapper: Video Enhancement
+```
+# For video clips
+# Set the frame rate of the saved video via '--save_video_fps 24'
+python inference_codeformer.py --bg_upsampler realesrgan --face_upsample --w 0.7 --test_path [video path] --save_video_fps 24
 ```
 
 Fidelity weight *w* lies in [0, 1]. Generally, a smaller *w* tends to produce a higher-quality result, while a larger *w* yields a higher-fidelity result.
 
 The results will be saved in the `results` folder.
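The three commands above are dispatched on the file extension of `--test_path` (single image, video clip, or image folder). A minimal sketch of that dispatch, using a hypothetical `classify_input` helper (the script itself inlines this logic):

```python
import os

# Extension tuples mirror those checked in inference_codeformer.py.
IMG_EXTS = ('jpg', 'png')
VIDEO_EXTS = ('mp4', 'mov', 'avi')

def classify_input(test_path):
    """Return which processing branch a given --test_path would take.

    Hypothetical helper for illustration only.
    """
    if test_path.endswith(IMG_EXTS):       # str.endswith accepts a tuple of suffixes
        return 'single_image'
    elif test_path.endswith(VIDEO_EXTS):
        return 'video'
    else:                                  # anything else is treated as a folder of images
        return 'image_folder'

print(classify_input('inputs/face.png'))   # → single_image
print(classify_input('inputs/clip.mp4'))   # → video
print(classify_input('inputs/TestWhole'))  # → image_folder
```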
inference_codeformer.py CHANGED
@@ -64,6 +64,7 @@ if __name__ == '__main__':
     parser.add_argument('--bg_upsampler', type=str, default='None', help='background upsampler. Optional: realesrgan')
     parser.add_argument('--face_upsample', action='store_true', help='face upsampler after enhancement.')
     parser.add_argument('--bg_tile', type=int, default=400, help='Tile size for background sampler. Default: 400')
+    parser.add_argument('--save_video_fps', type=int, default=24, help='frame rate for saving video. Default: 24')
 
     args = parser.parse_args()
 
@@ -72,15 +73,25 @@ if __name__ == '__main__':
+    input_video = False
     if args.test_path.endswith(('jpg', 'png')): # input single img path
         input_img_list = [args.test_path]
         result_root = f'results/test_img_{w}'
-
+    elif args.test_path.endswith(('mp4', 'mov', 'avi')): # input video path
+        input_img_list = []
+        vidcap = cv2.VideoCapture(args.test_path)
+        success, image = vidcap.read()
+        while success:
+            input_img_list.append(image)
+            success, image = vidcap.read()
+        input_video = True
+        video_name = os.path.basename(args.test_path)[:-4]
+        result_root = f'results/{video_name}_{w}'
     else: # input img folder
         if args.test_path.endswith('/'): # strip trailing /
             args.test_path = args.test_path[:-1]
-
+        # scan all the jpg and png images
         input_img_list = sorted(glob.glob(os.path.join(args.test_path, '*.[jp][pn]g')))
         result_root = f'results/{os.path.basename(args.test_path)}_{w}'
 
-
+    test_img_num = len(input_img_list)
     # ------------------ set up background upsampler ------------------
     if args.bg_upsampler == 'realesrgan':
         bg_upsampler = set_realesrgan()
@@ -128,15 +139,20 @@ if __name__ == '__main__':
         device=device)
 
     # -------------------- start to processing ---------------------
-    # scan all the jpg and png images
-    for img_path in input_img_list:
+    for i, img_path in enumerate(input_img_list):
         # clean all the intermediate results to process the next image
         face_helper.clean_all()
 
-        img_name = os.path.basename(img_path)
-        print(f'Processing: {img_name}')
-        basename, ext = os.path.splitext(img_name)
-        img = cv2.imread(img_path, cv2.IMREAD_COLOR)
+        if isinstance(img_path, str): # image or folder input
+            img_name = os.path.basename(img_path)
+            basename, ext = os.path.splitext(img_name)
+            print(f'[{i+1}/{test_img_num}] Processing: {img_name}')
+            img = cv2.imread(img_path, cv2.IMREAD_COLOR)
+        else: # video input: img_path is already a decoded frame
+            basename = str(i).zfill(6)
+            img_name = f'{video_name}_{basename}' if input_video else basename
+            print(f'[{i+1}/{test_img_num}] Processing: {img_name}')
+            img = img_path
 
         if args.has_aligned:
             # the input faces are already cropped and aligned
@@ -208,4 +224,21 @@ if __name__ == '__main__':
         save_restore_path = os.path.join(result_root, 'final_results', f'{basename}.png')
         imwrite(restored_img, save_restore_path)
 
+    # save enhanced video
+    if input_video:
+        # load the restored frames back from disk
+        video_frames = []
+        img_list = sorted(glob.glob(os.path.join(result_root, 'final_results', '*.[jp][pn]g')))
+        for img_path in img_list:
+            img = cv2.imread(img_path)
+            video_frames.append(img)
+        # write the frames to an mp4 video
+        h, w = video_frames[0].shape[:2]
+        save_restore_path = os.path.join(result_root, f'{video_name}.mp4')
+        writer = cv2.VideoWriter(save_restore_path, cv2.VideoWriter_fourcc(*"mp4v"),
+                                 args.save_video_fps, (w, h))
+        for f in video_frames:
+            writer.write(f)
+        writer.release()
+
     print(f'\nAll results are saved in {result_root}')