/AIGC/ Image Generation (Pix2Pix)

2021-11-16

This project implements the Pix2Pix model, which uses a U-Net as the image generator to produce estimated light images from dark ones. The PatchGAN discriminator, structured like the contracting path of the U-Net, judges how close the estimated images are to the real light images. The generator loss consists of an adversarial loss and a reconstruction loss. The discriminator loss has two terms: the difference between the discriminator's output for real images and an array of ones, and the difference between its output for generated (fake) images and an array of zeros.
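The two losses described above can be sketched in PyTorch as follows. This is a minimal sketch, not the project's actual code. The function names are hypothetical. It assumes L1 as the reconstruction loss with a weight of 100, as in the original Pix2Pix paper, and a discriminator that scores the dark input concatenated with a candidate light image. The project may differ on all of these.

```python
import torch
import torch.nn as nn

# Assumed loss choices, not confirmed by this project.
adv_criterion = nn.BCEWithLogitsLoss()  # adversarial term on raw PatchGAN logits
recon_criterion = nn.L1Loss()           # reconstruction term (assumed to be L1)
lambda_recon = 100.0                    # assumed weight, taken from the Pix2Pix paper


def generator_loss(disc, dark, real_light, fake_light):
    """Adversarial loss plus weighted reconstruction loss."""
    # The generator wants the discriminator to label its output as real (ones).
    pred_fake = disc(torch.cat([dark, fake_light], dim=1))
    adv = adv_criterion(pred_fake, torch.ones_like(pred_fake))
    recon = recon_criterion(fake_light, real_light)
    return adv + lambda_recon * recon


def discriminator_loss(disc, dark, real_light, fake_light):
    """Real patches are compared with ones, fake patches with zeros."""
    pred_real = disc(torch.cat([dark, real_light], dim=1))
    # detach() keeps discriminator gradients from flowing into the generator.
    pred_fake = disc(torch.cat([dark, fake_light.detach()], dim=1))
    loss_real = adv_criterion(pred_real, torch.ones_like(pred_real))
    loss_fake = adv_criterion(pred_fake, torch.zeros_like(pred_fake))
    return 0.5 * (loss_real + loss_fake)


if __name__ == "__main__":
    # Toy stand-in for a PatchGAN: one strided convolution that maps a
    # 6-channel input to a grid of per-patch logits.
    toy_disc = nn.Conv2d(6, 1, kernel_size=4, stride=2, padding=1)
    dark = torch.rand(2, 3, 64, 64)
    real = torch.rand(2, 3, 64, 64)
    fake = torch.rand(2, 3, 64, 64)
    print(generator_loss(toy_disc, dark, real, fake).item())
    print(discriminator_loss(toy_disc, dark, real, fake).item())
```

Because the discriminator output is a grid of logits rather than a single scalar, `torch.ones_like` and `torch.zeros_like` build the "ones array" and "zeros array" targets with the matching shape.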

Required Python packages:

torch (cpu/cuda)
torchvision
PIL (Pillow)
matplotlib

Training Images

The training dataset consists of 170 paired, augmented dark/light toy images taken with a mobile phone at a fixed ISO value. The training progress is saved in the cell output of the trainer Jupyter notebook. The generated images visibly improve over the course of training, which indicates that the model is learning.

Testing Images

After 8500 training steps, the model produced good estimates for the training images. Three new images were then used for testing. The images are indeed brightened as expected. Surprisingly, the Pix2Pix model also estimates the shadow of the toy. The shadow does not exist in the dark image and appears only in the light image, because the light was turned on when that photo was taken. Although the shape of the generated shadow is not exactly the same as the shadow in the real image, it is pretty close. The quality of the generated images is not as good as that of the real images, and the sharpness could be improved. The model could be combined with another machine learning model that increases image resolution.

Siyue Zhang
