# PConv2D_Keras

**Repository Path**: jasonli666/PConv2D_Keras

## Basic Information

- **Project Name**: PConv2D_Keras
- **Description**: Image inpainting with partial convolutions
- **Primary Language**: Python
- **License**: MIT
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 1
- **Forks**: 0
- **Created**: 2020-03-22
- **Last Updated**: 2024-05-28

## Categories & Tags

**Categories**: Uncategorized
**Tags**: None

## README

# Image Inpainting Based on Partial Convolutions in Keras

Unofficial implementation of [Liu et al., 2018. Image Inpainting for Irregular Holes Using Partial Convolutions](https://arxiv.org/abs/1804.07723).

This implementation was inspired by and is partially based on an early version of [this repository](https://github.com/MathiasGruber/PConv-Keras). Many ideas, e.g. the random mask generator using OpenCV, were taken from there and used here.

## Requirements

- Python 3.6
- TensorFlow 1.13
- Keras 2.2.4
- OpenCV and NumPy (for the mask generator)

## How to run the code

First, set proper paths to your datasets (```IMG_DIR_TRAIN``` and ```IMG_DIR_VAL``` in the ```inpainter_main.py``` file). Note that the code uses the ImageDataGenerator class from Keras, so these paths should point one level above the image directories: if your training images are stored in ```path/to/train/images/dir/subdir/```, set ```IMG_DIR_TRAIN = path/to/train/images/dir/```. If ```path/to/train/images/dir/``` contains more than one subdirectory (e.g. one per class), all of them will be used in training.

Second, download the VGG16 weights ported from PyTorch [here](https://github.com/ezavarygin/vgg16_pytorch2keras) and set ```VGG16_WEIGHTS``` in the ```inpainter_main.py``` file to the path to these weights.

When the paths are set, run the code:

```
python inpainter_main.py
```

This will start the initial training (stage 1). When it is complete, set ```STAGE_1``` to ```False``` and ```LAST_CHECKPOINT``` to the path to the checkpoint from the last epoch of stage 1, then run the code again. This will start the fine-tuning (stage 2).

You can also do all of this in the provided Jupyter notebook.

## VGG16 model for feature extraction

The authors of the paper used PyTorch to implement the model and chose VGG16 for feature extraction. The [VGG16 model in PyTorch](https://pytorch.org/docs/stable/torchvision/models.html) was trained with the following image pre-processing:

1. Divide the image by 255,
2. Subtract [0.485, 0.456, 0.406] from the RGB channels, respectively,
3. Divide the RGB channels by [0.229, 0.224, 0.225], respectively.

The same pre-processing scheme was used in the paper. The [VGG16 model in Keras](https://keras.io/applications/#vgg16) comes with weights ported from the original Caffe implementation and expects a different image pre-processing:

1. Convert the images from RGB to BGR,
2. Subtract [103.939, 116.779, 123.68] from the BGR channels, respectively.

Because of the different pre-processing, features extracted with the PyTorch and Keras VGG16 models have different scales. If we were to use the built-in VGG16 model in Keras, we would need to modify the loss term normalizations in Eq. 7 of the paper. To avoid this, the weights of the VGG16 model were ported from PyTorch using [this script](https://github.com/ezavarygin/vgg16_pytorch2keras) and are provided in the file ```data/vgg16_weights/vgg16_pytorch2keras.h5```. The PyTorch-style image pre-processing is used in the code.
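For concreteness, the three PyTorch-style steps above amount to the following transformation. This is a minimal sketch assuming images arrive as 8-bit RGB NumPy arrays; the helper name is illustrative and is not part of the repository's code.

```python
import numpy as np

# Channel statistics used by the torchvision VGG16 model (RGB order).
PYTORCH_MEAN = np.array([0.485, 0.456, 0.406], dtype=np.float32)
PYTORCH_STD = np.array([0.229, 0.224, 0.225], dtype=np.float32)

def preprocess_pytorch_style(image):
    """Illustrative helper (not the repository's API).

    `image` is assumed to be an RGB array of shape (height, width, 3)
    with values in [0, 255].
    """
    image = image.astype(np.float32) / 255.0       # step 1: scale to [0, 1]
    image = (image - PYTORCH_MEAN) / PYTORCH_STD   # steps 2-3: normalize per channel
    return image
```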
## Mask dataset

Random masks consisting of circles and lines are generated using the OpenCV library. The mask generator is a modified version of the one used [here](https://github.com/MathiasGruber/PConv-Keras); it was modified to generate reproducible masks for validation images. To get consistent masks for validation images, i.e. the same set of masks after each epoch, make sure the number of images in your validation set equals the product of ```STEPS_VAL``` and ```BATCH_SIZE_VAL``` (see the sanity-check sketch at the end of this README). I used 400 validation images with ```STEPS_VAL = 100``` and ```BATCH_SIZE_VAL = 4```. You might need to change these parameters if you want to use more or fewer validation images with consistent masks between epochs.

## Image dataset

The examples shown below were generated using the model trained on the [Open Images Dataset](https://storage.googleapis.com/openimages/web/index.html) (the subset with bounding boxes, partitions 1 to 5). You can train the model using other datasets.

## Training

The model was trained in two steps:

1. Initial training (BatchNorm enabled): 70 epochs with learning rate 0.0002, then 10 epochs with learning rate 0.0001,
2. Fine-tuning (BatchNorm disabled in the encoder): 40 epochs with learning rate 0.00005, with a batch size of 5 and 2500 steps per epoch.

![Training history](data/history/training_history.png?raw=true "Training history")

The weights of the trained model can be downloaded via [this link](https://drive.google.com/open?id=1XdcKQASsa8mtpPIt3aAkcvije0U1J2Fy).

## Inpainting results

![Examples](data/examples/examples.png?raw=true "Examples")

## Comments

I cannot reach the same inpainting quality as was demonstrated in the paper. Suggestions and bug reports are welcome.

## Acknowledgements

A big thank you goes to [Mathias Gruber](https://github.com/MathiasGruber) for making his repository public and to [Guilin Liu](https://github.com/liuguilin1225) for his feedback on the losses and the image pre-processing scheme used in the paper.
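As referenced in the Mask dataset section, below is a minimal stand-alone sketch of the validation-set sanity check. The constant values mirror the settings quoted in that section, but the check itself is an illustration and is not part of ```inpainter_main.py```.

```python
import glob
import os

# Mirror the validation settings from inpainter_main.py
# (the values quoted in the Mask dataset section; adjust to your setup).
IMG_DIR_VAL = 'path/to/val/images/dir/'
STEPS_VAL = 100
BATCH_SIZE_VAL = 4

# ImageDataGenerator reads images from the subdirectories of IMG_DIR_VAL,
# so count the files one level down.
n_val = len(glob.glob(os.path.join(IMG_DIR_VAL, '*', '*')))

# Masks repeat identically across epochs only if every validation image
# is seen exactly once per epoch.
assert n_val == STEPS_VAL * BATCH_SIZE_VAL, (
    f'{n_val} validation images, but STEPS_VAL * BATCH_SIZE_VAL = '
    f'{STEPS_VAL * BATCH_SIZE_VAL}'
)
```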