
Human Pose Transfer

Implemented papers

  • PG2: Pose Guided Person Image Generation (NIPS 2017)

Prepare

Requirements

  • PyTorch 1.1+ (if you use PyTorch 1.0, tensorboardX is also needed)
  • ignite
  • torchvision
  • TensorBoard
  • numpy
  • tqdm
pip install -r requirements.txt

Dataset

For a fair comparison, all implementations use the same 263,632 training pairs and 12,000 testing pairs from Market-1501 as in PATN.

| Description | Download from |
| --- | --- |
| Market-1501 dataset images | Market1501 |
| train/test splits (market-pairs-train.csv, market-pairs-test.csv) | Pose-Transfer |
| train/test keypoint annotations (market-annotation-train.csv, market-annotation-test.csv) | Pose-Transfer |
| attributes of images (not necessary for now) | Market-1501_Attribute |

Copy and rename the pair and annotation files above into ./data.
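If you want to peek at what a pairs file contains, here is a minimal sketch. The "from"/"to" column names are an assumption based on the Pose-Transfer release; check the downloaded CSV header for the real layout.

```python
# Peek at the training pairs. The "from"/"to" column names are an
# assumption; adjust them to match the real CSV header.
import csv

with open("data/market/pairs-train.csv") as f:
    for row in csv.DictReader(f):
        # source appearance image -> image showing the target pose
        print(row["from"], "->", row["to"])
        break
```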

Finally, your ./data folder looks like:

data
├── fashion
│   ├── annotation-test.csv
│   ├── annotation-train.csv
│   ├── pairs-test.csv
│   ├── pairs-train.csv
│   ├── test # WILL BE GENERATED IN NEXT STEP
│   │   ├── pose_map_image
│   │   └── pose_mask_image
│   └── train # WILL BE GENERATED IN NEXT STEP
│       ├── pose_map_image
│       └── pose_mask_image
├── market
│   ├── annotation-test.csv
│   ├── annotation-train.csv
│   ├── pairs-test.csv
│   ├── pairs-train.csv
│   ├── attribute
│   │   ├── evaluate_market_attribute.m
│   │   ├── gallery_market.mat
│   │   ├── market_attribute.mat
│   │   ├── README.md
│   │   └── sample_image.jpg
│   ├── test # WILL BE GENERATED IN NEXT STEP
│   │   ├── pose_map_image
│   │   └── pose_mask_image
│   └── train # WILL BE GENERATED IN NEXT STEP
│       ├── pose_map_image
│       └── pose_mask_image

Generate 18-channel pose maps and corresponding masks

  1. python3 tool/generate_pose_map_add_mask.py --dataset fashion --type train --x_offset 40
  2. python3 tool/generate_pose_map_add_mask.py --dataset fashion --type test --x_offset 40
  3. python3 tool/generate_pose_map_add_mask.py --dataset market --type train
  4. python3 tool/generate_pose_map_add_mask.py --dataset market --type test
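For intuition, here is a minimal sketch of what these commands produce, assuming the 18 COCO-style keypoints used by PG2/PATN, stored as (y, x) pairs with -1 marking a missing joint. tool/generate_pose_map_add_mask.py is the authoritative implementation; in particular, the real mask generation likely differs in its details (e.g. morphological dilation around limbs).

```python
# Sketch only: 18-channel Gaussian heatmaps plus a crude binary body mask.
import numpy as np

def make_pose_map(keypoints, height, width, sigma=6):
    """keypoints: 18 (y, x) pairs, (-1, -1) for missing joints.
    Returns a (height, width, 18) stack of Gaussian heatmaps."""
    pose_map = np.zeros((height, width, 18), dtype=np.float32)
    yy, xx = np.mgrid[0:height, 0:width]
    for c, (y, x) in enumerate(keypoints):
        if y < 0 or x < 0:  # joint was not detected
            continue
        pose_map[:, :, c] = np.exp(-((yy - y) ** 2 + (xx - x) ** 2) / (2.0 * sigma ** 2))
    return pose_map

def make_pose_mask(keypoints, height, width, radius=12):
    """Union of discs around the visible joints, as a rough pose mask."""
    mask = np.zeros((height, width), dtype=np.uint8)
    yy, xx = np.mgrid[0:height, 0:width]
    for y, x in keypoints:
        if y < 0 or x < 0:
            continue
        mask |= ((yy - y) ** 2 + (xx - x) ** 2 <= radius ** 2).astype(np.uint8)
    return mask
```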

Train & Test

Obviously, there is a lot of code shared between the different implementations. To avoid repeating myself, I introduced a concept called an engine: whether it is training or testing, each task can be seen as an engine, and the repetitive work (loading the config, preparing the GPU, and so on) is done in run.py. run.py is the entry point for both training and testing.
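To illustrate the idea (all names below are hypothetical, not the repository's actual API), an engine registry might look like this:

```python
# Hypothetical sketch of the engine concept; the real code (built on
# ignite) differs in its details.
ENGINES = {}

def register_engine(name):
    """Register a train/test entry point under a CLI-selectable name."""
    def wrap(fn):
        ENGINES[name] = fn
        return fn
    return wrap

@register_engine("PG2-1")
def train_pg2_stage1(config, device, output_dir):
    ...  # build the dataset/model from config and run the stage-1 loop

def run(name, config, device, output_dir):
    # run.py-style shared setup happens once (config, GPU, output folder),
    # then the chosen engine takes over.
    ENGINES[name](config, device, output_dir)
```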

The usage of run.py:

$python3 run.py -h
usage: Train [-h] -g GPU_ID -c CONFIG -o OUTPUT [-t TOML]
             {PG2-1,PG2-2,PG2-Generator}

positional arguments:
  {PG2-1,PG2-2,PG2-Generator}
                        run which?

optional arguments:
  -h, --help            show this help message and exit
  -g GPU_ID, --gpu_id GPU_ID
                        gpu_id: e.g. 0
  -c CONFIG, --config CONFIG
                        config file path
  -o OUTPUT, --output OUTPUT
                        output path
  -t TOML, --toml TOML  overwrite toml config use cli arg

During training, you can inspect the logs, generated images, and model weights with TensorBoard:

# tensorboard --logdir </path/to/checkpoint> --port <port>
tensorboard --logdir ./checkpoints/PG2-1 --port 8000

Example

# python3 run.py <engine_name> -g <gpu_id> -c </path/to/config> -o </path/to/checkpoint>
# start to train PG2 stage1.
python3 run.py PG2-1 -g 1 -c ./implementations/PG2/stage1.toml -o ./checkpoints/PG2-1
# generate images with a trained model
python3 run.py PG2-Generator -g 1 -c ./implementations/PG2/stage2.toml -o ./generated_images/PG2

I use TOML as the config format, and you can override values from the config file with CLI arguments like this:

python3 run.py PG2-1 -g 1 -c ./implementations/PG2/stage1.toml -o ./checkpoints/PG2-1 \
        -t "loss.mask_l1.weight=20" -t "train.data.replacement=true"

So, if you want to limit the number of images to generate (default: all 12,000), you can add an option: -t "generated_limit=100"
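Under the hood, such an override only needs to be parsed and merged into the nested config. A minimal sketch of these semantics (not run.py's actual code), using the toml package:

```python
# Sketch of dotted-key override semantics, not the repository's real code.
import toml

def apply_override(config, override):
    """Merge an override like 'loss.mask_l1.weight=20' into a nested dict."""
    key_path, _, raw_value = override.partition("=")
    # Reuse the TOML parser so values keep their TOML types (int, bool, ...).
    value = toml.loads("v = " + raw_value)["v"]
    node = config
    keys = key_path.strip().split(".")
    for key in keys[:-1]:
        node = node.setdefault(key, {})
    node[keys[-1]] = value

config = toml.load("./implementations/PG2/stage1.toml")
apply_override(config, "loss.mask_l1.weight=20")
apply_override(config, "train.data.replacement=true")
```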

The full command example:

python3 run.py PG2-Generator -g 1 -c ./implementations/PG2/stage2.toml -t "generated_limit=100"  -t "model.generator1.pretrained_path='./checkpoint/PG2-1/network_G1_26000.pth'" -t "model.generator2.pretrained_path='./checkpoint/PG2-2_26000/network_G2_13000.pth'" -o generated_images

Implementation results

PG2

First, please change dataset.path.train.image in ./implementations/PG2/stage[1|2].toml to match your dataset location.

train stage 1: python3 run.py PG2-1 --gpu_id 0 -c ./implementations/PG2/stage1.toml -o ./checkpoints/PG2-1

train stage 2: python3 run.py PG2-2 -g 2 -c ./implementations/PG2/stage2.toml -o ./checkpoints/PG2-2

generate images: python3 run.py PG2-Generator -c ./implementations/PG2/stage2.toml -o ./generated_images/PG2 -g 3

generate a grid image as example: python3 tool/generate_grid.py -r ./generated_images -o images.jpg

(Image: PG2 results)

Mobile PG2

train stage 1: python3 run.py PG2-1 --gpu_id 0 -c ./implementations/PG2/mobile/stage1.toml -o ./checkpoints/mPG2-1 --mobilenet

train stage 2: python3 run.py PG2-2 -g 2 -c ./implementations/PG2/mobile/stage2.toml -o ./checkpoints/mPG2-2 --mobilenet

generate images: python3 run.py PG2-Generator -c ./implementations/PG2/mobile/stage2.toml -o ./generated_images/mPG2 -g 3 --mobilenet -t "generated_limit=100"

Pre-trained model

Please note that this is NOT the best result; it is merely decent, and it should not be used in academic papers.

Pre-trained models can be downloaded from Google Drive or Baidu Disk (extraction code: n9nj).

I also provide the TensorBoard log files.

# download and extract the files above
unzip weights.zip
# generate all test images
python3 run.py PG2-Generator -c ./implementations/PG2/stage2.toml -o ./generated_images -g 3  -t "model.generator1.pretrained_path='path/to/weights/G1.pth'" -t "model.generator2.pretrained_path='path/to/weights/G2.pth'"
# randomly select some images to display
python3 tool/generate_grid.py -r ./generated_images -o images.jpg
# see training logs and images.
tensorboard --logdir path/to/weights/train2 --port 8080

Evaluate

For fair comparisons, I simply copy and use the same evaluation code as previous works such as Deform, PG2, and PATN.

I recommend using Docker to evaluate the results, because the evaluation code relies on an outdated framework (TensorFlow 1.4.1).

Next:

  1. Build the Docker image with ./evaluate/Dockerfile
  2. Run the evaluation script
$ cd evaluate
$ docker build -t hpt_evaluate . 
$ # For users in China, you can build the Docker image like this:
$ docker build -t hpt_evaluate . --build-arg PIP_PYPI="https://pypi.tuna.tsinghua.edu.cn/simple"
$ cd ..
$ docker run -v $(pwd):/tmp -e NVIDIA_VISIBLE_DEVICES=0 -w /tmp --runtime=nvidia -it --rm hpt_evaluate:latest python evaluate/getMetrics_market.py

Or use the tensorflow/tensorflow:1.4.1-gpu-py3 image to evaluate inside a Docker bash session:

docker run -v $(pwd):/tmp -w /tmp --runtime=nvidia -it --rm tensorflow/tensorflow:1.4.1-gpu-py3 bash
# now in docker:
$ pip install scikit-image tqdm 
$ python evaluate/getMetrics_market.py
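These metric scripts report SSIM (and Inception Score) on Market-1501. As a rough illustration of the SSIM half, here is a sketch using a recent scikit-image (the Docker image's older version exposes the same metric as skimage.measure.compare_ssim). The directory names are made up, and getMetrics_market.py remains the authoritative script.

```python
# Rough SSIM illustration, not the actual getMetrics_market.py logic.
# Assumes generated and ground-truth images sit in two parallel folders
# with matching file names (both folder paths are hypothetical).
import os
import numpy as np
from skimage.io import imread
from skimage.metrics import structural_similarity  # scikit-image >= 0.19

def mean_ssim(generated_dir, target_dir):
    scores = []
    for name in sorted(os.listdir(generated_dir)):
        generated = imread(os.path.join(generated_dir, name))
        target = imread(os.path.join(target_dir, name))
        scores.append(structural_similarity(generated, target, channel_axis=-1))
    return float(np.mean(scores))

print(mean_ssim("./generated_images", "./ground_truth_images"))
```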

Thanks

Liqian Ma - PG2's TensorFlow implementation. Thanks for his patience. ( ̄▽ ̄)

@tengteng95 - Pose-Transfer, for its clear code structure and his great paper.
