protein-dimer-inpainting

Installation

Clone this repo.

git clone https://github.com/huangh0408/protein-dimer-inpainting.git

Prerequisites

Python 2.7
tensorflow-gpu 1.12.0
ipdb
opencv-python
glob
cPickle

Datasets

train set and validation set

We use 3dcomplex and masif datasets.

Independent test set

We use gremlin, evcomplex,benchmark 5.0 and casp-capri datasets.

Your own datasets

Please prepare the pdb file and the corresponding chain file, which are required by our scripts to generate inter-protein and intra-protein contact/distance map.

# To generate the whole contact/distance map.
cd generate_contact_map
bash work.sh

Training New Models

on our datasets

There are three folders to present three kinds of datasets respectively. You can download the data here.

# To train on the dataset. Notice that you should modify the input file directory and checkpoint directory in the work_train.sh file.
bash work_train.sh

on your own datasets

# To train on the you dataset, for example.
python train.py --input_dir[the path of original images] --mode=[contact distance slice] --netsize[128 256 512]

There are many options you can specify. Please use python train.py --help or see the options

Pre-trained weights and test model

There are three folders to present pre-trained for three kinds of datasets respectively. You can download the pre-trained model here.

testing

# To test on the dataset. Notice that you should modify the test set directory and checkpoint directory in the work_test.sh file.
bash work_test.sh

Evaluation

We calculate the precision,which is defined as TP/N. Such as Top 5, 10, 20, L/10, L/5, L/2, L used in the intra-protein contact map prediction. For the overall results, we calculate the mean precision. Additionaly, we calculate the success rate, which is defined the percentage of the targets with at least one successfully predicted contact when a certain number of predicted contacts are considered, compared to all the targets in the test set.

Precision & Success rate

# To evaluate on the dataset. Notice that you should modify the output file directory and groundtruth file directory in th work_evaluate.sh file.
bash work_evaluate.sh

Different Version

scripts_version1.0

original scripts without mask

scripts_version2.0

scripts with region mask

scripts_version3.0

implement in environment python 3.7

scripts_demo

jupyter-notebook to test our model

Citation

If you use this code for your research, please cite our papers.

@article{huang2021,
  title={Generate inter-protein contact map by image inpainting},
  author={He Huang, Chengshi Zeng, Xinqi Gong},
  journal={In Preparation},
  year={2021}
}

Acknowledgments

Our inpainting codes refer to Inpainting and the readme.md file refers to Rethinking-Inpainting-MEDFE.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
generate_contact_map		generate_contact_map
material		material
scripts_demo		scripts_demo
scripts_version1.0		scripts_version1.0
scripts_version2.0		scripts_version2.0
scripts_version3.0		scripts_version3.0
README.md		README.md

MIALAB-RUC/protein-dimer-inpainting

Folders and files

Latest commit

History

Repository files navigation