Skip to content

haddocking/whiscy

Repository files navigation

WHISCY - WHat Information does Surface Conservation Yield?

Downloads PyPI - License PyPI - Status PyPI - Python Version ci Codacy Badge Codacy Badge fair-software.eu

WHISCY

WHISCY is a program to predict protein-protein interfaces.

It is primarily based on conservation, but it also takes into account structural information. A sequence alignment is used to calculate a prediction score for each surface residue of your protein.

This repository contains an updated version of WHISCY. The original code was published in the following paper:

Table of contents

How does WHISCY work?

WHISCY requires a protein structure and a sequence alignment. First, it identifies a master sequence, the sequence that best matches the structure.

The sequence distance (amount of mutation) between the master sequence and all sequences is estimated. This determines the amount of expected mutation.

Then, for each residue, the expected mutation is compared with the observed mutation. Less change than expected means conservation, translated into a positive WHISCY score.

compare

Next, the interface propensity is taken into account.

Phenylalanines, for example, are likely to be in a protein-protein interface, so all phenylalanines receive a higher score. Lysines are much less likely to be in a protein-protein interface, so lysines receive a lower score.

Finally, all scores are smoothed over the surface of the protein structure.

Interfaces often form patches, so that neighbours of interface residues often are interface residues, too. The smoothing means that the scores of these neighbours are taken into account.

interface

Installation

Please refer to INSTALLATION.md for a detailed guide on how to install WHISCY in macOS and GNU/Linux.

Usage

Check USAGE.md to learn how to use WHISCY to predict protein-protein interfaces.