Biologists are interested in studying variation on a protein level. Does a mutation change an amino acid in a protein. Does the changed protein have the same function? ProteinMagicBlast answers these questions. All it needs is a list of SRA accessions and Refseq proteins. It reports differences between sequenced mRNAs and provided proteins.
The purpose of this approach is to provide answers to questions raised by scientific community in regard to changes in the protein sequences when compared with SRA data using Magic Blast tool and supporting software. The main question answered is given the genomic sequences if it has sequences related with a given protein and also if any changes in protein has occurred. The software will provide the protein alignments and changes in amino acids will be displayed on the viewer.
ProteinMagicBlast solves the problem of user when finding relationship of given sequence data with interested proteins. User wants to know if in the new genome(sequences) are interested protein sequences be found? if yes, are there any significant changes related with interested protein? Has the amino acid changed at any of the positions?
Amino acid point mutations (nsSNPs) may change protein structure and function. The second approach lets us identify the changes and the magnitude may vary depending on how similar or dissimilar the replaced amino acids are, as well as on their position in the sequence or the structure.
In the simplest form ProteinMagicBlast requires only two inputs: PROTEIN Accession and SRR Accession. For Example:
python webui/manage.py run server
supermagicblast.py -prot_accs NP_066251.1 -srr SRR5150787
The output is standard blastx output in json format.
We provide two options for installing ProteinMagicBlast: Docker or directly from Github.
The Docker image contains ProteinMagicBlast as well as a webserver and FTP server in case you want to deploy the FTP server. It does also contain a web server for testing the main website (but should only be used for debug purposes).
docker pull ncbihackathons/<this software>
command to pull the image from the DockerHubdocker run ncbihackathons/<this software>
Run the docker image from the master shell script- Edit the configuration files as below
Clone github repository:
git clone https://github.com/NCBI-Hackathons/ProteinMagicBlast.git
Install dependencies:
Python packages from requirements.txt file:
pip install -r requirements.txt
- Grzegorz M Boratyn
- Monika Bihan
- Karina Zile
- Yevhen I Krasichkov
- Vladislav I Evgeniev
- Jan Piotr Buchmann
- Expand it use on annotated whole genome sequences.
- Extending it to large protein databases.
- Expand it to take more than one SRR or Refseq accession input
- Improve the speed of the analysis, especially the assembly of the consensus sequence.