Skip to content

metagenomic Pan Genome Analysis Pipeline

Notifications You must be signed in to change notification settings

wangdi2014/metaPGAP

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

metaPGAP : metagenomic Pan Genome Analysis Pipeline

metaPGAP is a pipeline for building core genome using PAN genome approach. This pipeline takes complete or draft genome assemblies and perform annotation using Prokka. The Prokka predictions then used to build core genome using get_homologues tool. For core genes phylogeny Mafft and RaXML used.


Full list of required softwares and dependencies:

metaPGAP steps:

(1). Download data from https://github.com/mitul-patel/data/archive/master.zip
(2). Genome Annotation using Prokka
(3). PAN genome analysis using get_homologues: BDBH, COG and OMCL
(4). Multiple sequence alignment of CORE genes
(5). Phylogenetic analysis of CORE genes using RaXML
(6). Visualization of phylogenetic tree using Newick tools


metaPGAP requirements:

Python (http://python.org)
BioPython (1.5 or higher) with NumPy (http://biopython.org/wiki/Download)
Prokka (https://github.com/tseemann/prokka)
get_homologues (https://github.com/eead-csic-compbio/get_homologues)
Mafft (http://mafft.cbrc.jp/alignment/software/)
Perl (http://Perl.org)
AMAS (https://github.com/marekborowiec/AMAS)
RaXML (https://github.com/stamatak/standard-RAxML)
Newick tools (http://cegg.unige.ch/newick_utils)


Install metaPGAP

wget https://github.com/mitul-patel/metaPGAP/archive/master.zip
unzip metaPGAP-master && mv metaPGAP-master metaPGAP


Running metaPGAP

cd metaPGAP
python metaPGAP.py


About

metagenomic Pan Genome Analysis Pipeline

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C 45.3%
  • Perl 28.3%
  • Shell 17.2%
  • Makefile 3.5%
  • HTML 2.0%
  • C++ 0.9%
  • Other 2.8%