Skip to content

theboocock/ancient_dna_pipeline

Repository files navigation

ancient DNA pipeline.

Processes ancient DNA data that was generated using NGS.

This pipeline employs a best practices approach for taking the data from Raw FASTQ files and creates a number of useful output files.

Installation

Run ./install.sh but before running make sure the INSTALL_DIR variable in this script points to a directory that is located on your path.

You will need to change the hardcoded path in rscripts/coverage_script.R

to point to your version of the file rscripts/coverage_plot.R

Python

Python dependencies can be installed by running.

pip install -r requirements.txt

This will install pysam, PyVCF, pyfasta, and Biopython.

mapDamage

Navigate to src/mapDamage/ and follow all installation instructions at http://ginolhac.github.io/mapDamage/.

SeqMagick

Navigate to src/seqmagick/ and run python setup.py install

AdapterRemoval

Navigate to src/AdapterRemoval/ and run the following commands.

tar xvzf AdapterRemoval-1.5.4.tar.gz
make

Then ensure that the executable AdapterRemoval is on your path.

External dependencies.

Unix tools realpath tool and zcat

All the following executables must be installed, and accesible from your path.

Example Run of the pipeline.

Navigate into the tests/test_data/ directory and run the following commands.

# generate pipeline_setup.txt 
create_pipeline_setup.sh . raw
# run pipeline for human mtDNA. make sure you replace the path to the reference file. 
ancient_pipeline.sh -C "gi|251831106|ref|NC_012920.1|" \
-r ~/Programming/OpenSource/MyGitHub/ancient_dna_pipeline/ref/contamination.fa  \ 
-S "human"  -P 1

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published