Project035 analysis pipeline

This pipeline chains together the tasks required to analyse the RNA-seq and small RNA-seq datasets used in project035. It is not a general-purpose pipeline, so some variables are hard-coded. It contains the custom Python and R scripts written for the project. It relies heavily on ruffus and the CGAT code collection; the latter is available on GitHub: https://github.com/CGATOxford/cgat

Third-party tool requirements (including R packages):

DESeq2
gplots
RColorBrewer
ggplot2
featureCounts v1.4.6

Read QC and mapping were carried out beforehand using pipeline_readqc.py and pipeline_mapping.py, both found in the CGATPipelines repo: https://github.com/CGATOxford/CGATPipelines

### Input

The pipeline expects compressed alignment files (.bam), with files named according to the convention: condition-tissue-replicate.file_suffix
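To make the naming convention concrete, here is a minimal Python sketch that splits a file name into its condition, tissue and replicate fields. It is not taken from the pipeline source: the helper `parse_sample_name`, the `SampleName` type and the example file name `control-liver-R1.bam` are all hypothetical, and the sketch assumes that none of the three fields contains a hyphen or a dot.

```python
import os
from typing import NamedTuple


class SampleName(NamedTuple):
    """Fields encoded in a file name (hypothetical helper type)."""
    condition: str
    tissue: str
    replicate: str
    suffix: str


def parse_sample_name(path: str) -> SampleName:
    """Split 'condition-tissue-replicate.file_suffix' into its parts.

    Assumes the fields themselves contain no '-' or '.' characters.
    """
    filename = os.path.basename(path)
    # Everything after the first dot is treated as the file suffix.
    stem, _, suffix = filename.partition(".")
    parts = stem.split("-")
    if len(parts) != 3 or not all(parts) or not suffix:
        raise ValueError(
            f"{filename!r} does not match condition-tissue-replicate.file_suffix"
        )
    return SampleName(parts[0], parts[1], parts[2], suffix)


if __name__ == "__main__":
    # Hypothetical example file name, for illustration only.
    print(parse_sample_name("data/control-liver-R1.bam"))
```

A check like this can be run over the input directory before starting the pipeline, so that misnamed files are caught early.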

It also requires a GTF file of the annotations of interest. These were either downloaded from Ensembl and processed directly using the CGATPipelines script pipeline_annotations.py (protein-coding genes and miRNAs), or converted from .bed format using the script bed2gff.py in the CGAT scripts repo.
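For readers unfamiliar with the two formats, the sketch below shows what a BED-to-GTF conversion involves. It is an illustration only and does not reproduce the behaviour of bed2gff.py: the function `bed_to_gtf_line`, the `source` and `feature` defaults, and the choice to reuse the BED name as both `gene_id` and `transcript_id` are assumptions made here. The one fixed rule is the coordinate shift: BED starts are 0-based and half-open, whereas GTF coordinates are 1-based and inclusive.

```python
def bed_to_gtf_line(bed_line: str, source: str = "bed", feature: str = "exon") -> str:
    """Convert one tab-separated BED record to a GTF record (illustrative sketch)."""
    fields = bed_line.rstrip("\n").split("\t")
    if len(fields) < 3:
        raise ValueError("a BED record needs at least chrom, start and end")

    chrom = fields[0]
    start = int(fields[1])
    end = int(fields[2])

    # Optional BED columns; fall back to placeholders when they are absent.
    name = fields[3] if len(fields) > 3 else f"{chrom}:{start}-{end}"
    score = fields[4] if len(fields) > 4 else "."
    strand = fields[5] if len(fields) > 5 else "."

    # Assumption: the BED name doubles as gene_id and transcript_id.
    attributes = f'gene_id "{name}"; transcript_id "{name}";'

    # BED start is 0-based, GTF start is 1-based; the end coordinate is unchanged.
    return "\t".join(
        [chrom, source, feature, str(start + 1), str(end), score, strand, ".", attributes]
    )


if __name__ == "__main__":
    # Hypothetical record, for illustration only.
    print(bed_to_gtf_line("chr1\t99\t200\tfeat1\t0\t+"))
```

The `exon` feature type and `gene_id` attribute are used here because they are what featureCounts looks for by default; adjust them if the counting step is configured differently.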

For pipeline use instructions see the wiki.
