Skip to content

edajeda/bcbb

 
 

Repository files navigation

Collection of useful code related to biological analysis. Much of this is discussed with examples at Blue collar bioinformatics.

Some projects which may be especially interesting:

  • CloudBioLinux -- An automated environment to install useful biological software and libraries. This is used to bootstrap blank machines, such as those you'd find on Cloud providers like Amazon, to ready to go analysis workstations. See the CloudBioLinux effort for more details. This project moved to it's own repository at https://github.com/chapmanb/cloudbiolinux.
  • gff -- A GFF parsing library in Python, aimed for inclusion into Biopython.
  • nextgen -- Automated analysis pipeline for processing next generation sequencing data. This is tightly integrated with the Galaxy web framework.
  • distblast -- A distributed BLAST analysis running for identifying best hits in a wide variety of organisms for downstream phylogenetic analyses. The code is generalized to run on local multi-processor and distributed Hadoop clusters.

About

Useful bioinformatics code, primarily in Python and R

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published