Chanjo is coverage analysis for clinical sequencing. It's implemented in Python with a command line interface that adheres to UNIX pipeline philosophy.
Chanjo is distributed through pip
. Install the latest stable release by
running:
pip install chanjo
... or locally for development:
git clone https://github.com/Clinical-Genomics/chanjo.git
cd chanjo
conda install --channel bioconda sambamba
pip install -r requirements-dev.txt --editable .
Chanjo exposes a decomposable command line interface with a nifty config file implementation.
chanjo init --setup
chanjo load /path/to/sambamba.output.bed
chanjo calculate mean
{"metrics": {"completeness_10": 90.92, "mean_coverage": 193.85}, "sample_id": "sample1"}
Read the Docs is hosting the official documentation.
If you are looking to learn more about handling sequence coverage data in clinical sequencing, feel free to download and skim through my own Master's thesis and article references.
Chanjo leverages Sambamba to annotate coverage and completeness for a general BED-file. The output can then easily to loaded into a SQL database that enables investigation of coverage across regions and samples. The database also works as an API to downstream tools like the Chanjo Coverage Report generator.
Chanjo is not the right choice if you care about coverage for every base across the entire genome. Detailed histograms is something BEDTools already handles with confidence.
- Robin Andeer (robinandeer)
- Luca Beltrame (lbeltrame)
- John Kern (kern3020)
- Måns Magnusson (moonso)
- Patrik Grenfeldt (patrikgrenfeldt)
MIT. See the LICENSE file for more details.
Anyone can help make this project better - read CONTRIBUTION to get started!