Skip to content

sorrywm/mafTools

 
 

Repository files navigation

mafTools

mafTools is a collection of tools that operate on Multiple Alignment Fomat (maf) files.

Authors

Dent Earl, Benedict Paten, Mark Diekhans

Dependencies

Installation

  1. Download the package. Consider making it a sibling directory to sonLib/.
  2. cd into the directory.
  3. Type make.

Components

  • mafBlockDuplicateFilter A program to filter alignment blocks to remove duplicate species. One sequence per species is allowed to remain, chosen by comparing the sequence to the consensus for the block and computing a similarity bit score between the IUPAC formatted consensus and the sequence. The highest scoreing duplicate stays, or in the case of ties, the sequence closest to the start of the file stays.
  • mafBlockExtractor A program to extract all alignment blocks that contain a region in a particular sequence. Useful for isolating regions of interest in large maf files.
  • mafBlockFinder A program to search for a position in a particular sequence. Useful for determining where in maf a particular part of the alignment resides.
  • mafBlockSorter A program to sort all of the blocks in a MAF based on the (absolute) start position of one of the sequences. Blocks without the sequence are placed at the start of the output in their original order.
  • mafComparator A program to compare two maf files by sampling. Useful when testing predicted alignments against known true alignments.
  • mafCoveragePickles A set of programs to assess the pairwise coverage between sequences and to extract the indel distribution of a set of sequences contained in the maf.
  • mafValidator A program to assess whether or not a given maf file's formatting is valid.

External tools

  • mafTools internal tests use Asim Jalis' CuTest C unit testing framework. The license for CuTest is spelled out in external/license.txt.
  • mafTools internal tests will use valgrind if installed on your system.

About

Bioinformatics tools for dealing with Multiple Alignment Format (MAF) files.

Resources

License

Stars

Watchers

Forks

Packages

No packages published