-
Notifications
You must be signed in to change notification settings - Fork 1
FilipDom/AppBio-Noise-Reduce
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
#-------# | USAGE | #-------# All scripts have been written so that they work correctly when called from the project root path, ie. the project folder. Otherwise, the relative path settings would be incorrect. The only exception to this rule are the Octave scripts used for statistical analysis which should be run from the folder in which they reside. All the scripts use relative paths so they can be run on another system without much effort. In order to run them, python needs to be installed on the system, and the path and enviromental variables need to be set so that a call such as "python <script_name>" works. Note, the scripts were not written on a unix/linux-like system, so they are not marked as executable. The scripts used for an individual experiment are in the results folder, under a particular experiment subfolder. If multiple scripts are found, then a runall and runcontrol scripts are provided. These scripts run all the data generation or control scripts respectively. WARNING: Some scripts take a long time to finisih. One such script is runtrees.py which calculates all the trees both for noisy and data with reduced noise. The names "infile", "outfile", "outtree" and "tmp_denoised.fa" should not be used as file names in the root directory. These file names are used by some scripts and files with those names will be overwritten and deleted. For this reason, no two scripts which depend upon these files should be invoked at the same time. #--------------# | DEPENDENCIES | #--------------# Python 2.7 was used to write the scripts and a compatible version needs to be available on the system. The path must also be set so that the "python" command may be invoked from the command line. The Biopython and DendroPy libraries also need to be installed. The noise reduction script requires just the Biopython library. Phylip is used to calculate a distance matrix for the mulitple sequence alignments and to form a tree using neighbor joining. The path must be set so that the commands "protdist" and "neighbor" can be invoked from the command line. #-------------# | ASSUMPTIONS | #-------------# In order for the program to work correctly, all input files should be valid multiple sequence alignments in fasta format. This means that there should be at least three sequences in each file and all sequences should be of the same length. Indels should be denoted by the character '-'. It is assumed that all the sequences are protein sequences.
About
Applied Bioinformatics project
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published