Skip to content

rlabduke/suitename

Repository files navigation

suitename

Suitename - a tool for classifying backbone "suite" conformations (linkages between ribose sugars) in an RNA molecule.

Obsolete

This repository contains obsolete code that is no longer maintained as of August 2021. SuiteName development has been moved into the CCTBX code base and it can be run from a CCTBX installation as phenix.suitename or molprobity.suitename. Its location within that code base is here.

Requirements

Suitename requires Python 3 or later and it requires numpy to have been installed.

Running

Suitename is integrated into the CCTBX library and can be run from within it.

Suitename can also be run from the command line using a Python 3 interpreter: python3 suitename.py [arguments]

Suitename takes an input file as its first argument, or if none provided, reads from standard input. It writes its results to standard output. It classifies the conformation of each suite into one of several dozen predefined clusters that have been determined by years of study.

Two forms of input are supported:

  1. A list of the 6 dihedral angles $/alpha$, $/beta$, $/gamma$, $/delta$, $/epsilon$, $/zeta$) for each residue of the RNA molecule. Suitename will re-parse the dihedral angles in the residues to obtain the 7 dihedral angles in each suite ($/delta/-1, $/epsilon/-1, $/zeta/-1, $/alpha$, $/beta$, $/gamma$, $/delta$), and then operate on those. This is the default input.

    Each line of this format describes one residue, with fields separated by colons. The first several (default 6) are ID information, the remainder are angles. Sample: 1: A: 7: : : U:-75.533:-154.742:48.162:80.895:-148.423:-159.688

  2. A kinemage file providing a list of 7 or 9 dihedral angles in each suite. Mark this by using the --suitein command line flag. 9 angles include $/chi$-1 and $/chi$.

Three forms of output are supported:

  1. A text report, showing the classification of each suite into a cluster or outlier, and the neatness of fit ("suiteness") of the suite into that cluster. Suiteness is the cosine of the normalized distance of a suite datapoint from the power 3 hyperellipsoid boundary of the cluster in toward its center. A statistical summary is included at the end. This is the default output format.
  2. A kinemage file, which will display a 7D data point for each suite in the data. Points are color-coded according to the clusters to which they have been assigned. Each cluster is displayed as a colored ring surrounding its defined center. Specify this by using the --kinemage command line flag.
  3. A brief string showing only the cluster assignments. It consists of three characters per suite - base identity (uc) and 2-character number-letter name of the suite cluster (e.g., C1aG1gU1aA1aA1cG). Specify this by using the --string command line flag.

Many other command line options are available; type python3 suitename.py --help to display them.

Directories

C Original C implementation of SuiteName along with build files to compile into a running executable. The Conversion subdirectory contains tools that were used during the re-implementation in Python in 2021.

command_line This is a directory that is searched by CCTBX when it is building SuiteName as a module. It includes a Python script that has specially-formatted comments at its top to describe how SuiteName should be run within CCTBX/PHENIX. It basically runs the suitename.py script in the root directory and passes any arguments along to it.

About

Categorize each suite in an RNA backbone as belonging to either one of 53 defined types or as outlier

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages