Python BLAST.eraseFalsePosi Examples

Programming Language: Python

Namespace/Package Name: pyphylogenomics

Class/Type: BLAST

Method/Function: eraseFalsePosi

Examples at hotexamples.com: 3

Python BLAST.eraseFalsePosi - 3 examples found. These are the top-rated real-world Python examples of pyphylogenomics.BLAST.eraseFalsePosi extracted from open source projects.

Frequently Used Methods

blastn(8)

getLargestExon(4)

storeExonsInFrame(4)

blastParser(3)

eraseFalsePosi(3)

get_cds(2)

makeblastdb(2)

wellSeparatedExons(2)

do_blast(1)

filterByMinDist(1)

Example #1

File: test_BLAST.py Project: carlosp420/PyPhyloGenomics

def test_eraseFalsePosi(self):
    exons = BLAST.getLargestExon(self.cwd + "/BLAST/query_blastn_out.csv", E_value=0.001, ident=98, exon_len=300)
    exons = BLAST.eraseFalsePosi(exons)
    result = len(exons)
    self.assertEqual(result, 3)
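
The test above only checks how many entries survive the filter. As a rough illustration of the idea described in Example #2 (keeping the longest match and dropping shorter, presumably spurious ones), here is a minimal sketch. It is not PyPhyloGenomics' actual implementation: the function name `keep_longest_match` and the list-of-tuples input are hypothetical, chosen only to make the example self-contained.

```python
def keep_longest_match(hits):
    """Keep, for each query, only the hit with the longest alignment.

    `hits` is a hypothetical list of
    (query_id, subject_id, alignment_length) tuples; the real
    library may store BLAST hits in a different structure.
    """
    best = {}
    for query_id, subject_id, length in hits:
        current = best.get(query_id)
        # Replace the stored hit only when this one is strictly longer.
        if current is None or length > current[1]:
            best[query_id] = (subject_id, length)
    return best


# Made-up hits: two queries have a second, shorter match.
hits = [
    ("gene_a", "scaffold_1", 450),
    ("gene_a", "scaffold_7", 310),
    ("gene_b", "scaffold_2", 600),
    ("gene_c", "scaffold_3", 320),
    ("gene_c", "scaffold_3", 900),
]
filtered = keep_longest_match(hits)
print(len(filtered))  # 3, one entry per query
```

Keying the result by query ID means each candidate gene ends up with at most one exon, which is why the count drops from five hits to three here.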

Example #2

File: gene_search_blast_filtering_exons.py Project: carlosp420/PyPhyloGenomics_ms

from pyphylogenomics import BLAST
from pyphylogenomics import MUSCLE


"""
As stated before, we prefer long exons for each of the candidate genes
(> 300 nucleotides):
"""
exons = BLAST.getLargestExon("data/pulled_seqs_blastn_out.csv", E_value=0.001, ident=98, exon_len=300)


"""
Some short sequence segments might be similar to non-homologous regions of
the genome. We will use the function eraseFalsePosi to keep only the longest
matches:
"""
exons = BLAST.eraseFalsePosi(exons) # Drop presumable false positives.


"""
Ideally we want exons that are not too close to each other in the genome, to
avoid gene linkage. So we will keep only those exons that are more than 810
kilobases apart:
"""
exons = BLAST.wellSeparatedExons(exons) # Keep exons separated by > 810KB


"""
Finally we can use a function to save the obtained exons while making sure they
are in frame. We need to use as additional arguments the genome file and output
filename:
"""
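
The separation step described above (keeping only exons more than 810 kilobases apart) can be pictured with a small greedy filter. This is a sketch under stated assumptions, not the code behind `BLAST.wellSeparatedExons`: the function `well_separated`, the `(name, scaffold, start, end)` tuple layout, and the choice to compare each exon against the last kept exon on the same scaffold are all hypothetical.

```python
MIN_DISTANCE = 810000  # 810 kilobases, expressed in base pairs


def well_separated(exons, min_distance=MIN_DISTANCE):
    """Greedily keep exons that lie far enough from the last kept one.

    `exons` is a hypothetical list of (name, scaffold, start, end)
    tuples. Exons on different scaffolds are never compared.
    """
    kept = []
    last_end = {}  # scaffold -> end coordinate of the last kept exon
    # Walk each scaffold from left to right.
    for name, scaffold, start, end in sorted(exons, key=lambda e: (e[1], e[2])):
        previous_end = last_end.get(scaffold)
        if previous_end is None or start - previous_end > min_distance:
            kept.append(name)
            last_end[scaffold] = end
    return kept


# Made-up coordinates for illustration only.
exons = [
    ("exon_1", "scaffold_1", 1000, 1400),
    ("exon_2", "scaffold_1", 500000, 500350),  # too close to exon_1
    ("exon_3", "scaffold_1", 900000, 900500),
    ("exon_4", "scaffold_2", 2000, 2600),
]
selected = well_separated(exons)
print(selected)  # ['exon_1', 'exon_3', 'exon_4']
```

A greedy left-to-right pass is the simplest choice; it does not guarantee the largest possible set of well-separated exons, and the library may use a different strategy.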

Example #3

File: search_genes_from_Bmori.py Project: carlosp420/dy_genome

from pyphylogenomics import OrthoDB
from pyphylogenomics import BLAST

in_file = 'grefs/OrthoDB7_Arthropoda_tabtext'
genes = OrthoDB.single_copy_genes(in_file, 'Bombyx mori')
cds_file = 'grefs/silkcds.fa'
BLAST.get_cds(genes, cds_file)
BLAST.blastn('pulled_seqs.fasta', 'grefs/silkgenome.fa')
exons = BLAST.getLargestExon("pulled_seqs_blastn_out.csv", E_value=0.001, ident=98, exon_len=300)
exons = BLAST.eraseFalsePosi(exons)
BLAST.storeExonsInFrame(exons, "pulled_seqs.fasta", "grefs/Bombyx_exons.fas")