Python NGS.prune示例

编程语言: Python

命名空间/包名称: pyphylogenomics

类/类型: NGS

方法/功能: prune

hotexamples.com的示例: 2

Python NGS.prune - 已找到2个示例。这些是从开源项目中提取的最受好评的pyphylogenomics.NGS.prune现实Python示例。您可以评价示例，以帮助我们提高示例质量。

常用方法

显示隐藏

filter_reads(2)

parse_blast_results(2)

prepare_data(2)

prune(2)

split_ionfile_by_results(2)

find_index_in_seq(1)

示例#1

显示文件

文件： test_NGS.py 项目： mezarino/PyPhyloGenomics

    def test_prune(self):
        folder = "NGS"

        blast_data = []
        f = open("NGS/blast_data.csv", "r")
        tmp = f.readlines()
        f.close()
        for i in tmp:
            blast_data.append(i.strip())

        seq_record = SeqIO.parse("NGS/seq_record.fastq", "fastq")

        ion_id = "3856"
        min_aln_length = "40"

        result = NGS.prune(folder, blast_data, seq_record, ion_id,
                            min_aln_length)
        # It should drop on seq_record from the blast_data
        self.assertEqual(len(result), 998)

示例#2

显示文件

文件： NGS.py 项目： mezarino/PyPhyloGenomics

def filter_reads(ion_chunk, blast_chunk, folder):
    from Bio import SeqIO;
    from pyphylogenomics import NGS
    '''
    \* *Internal function* \*

    Accepting alignment lengths higher than 40 bp
    longer than our primer lengths
    '''
    min_aln_length = 40;

    blast_file = open(blast_chunk, "r");
    tmp = blast_file.readlines();
    blast_file.close();

    blast_data = []
    for i in tmp:
        blast_data.append(i.strip())
        

    # iterate over ion torrent reads
    for seq_record in SeqIO.parse(ion_chunk, "fastq"):
        if len(blast_data) > 0:
            #print "\n\nNew record--------------------"
            #print "seq record id @%s" % seq_record.id
            # avoid processing seq_records that are not in blast file
            # first id in blast_data
            #print blast_data
            first_id_in_blast_data = blast_data[0].split(",")[0]
            #print "fist id in blast_data %s" % first_id_in_blast_data

            if int(seq_record.id) >= int(first_id_in_blast_data):
                #if str(seq_record.id) == ion_id and aln_length > min_aln_length:
                if str(seq_record.id) == first_id_in_blast_data:
                    #print "prune"
                    blast_data = NGS.prune(folder, blast_data, seq_record,
                                first_id_in_blast_data, min_aln_length)
                else:
                    break