Python match_words Examples

Programming language: Python

Namespace/package name: franklin.seq.alignment

Method/function: match_words

Examples at hotexamples.com: 2

Python match_words: 2 examples found. These are the top-rated real-world Python examples of franklin.seq.alignment.match_words, extracted from open source projects.

Example #1

File: alignment_test.py Project: BioinformaticsArchive/franklin

    from franklin.seq.alignment import match_words
    # SeqWithQuality and Seq are franklin sequence classes; their import
    # lines are not part of this excerpt.

    def test_forward_words():
        """It tests that we can match words in the same orientation."""

        seq = 'gCACAggTGTGggTATAgg'
        seq = SeqWithQuality(seq=Seq(seq))

        # match_words returns one result per query; take the first one
        result = match_words(seq, ['CACA', 'TATA', 'KK'])[0]
        assert result['query'] == seq

        # The match for CACA
        match = result['matches'][0]
        assert match['subject'] == 'CACA'
        assert match['start'] == 1
        assert match['end'] == 10
        assert len(match['match_parts']) == 2
        # The reverse match part (TGTG is the reverse complement of CACA)
        assert match['match_parts'][1] == {'query_start': 7,
                                           'query_end': 10,
                                           'query_strand': 1,
                                           'subject_start': 0,
                                           'subject_end': 3,
                                           'subject_strand': -1}

        # The match for TATA
        match = result['matches'][1]
        assert match['subject'] == 'TATA'
        assert match['start'] == 13
        assert match['end'] == 16
        assert len(match['match_parts']) == 2

        # No matches for KK, so only two words produced matches
        assert len(result['matches']) == 2
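
The assertions above suggest the shape of the structure that match_words returns: one entry per matched word, with overall `start`/`end` coordinates and a list of `match_parts`. The sketch below is a pure-Python stand-in that reproduces that shape for the test sequence. It is not franklin's implementation: `find_word_hits` and `reverse_complement` are hypothetical names, the search is a plain exact-string scan on both strands, and Python 3 is assumed.

```python
# Hypothetical stand-in, NOT franklin's match_words: exact string search only.
_COMPLEMENT = str.maketrans('ACGT', 'TGCA')


def reverse_complement(word):
    """Reverse-complement a DNA word; non-ACGT symbols are left unchanged."""
    return word.upper().translate(_COMPLEMENT)[::-1]


def find_word_hits(seq, words):
    """Return one dict per word that occurs in seq, on either strand.

    Coordinates are 0-based and inclusive, mirroring the values asserted
    in the test above.
    """
    seq = seq.upper()
    matches = []
    for word in words:
        parts = []
        # Forward hits first, then hits of the reverse complement.
        for strand, pattern in ((1, word.upper()),
                                (-1, reverse_complement(word))):
            pos = seq.find(pattern)
            while pos != -1:
                parts.append({'query_start': pos,
                              'query_end': pos + len(pattern) - 1,
                              'query_strand': 1,
                              'subject_start': 0,
                              'subject_end': len(pattern) - 1,
                              'subject_strand': strand})
                pos = seq.find(pattern, pos + 1)
        if parts:
            matches.append({'subject': word,
                            'start': min(p['query_start'] for p in parts),
                            'end': max(p['query_end'] for p in parts),
                            'match_parts': parts})
    return matches


hits = find_word_hits('gCACAggTGTGggTATAgg', ['CACA', 'TATA', 'KK'])
print([(m['subject'], m['start'], m['end']) for m in hits])
# [('CACA', 1, 10), ('TATA', 13, 16)]
```

TATA is its own reverse complement, so it is reported once per strand at the same position, which is consistent with the two match parts the test expects for it.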

Example #2

File: seq_cleaner.py Project: JoseBlanca/franklin

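    from franklin.seq.alignment import match_words

    # NOTE: `words` is not defined in this excerpt; it is presumably bound
    # in an enclosing scope that is not shown here.
    def strip_words_by_matching(sequence):
        """It strips the given words from a sequence.

        It returns a stripped sequence with the longest segment without the
        words.
        """
        if sequence is None:
            return None
        if not words:
            return sequence

        alignments = match_words(sequence, words)
        if not alignments:
            # Nothing matched, so there is nothing to strip
            return sequence
        # Find the regions not covered by any word match
        locations = _get_non_matched_locations(alignments)
        segments = _get_longest_non_matched_seq_region_limits(sequence, locations)
        if segments is None:
            return None
        # Convert the kept region into the segments that have to be trimmed
        segments = _get_non_matched_from_matched_locations([segments], len(sequence))
        _add_trim_segments(segments, sequence)
        return sequence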
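
The docstring above describes keeping "the longest segment without the words". The private helpers that do this are not shown in the excerpt, so the following is only a minimal sketch of that idea under stated assumptions: `longest_unmatched_segment` is a hypothetical function, matched regions are given as 0-based inclusive `(start, end)` pairs, and ties are resolved in favor of the leftmost segment.

```python
def longest_unmatched_segment(matched, seq_len):
    """Return the (start, end) limits of the longest region of a sequence
    of length seq_len that no matched interval covers, or None if the
    whole sequence is covered. Limits are 0-based and inclusive.
    """
    best = None
    cursor = 0  # first position not yet known to be covered

    def consider(candidate, current_best):
        # Keep the longer of the two segments; the earlier one wins ties.
        if current_best is None:
            return candidate
        if candidate[1] - candidate[0] > current_best[1] - current_best[0]:
            return candidate
        return current_best

    for start, end in sorted(matched):
        if start > cursor:
            best = consider((cursor, start - 1), best)
        cursor = max(cursor, end + 1)

    # Tail of the sequence after the last matched interval
    if cursor < seq_len:
        best = consider((cursor, seq_len - 1), best)
    return best


# Matched regions taken from Example #1: CACA at 1-10, TATA at 13-16,
# on a 19-base sequence.
print(longest_unmatched_segment([(1, 10), (13, 16)], 19))
# (11, 12)
```

Returning None when every position is covered mirrors the `if segments is None: return None` branch in the function above, although whether the original helper behaves the same way is an assumption.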