Python match_words 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: franklin.seq.alignment

메소드/함수: match_words

hotexamples.com에서의 예제들: 2

Python match_words - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 franklin.seq.alignment.match_words에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: alignment_test.py 프로젝트: BioinformaticsArchive/franklin

    def test_forward_words():
        'It test that we can match words against in the same orientation'

        seq = 'gCACAggTGTGggTATAgg'
        seq = SeqWithQuality(seq=Seq(seq))

        result = match_words(seq, ['CACA', 'TATA', 'KK'])[0]
        assert result['query'] == seq

        #The match por CACA
        match = result['matches'][0]
        assert match['subject'] == 'CACA'
        assert match['start'] == 1
        assert match['end'] == 10
        assert len(match['match_parts']) == 2
        #the reverse match part
        assert match['match_parts'][1] == {'query_start':7,
                                           'query_end':10,
                                           'query_strand':1,
                                           'subject_start':0,
                                           'subject_end':3,
                                           'subject_strand':-1}

        #The match por TATA
        match = result['matches'][1]
        assert match['subject'] == 'TATA'
        assert match['start'] == 13
        assert match['end'] == 16
        assert len(match['match_parts']) == 2

        #No matches for KK
        assert len(result['matches']) == 2

예제 #2

파일 보기

파일: seq_cleaner.py 프로젝트: JoseBlanca/franklin

    def strip_words_by_matching(sequence):
        """It strips the given words from a sequence.

        It returns a striped sequence with the longest segment without the
        words.
        """
        if sequence is None:
            return None
        if not words:
            return sequence

        alignments = match_words(sequence, words)
        if not alignments:
            return sequence
        locations = _get_non_matched_locations(alignments)
        segments = _get_longest_non_matched_seq_region_limits(sequence, locations)
        if segments is None:
            return None
        segments = _get_non_matched_from_matched_locations([segments], len(sequence))
        _add_trim_segments(segments, sequence)
        return sequence