Python breakup_word 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: miro.ngrams

메소드/함수: breakup_word

hotexamples.com에서의 예제들: 8

Python breakup_word - 8개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 miro.ngrams.breakup_word에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: searchtest.py 프로젝트: nxmirrors/miro

 def test_memory(self):
     # make sure we aren't leaking memory in our C module
     gc.collect()
     start_count = len(gc.get_objects())
     results = ngrams.breakup_list(['foo', 'bar', 'bazbaz'], 1, 3)
     results2 = ngrams.breakup_word('miroiscool', 1, 3)
     del results
     del results2
     gc.collect()
     end_count = len(gc.get_objects())
     self.assertEquals(start_count, end_count)

예제 #2

파일 보기

파일: search.py 프로젝트: nxmirrors/miro

def _ngrams_for_term(term):
    """Given a term, return a list of N-grams that we should search for.

    If the term is shorter than NGRAM_MAX, this is just the term itself.
    If it's longer, we split it up into a bunch of N-grams to search for.
    """
    if len(term) <= NGRAM_MAX:
        return [term]
    else:
        # Note that we only need to use the longest N-grams, since shorter
        # N-grams will just be substrings of those.
        return ngrams.breakup_word(term, NGRAM_MAX, NGRAM_MAX)

예제 #3

파일 보기

    def _ngrams_for_term(self, term):
        """Given a term, return a list of N-grams that we should search for.

        If the term is shorter than NGRAM_MAX, this is just the term itself.
        If it's longer, we split it up into a bunch of N-grams to search for.
        """
        if len(term) <= NGRAM_MAX:
            return [term]
        else:
            # Note that we only need to use the longest N-grams, since shorter
            # N-grams will just be substrings of those.
            return ngrams.breakup_word(term, NGRAM_MAX, NGRAM_MAX)

예제 #4

파일 보기

def _ngrams_for_term(term):
    """Given a term, return a list of N-grams that we should search for.

    If the term is shorter than NGRAM_MAX, this is just the term itself.
    If it's longer, we split it up into a bunch of N-grams to search for.
    """
    if len(term) < NGRAM_MIN:
        # term is shorter than our smallest ngrams, return an empty list,
        # which causes us to match everything
        return []
    elif len(term) <= NGRAM_MAX:
        # normal case, search for term in using the N-grams we've calculated
        return [term]
    else:
        # term is longer than our longest N-grams, try the best we can using
        # substrings of term.  We only need to use the longest N-grams, since
        # shorter N-grams will just be substrings of those.
        return ngrams.breakup_word(term, NGRAM_MAX, NGRAM_MAX)

예제 #5

파일 보기

파일: search.py 프로젝트: ktan2020/miro

def _ngrams_for_term(term):
    """Given a term, return a list of N-grams that we should search for.

    If the term is shorter than NGRAM_MAX, this is just the term itself.
    If it's longer, we split it up into a bunch of N-grams to search for.
    """
    if len(term) < NGRAM_MIN:
        # term is shorter than our smallest ngrams, return an empty list,
        # which causes us to match everything
        return []
    elif len(term) <= NGRAM_MAX:
        # normal case, search for term in using the N-grams we've calculated
        return [term]
    else:
        # term is longer than our longest N-grams, try the best we can using
        # substrings of term.  We only need to use the longest N-grams, since
        # shorter N-grams will just be substrings of those.
        return ngrams.breakup_word(term, NGRAM_MAX, NGRAM_MAX)

예제 #6

파일 보기

파일: searchtest.py 프로젝트: ShriramK/miro

 def test_simple(self):
     results = ngrams.breakup_word('foobar', 2, 3)
     self.assertSameSet(results, [
         'fo', 'oo', 'ob', 'ba', 'ar',
         'foo', 'oob', 'oba', 'bar'])

예제 #7

파일 보기

파일: searchtest.py 프로젝트: kfatehi/miro

 def test_simple(self):
     results = ngrams.breakup_word("foobar", 2, 3)
     self.assertSameSet(results, ["fo", "oo", "ob", "ba", "ar", "foo", "oob", "oba", "bar"])

예제 #8

파일 보기

 def test_simple(self):
     results = ngrams.breakup_word('foobar', 2, 3)
     self.assertSameSet(
         results,
         ['fo', 'oo', 'ob', 'ba', 'ar', 'foo', 'oob', 'oba', 'bar'])