Python Unigram 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: resources.unigram

클래스/타입: Unigram

hotexamples.com에서의 예제들: 2

Python Unigram - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 resources.unigram.Unigram에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

add(2)

get_count(2)

get_size(2)

get_sum(1)

get_tokens(1)

get_value(1)

예제 #1

파일 보기

파일: repetition.py 프로젝트: drammock/sppas

    def relevancy(self, inputtier):
        """
        Add very frequent tokens in a copy of the stopwords list.
        Return a WordsList instance

        Estimate the relevance of each term by using the number of
        occurrences of this term in the input and compare this value
        to a threshold, to add the term (or not) in the stopwords list.

        @param inputtier (Tier)

        """
        l = self.stopwords.copy()

        # Create the Unigram and put data
        u = Unigram()
        for a in inputtier:
            if a.GetLabel().IsSpeech() is True:
                u.add( a.GetLabel().GetValue() )

        # Estimate if a token is relevant, put in the stoplist
        for token in u.get_tokens():
            freq  = u.get_value(token)
            proba = float(freq) / float(u.get_sum())
            relevant = 1.0 / (float(u.get_size())*float(self._alpha))
            if proba > relevant:
                l.add( token )
                if self.logfile is not None:
                    self.logfile.print_message('Add in the stoplist: '+token, indent=3)
                elif DEBUG is True:
                    print(' ... ... ... Add in the stoplist: '+token.encode('utf8'))

        return l

예제 #2

파일 보기

파일: test_dict.py 프로젝트: brigittebigi/sppas

 def test_unigram(self):
     gram = Unigram()
     gram.add( 'a' )
     self.assertEqual( gram.get_size(), 1)
     self.assertEqual( gram.get_count('a'), 1)
     gram.add( 'a' )
     self.assertEqual( gram.get_size(), 1)
     self.assertEqual( gram.get_count('a'), 2)
     gram.add( 'a',3 )
     self.assertEqual( gram.get_size(), 1)
     self.assertEqual( gram.get_count('a'), 5)