Python ProbeCoherence 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: charset_normalizer.probe_coherence

클래스/타입: ProbeCoherence

hotexamples.com에서의 예제들: 6

Python ProbeCoherence - 6개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 charset_normalizer.probe_coherence.ProbeCoherence에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

ProbeCoherence(6)

자주 사용되는 메소드들

ProbeCoherence (6)

예제 #1

파일 보기

파일: test_probe_coherence.py 프로젝트: jayvdb/charset_normalizer

    def test_obvious_coherence_gap(self):

        should_be_most_coherent = CharsetNormalizerMatches.from_path(
            './data/sample.1.ar.srt').best().first().coherence

        with open('./data/sample.1.ar.srt', 'r',
                  encoding='mac_cyrillic') as fp:
            r_ = ProbeCoherence(HashableCounter(fp.read())).ratio

        with open('./data/sample.1.ar.srt', 'r', encoding='cp1251') as fp:
            t_ = ProbeCoherence(HashableCounter(fp.read())).ratio

        self.assertLess(should_be_most_coherent, r_)

        self.assertLess(should_be_most_coherent, t_)

예제 #2

파일 보기

 def languages(self):
     """
     Return a list of probable language in text
     :return: List of language
     :rtype: list[str]
     """
     return ProbeCoherence(self.char_counter).most_likely

예제 #3

파일 보기

파일: normalizer.py 프로젝트: jayvdb/charset_normalizer

 def language(self):
     """
     Return the most probable language found in text
     :return: Most used/probable language in text
     :rtype: str
     """
     languages = ProbeCoherence(self.char_counter).most_likely
     return languages[0] if len(languages) > 0 else 'Unknown'

예제 #4

파일 보기

 def coherence(self):
     """
     Return a value between 0. and 1.
     Closest to 0. means that the initial string is considered coherent,
     Closest to 1. means that the initial string SEEMS NOT coherent.
     :return: Ratio as floating number
     :rtype: float
     """
     return ProbeCoherence(self.char_counter).ratio

예제 #5

파일 보기

    def language(self):
        """
        Return the most probable language found in text
        :return: Most used/probable language in text
        :rtype: str
        """
        probe_coherence = ProbeCoherence(self.char_counter)
        languages = probe_coherence.most_likely

        if len(languages) == 0:
            return 'English' if len(self.alphabets) == 1 and self.alphabets[0] == 'Basic Latin' else 'Unknown'

        return languages[0]

예제 #6

파일 보기

 def coherence_non_latin(self):
     return ProbeCoherence(self.char_counter).non_latin_covered_any