Example #1
File: parser.py Project: msabramo/cahoots
    def parse(self, data_string):
        """Parses input data and returns a dict of result data"""

        start_time = time.time()
        results = []
        threads = []

        # Creating/starting a thread for each parser module
        for module in self.config.enabled_modules:
            thread = ParserThread(self.config, module, data_string)
            thread.start()
            threads.append(thread)

        # Synchronizing/finishing parser threads
        for thr in threads:
            thr.join()

        # The threads are done, let's get the results out of them
        for thr in threads:
            results.extend(thr.results)

        # Unique list of all major types
        types = list({result.type for result in results})

        if results:
            # Getting a unique list of result types.
            all_types = []
            for res in results:
                all_types.extend([res.type, res.subtype])

            # Hierarchical Confidence Normalization
            normalizer_chain = HierarchicalNormalizerChain(
                self.config,
                types,
                list(set(all_types))
            )
            results = normalizer_chain.normalize(results)

            # Sorting our results by confidence value
            results = sorted(
                results,
                key=lambda result: result.confidence,
                reverse=True
            )

        return {
            'query': truncate_text(data_string),
            'date': datetime.datetime.now(),
            'execution_seconds': time.time() - start_time,
            'top': results[0] if results else None,
            'results': {
                'count': len(results),
                'types': types,
                'matches': results
            }
        }
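How the return value might be consumed: a minimal usage sketch, assuming a parser instance has already been constructed with a cahoots config (the setup is hypothetical; the field names come from the dict built above):

    parsed = parser.parse("some input string")
    top = parsed['top']                       # highest-confidence ParseResult, or None
    print(parsed['results']['count'])         # number of matches found
    for match in parsed['results']['matches']:
        print(match.type, match.subtype, match.confidence)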
Example #2
    def test_normalizer_normalizes(self):
        # Two results sharing a type, at opposite ends of the confidence range
        res = [
            ParseResult('Test', 'Test', 100),
            ParseResult('Test', 'Test', 0)
        ]

        conf = TestConfig()
        conf.enabled_confidence_normalizers.append(NormalizerStub)
        hnc = HierarchicalNormalizerChain(conf, [], [])
        results = hnc.normalize(res)

        # The stub normalizer is expected to collapse the pair to one result
        self.assertEqual(1, len(results))
        self.assertIsInstance(results[0], ParseResult)
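NormalizerStub and TestConfig are fixtures defined elsewhere in the test suite. A hypothetical reconstruction of the stub, consistent with the assertions above and assuming the chain hands each enabled normalizer the full result list (the real cahoots interface may differ):

    class NormalizerStub(object):
        """Hypothetical stub: keeps only non-zero-confidence results,
        so the two inputs above collapse to a single ParseResult."""

        @staticmethod
        def normalize(results):
            return [result for result in results if result.confidence > 0]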
Example #3
    def parse(self, data_string):
        """
        Parses input data and returns a dict of result data

        :param data_string: the string we want to parse
        :type data_string: str
        :return: a dict of parse result data
        :rtype: dict
        """
        start_time = time.time()
        results = []
        threads = []

        # Creating/starting a thread for each parser module
        for module in self.config.enabled_modules:
            thread = ParserThread(self.config, module, data_string)
            thread.start()
            threads.append(thread)

        # Synchronizing/finishing parser threads
        for thr in threads:
            thr.join()

        # The threads are done, let's get the results out of them
        for thr in threads:
            results.extend(thr.results)

        # Unique list of all major types
        types = list({result.type for result in results})

        if results:
            # Getting a unique list of result types.
            all_types = []
            for res in results:
                all_types.extend([res.type, res.subtype])

            # Hierarchical Confidence Normalization
            normalizer_chain = HierarchicalNormalizerChain(
                self.config,
                types,
                list(set(all_types))
            )
            results = normalizer_chain.normalize(results)

            # Sorting our results by confidence value
            results = sorted(
                results,
                key=lambda result: result.confidence,
                reverse=True
            )

        return {
            'query': truncate_text(data_string),
            'date': datetime.datetime.now(),
            'execution_seconds': time.time() - start_time,
            'top': results[0] if results else None,
            'results': {
                'count': len(results),
                'types': types,
                'matches': results
            }
        }
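The concurrency here is a plain fan-out/join: start one worker thread per enabled module, join them all, then merge each thread's results. The same pattern in isolation, as a self-contained sketch (Worker mimics ParserThread storing its output on self.results; the job callables are stand-ins for parser modules):

    import threading

    class Worker(threading.Thread):
        """Runs a job and stores its output on self.results, like ParserThread."""

        def __init__(self, job, data):
            super(Worker, self).__init__()
            self.job = job
            self.data = data
            self.results = []

        def run(self):
            self.results = self.job(self.data)

    def fan_out_join(jobs, data):
        threads = [Worker(job, data) for job in jobs]
        for thread in threads:
            thread.start()
        for thread in threads:      # block until every worker finishes
            thread.join()
        merged = []
        for thread in threads:
            merged.extend(thread.results)
        return merged

    # Two trivial stand-in "modules"
    print(fan_out_join([lambda s: [s.upper()], lambda s: [len(s)]], "hello"))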