Python DictUtils 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: etl_utils

클래스/타입: DictUtils

hotexamples.com에서의 예제들: 3

Python DictUtils - 3개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 etl_utils.DictUtils에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

add_default_value(2)

예제 #1

파일 보기

    def __init__(self, documents_or_func, cache_dir):
        """
        1.  input is { { item_id : {feature1:count1, feature2: count2, ...} }, ... }
        2. output is { { item_id : {feature1: rank1, feature2:  rank2, ...} }, ... }
        They're all acts like a dict, whatever persistent or not.
        """
        self.documents_or_func = documents_or_func
        self.cache_dir = cache_dir

        # Always load idf cache, and it's really small.
        d1 = DictUtils.add_default_value(self.idf_cache)
        self.idf_cache = IdfResult(d1.default_factory, d1)

예제 #2

파일 보기

파일: classify.py 프로젝트: 17zuoye/textmulclassify

    def entropy_cache(self):
        """ 训练+测试 预料全在里面了 """
        from .lib.entropy import EntropyFunc
        result = EntropyFunc.process(self.documents_with_segments, self.cache_dir)
        self.entropy_file = result  # 采用自己的熵，给FeaturesWeight用
        result = DictUtils.add_default_value(result)

        """
        from etl_utils import is_regular_word, Unicode
        for k1 in result.keys():
            if not (is_regular_word(k1) or Unicode.is_chinese(k1)):
                del result[k1]
        """

        self.entropy_file = result
        return result

예제 #3

파일 보기

파일: classify.py 프로젝트: mvj3/textmulclassify

    def entropy_cache(self):
        """ 训练+测试 预料全在里面了 """
        from .lib.entropy import EntropyFunc
        result = EntropyFunc.process(self.documents_with_segments,
                                     self.cache_dir)
        self.entropy_file = result  # 采用自己的熵，给FeaturesWeight用
        result = DictUtils.add_default_value(result)
        """
        from etl_utils import is_regular_word, Unicode
        for k1 in result.keys():
            if not (is_regular_word(k1) or Unicode.is_chinese(k1)):
                del result[k1]
        """

        self.entropy_file = result
        return result