Python MongoUtil.distinct_count Examples

Programming language: Python

Namespace/package name: source

Class/type: MongoUtil

Method/function: distinct_count

Examples at hotexamples.com: 3

Python MongoUtil.distinct_count - 3 examples found. These are the top-rated real-world Python examples of source.MongoUtil.distinct_count, extracted from open-source projects.

Frequently used methods

find(14)

find_one(14)

isExist(8)

upsert_mary(4)

distinct_count(3)

insert(3)

capacity_find_most(2)

count(2)

save(2)

sort_with_values(2)

create_index(1)

remove(1)

sort(1)

Example #1

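import json

# MongoUtil and const are helpers from the original project (not shown here).


def showData():
    """Print collection sizes and how many apps already have comments."""
    print("Total number of apps: " + str(MongoUtil.count("app_table")))
    locationCount = 0
    # Close the category file deterministically instead of leaking the handle.
    with open(const.WANDOUJIA_CATA_JSON_FILE) as f:
        catas = json.load(f)
    for cataname in catas:
        cataname = cataname.strip()
        print(cataname + " count: " + str(MongoUtil.count(cataname)))
        # distinct_count returns the distinct values, so len() gives their number.
        locationCount += len(MongoUtil.distinct_count(cataname, "appid"))
    print("Number of apps with comments fetched: " + str(locationCount), end="\n\n")
    print("Number of words: " + str(MongoUtil.count("word_table")))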
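
The `len(MongoUtil.distinct_count(cataname, "appid"))` call above counts distinct apps rather than rows. The implementation of `MongoUtil.distinct_count` is not shown on this page; it presumably wraps a MongoDB distinct query. The sketch below is only an in-memory stand-in that illustrates the assumed semantics. The function name `distinct_values`, its optional `value` filter, and the sample rows are all hypothetical.

```python
from typing import Any, Iterable, Mapping, Optional


def distinct_values(docs: Iterable[Mapping[str, Any]],
                    key: str,
                    value: Optional[Mapping[str, Any]] = None) -> list:
    """Return the distinct values of `key` among the docs matching `value`.

    Hypothetical stand-in for MongoUtil.distinct_count; the real helper is
    not shown in the source and presumably queries MongoDB instead.
    """
    seen = []
    for doc in docs:
        # Apply the optional equality filter, e.g. {"wordid": 7}.
        if value and any(doc.get(k) != v for k, v in value.items()):
            continue
        if key in doc and doc[key] not in seen:
            seen.append(doc[key])
    return seen


# Made-up rows standing in for one category collection.
rows = [
    {"appid": "a1", "wordid": 7},
    {"appid": "a1", "wordid": 9},
    {"appid": "a2", "wordid": 7},
    {"appid": "a3", "wordid": 5},
]

print(len(rows))                                           # rows in the collection
print(len(distinct_values(rows, "appid")))                 # distinct apps
print(len(distinct_values(rows, "appid", {"wordid": 7})))  # apps containing word 7
```

Under these assumptions, the row count and the distinct-app count differ whenever one app contributes several rows, which is why the examples take `len()` of the distinct result instead of reusing `MongoUtil.count`.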

Example #2

def showData(cataname):
    """Print app, word, and comment-coverage counts for one category."""
    print("Total number of apps: " + str(MongoUtil.count("app_table")))
    print("Number of words: " + str(MongoUtil.count("word_table")))
    appCount = MongoUtil.find("app_table", {"catagory": cataname}).count()
    print(cataname + " app count: " + str(appCount))
    locationCount = 0
    cataname = cataname.strip()
    print(cataname + " location count: " + str(MongoUtil.count(cataname)))
    # Distinct appids in the category collection = apps that already have comments.
    locationCount += len(MongoUtil.distinct_count(cataname, "appid"))
    print("Apps with comments fetched: " + str(locationCount))
    print("Apps without comments fetched: " + str(appCount - locationCount))

Example #3

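    def tf_idf(self):
        # Fragment of a class method; it needs `import math` at module level.
        if not self.worddict:
            print("Please initialize the word frequency statistics first")
            return
        if self.wordcount < 100:
            print("This app has too few comments; extracted keywords would be inaccurate")
            return

        # Total number of documents (distinct apps in this category)
        docu_count = len(
            MongoUtil.distinct_count(self.app["catagory"], "appid",
                                     value=None))
        # Subtract the app itself
        docu_count -= 1
        # math.log(0) raises ValueError, so stop when no other documents exist
        # (the original clamped docu_count to 0 and then took its logarithm).
        if docu_count <= 0:
            print("No other apps in this category; cannot compute tf-idf")
            return

        tf_idfdict = {}
        for word, count in self.worddict.items():
            result = MongoUtil.find_one("word_table", {"word": word})
            wordid = result["_id"]
            # Number of apps in the category whose comments contain this word
            include_count = len(
                MongoUtil.distinct_count(self.app["catagory"],
                                         "appid",
                                         value={"wordid": wordid}))
            # Subtract the app itself
            include_count -= 1

            # print(word + "->" + str(count) + "  total documents containing it: " + str(include_count))
            # print(str(docu_count) + " " + str(include_count))

            # idf = log(N / (df + 1)); the +1 avoids division by zero
            wordidf = math.log(docu_count / (include_count + 1))
            wordtf = count / self.wordcount
            tf_idfdict[word] = wordtf * wordidf

        for word, score in tf_idfdict.items():
            print(word + "    occurrences: " + str(self.worddict[word]) +
                  "     tf-idf value: " + str(score))

        return tf_idfdict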
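
The scoring in Example #3 can be checked without a database. The following is a minimal sketch of the same formula, tf = count / total words and idf = log(N / (df + 1)), using plain dictionaries. The function name `tf_idf_scores`, its parameters, and the sample numbers are assumptions made for illustration; they are not part of the original project.

```python
import math
from typing import Dict


def tf_idf_scores(word_counts: Dict[str, int],
                  total_words: int,
                  doc_count: int,
                  doc_freq: Dict[str, int]) -> Dict[str, float]:
    """Score each word as tf * idf, with idf = log(doc_count / (df + 1)).

    doc_count and doc_freq are assumed to exclude the app being scored,
    mirroring the "subtract the app itself" steps in Example #3.
    """
    # log is undefined for non-positive arguments, so bail out early.
    if total_words <= 0 or doc_count <= 0:
        return {}
    scores = {}
    for word, count in word_counts.items():
        tf = count / total_words
        idf = math.log(doc_count / (doc_freq.get(word, 0) + 1))
        scores[word] = tf * idf
    return scores


# Made-up numbers: 200 words in this app's comments, 9 other apps in the category.
scores = tf_idf_scores(
    word_counts={"fast": 30, "app": 60},
    total_words=200,
    doc_count=9,
    doc_freq={"fast": 2, "app": 8},
)
for word, score in scores.items():
    print(word, round(score, 4))
```

With these inputs, "app" appears in 8 of the 9 other apps, so its idf is log(9 / 9) = 0 and it scores zero despite being the most frequent word, while the rarer "fast" gets a positive score.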