Python wenzhi_analysis 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: tencent_qcloud_classifier.wenzhi_utils

메소드/함수: wenzhi_analysis

hotexamples.com에서의 예제들: 2

Python wenzhi_analysis - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 tencent_qcloud_classifier.wenzhi_utils.wenzhi_analysis에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: data_preprocess.py 프로젝트: petchat/app.wechat.tagger

def tencent_classify_rawtext_files(files_root_path, result_path, pass_num=-1):
    count = 0
    flist = os.listdir(files_root_path)
    for f in flist:

        print '%s:%s' % (count, f)
        count += 1
        if count < pass_num:
            continue
        ftext = codecs.open(os.path.join(files_root_path, f), 'r', encoding='utf8').read()
        try:
            # json_obj = json.loads(ftext)
            ftext = ftext.replace('\n', '')
            ftext = ftext.replace(' ', '')
            refined_text = wenzhi_utils.remove_illegal_characters(ftext)
            result = wenzhi_utils.wenzhi_analysis(refined_text)
            # result = tencent_classify(ftext)
        except Exception, e:  # 懒得差各种异常了，直接重复
            print e
            continue
        if result['code'] == 0:
            for class_type in result['classes']:
                if class_type['conf'] > 0.5:
                    try:
                        fout = codecs.open(os.path.join(result_path, class_type['class'], f + '.txt'), 'w')
                    except IOError, e:
                        print e
                        os.mkdir(os.path.join(result_path, class_type['class']))
                        fout = codecs.open(os.path.join(result_path, class_type['class'], f + ".txt"), 'w')
                    except KeyError, ke:
                        print ke
                        continue
                    fout.write(refined_text)

예제 #2

파일 보기

파일: app.py 프로젝트: petchat/app.wechat.tagger

def analyzse_article():
    """
    抽离文章分析接口
    :return:
    """
    req_data = json.loads(request.data)
    content_list = req_data.get('article_content')
    article_content = req_data.get('article_content')
    result = wenzhi_utils.wenzhi_analysis(article_content)
    # topic_list = tagging_utils.passage_second_level_classify(web_content)
    tag_result = []
    if result['code'] == 0:
        for class_item in result['classes']:
            class_type = class_item['class']
            class_prob = class_item['conf']
            tag_result.append({'tag': class_type, 'prob': class_prob})
    return json.dumps({'code': 0, 'tag_result': tag_result}, ensure_ascii=False)