Python readListFromTxt 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: utils.io

메소드/함수: readListFromTxt

hotexamples.com에서의 예제들: 7

Python readListFromTxt - 7개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 utils.io.readListFromTxt에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: topic_extract_theme_ai.py 프로젝트: michaelwangtd/graphToolset

def extractTheme(tagList,tagbaseFilePath):
    themeList = []
    tagbaseList = io.readListFromTxt(tagbaseFilePath)
    for item in tagList:
        if item in tagbaseList:
            themeList.append(item)
    return themeList

예제 #2

파일 보기

파일: tag_extract_itjz.py 프로젝트: michaelwangtd/graphToolset

def filterTagFromTagbase(content,tagbaseFilePath):
    resultList = []
    # 获取标签库列表
    tagbaseList = io.readListFromTxt(tagbaseFilePath)
    for item in tagbaseList:
        if item in content:
            resultList.append(item)
    return resultList

예제 #3

파일 보기

def extractTheme(tagList,tagbaseFilePath):
    themeList = []
    tagbaseList = io.readListFromTxt(tagbaseFilePath)
    for item in tagList:
        if item not in index.TAGBASE_STOP_WORD_LIST:
            if item in tagbaseList:
                themeList.append(item)
    return themeList

예제 #4

파일 보기

파일: topic_extract_theme_ai.py 프로젝트: michaelwangtd/graphToolset

def cleanTheme(tagList):
    themeList = []
    # 获取标签库中标签
    filePath = io.getSourceFilePath('tagbase.txt')
    tagbaseList = io.readListFromTxt(filePath)
    for item in tagList:
        if item in tagbaseList:
            themeList.append(item)
    return themeList

예제 #5

파일 보기

파일: topic_extract_theme_ai.py 프로젝트: michaelwangtd/graphToolset

def updateTagbase():
    '''
        作为一个单独模块，对tagbase.txt进行调整
    '''
    # 对标签库进行了去重操作
    tagbaseFilePath = io.getSourceFilePath('tagbase.txt')

    tagbaseList = io.readListFromTxt(tagbaseFilePath)   # 68638
    cleanTagbaseList = list(set(tagbaseList))   # 67523
    io.writeList2Txt('tagbase.txt',cleanTagbaseList)

예제 #6

파일 보기

파일: tag_extract_itjz.py 프로젝트: michaelwangtd/graphToolset

def scanTheme2Tag(themeList,tagbaseFilePath):
    '''
        从标签库中筛选标签
    '''
    tagList = []
    tagbaseList = io.readListFromTxt(tagbaseFilePath)
    for item in themeList:
        if item in tagbaseList:
            tagList.append(item)
    return tagList

예제 #7

파일 보기

파일: iron_tag_all_info.py 프로젝트: michaelwangtd/graphToolset

 inputFilePath = io.getSourceFilePath('investEvents_20161227144154.txt')
 outputFilePath = io.getSourceFilePath(
     'investEvents_taged_20161227144154.txt')
 tagbaseFilePath = io.getSourceFilePath(
     'tagbase_iron_tag_all_product_company.txt')
 newseedInfoOutputFilePath = io.getProcessedFilePath(
     'newseed_taged_info.csv')
 # get infoList
 infoList = io.loadData2Json(inputFilePath)
 # persist tagbase from redis
 tagbaseDic = util.getTagbaseDicFromRedis(initDic, tagbaseNameList)
 util.persistentTagbase(tagbaseDic, tagbaseFilePath)
 # load cut word user dict
 jieba.load_userdict(tagbaseFilePath)
 # get tagbaseList
 tagbaseList = io.readListFromTxt(tagbaseFilePath)
 # prepare for output
 fw = open(outputFilePath, 'w', encoding='utf-8')
 i = 1
 j = 0
 # traverse infoList
 for item in infoList:
     if item['startup']['productDesc']:
         productDesc = item['startup']['productDesc']
         # get cleaned desc
         cleanedDesc = getCleanedDesc(productDesc)
         # get cut word list
         cutWordList = getCutWordList(cleanedDesc)
         # extract tag
         ironTagList = extractTag(cutWordList, tagbaseList)
         print(i, 'extracted tag:', ironTagList)