Python get_spelling_variants 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: embem.emotools.lexicon

메소드/함수: get_spelling_variants

hotexamples.com에서의 예제들: 2

Python get_spelling_variants - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 embem.emotools.lexicon.get_spelling_variants에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: generate_historic_wnaffect.py 프로젝트: NLeSC/embodied-emotions-scripts

    with codecs.open(args.dict_file, 'rb', 'latin1') as f:
        lines = f.readlines()

    count = 0

    spelling_vars = {}
    for line in lines:
        count += 1
        entry = line.split(';')
        # lexicon service needs lower case input
        term = entry[0].lower()
        term = term.replace('"', '')
        while True:
            try:
                sleep(1)
                words = get_spelling_variants(term, [], 1600, 1830)
                words = list(set(words))
                break
            except:
                print 'Retry!'
                sleep(5)
                pass

        if len(words) > 0:
            spelling_vars[term] = words

        if count % 1000 == 0:
            print count

        print term, words

예제 #2

파일 보기

파일: generate_historic_liwc.py 프로젝트: NLeSC/embodied-emotions-scripts

    liwc_category_output = []
    spelling_vars = {}
    liwc_output = {}
    for line in lines:
        # legend
        if line[0].isdigit() or line.startswith(('%', '\r')):
            liwc_category_output.append(line.strip())
        # word
        else:
            entry = line.split()
            # lexicon service needs lower case input
            term = entry[0].lower()
            categories = entry[1:]
            sleep(0.3)
            words = get_spelling_variants(term, categories, 1600, 1830)
            words.append(term)
            words = list(set(words))

            spelling_vars[term] = words

            print term, words
            for word in words:
                if liwc_output.get(
                        word) and not categories == liwc_output[word]:
                    new_c = list(set(categories + liwc_output.get(word)))
                    new_c.sort()
                    liwc_output[word] = new_c
                else:
                    liwc_output[word] = categories
    #with codecs.open('liwc_output.json', 'w', 'utf8') as f: