def get_predictions(test_data):
    """Compute predictions for the test set given as argument, using the
    two different scoring heuristics.

    Keyword arguments:
    test_data -- path to the test file; each question spans
                 OPTIONS_PER_SENTENCE consecutive lines, every line holding
                 one bracketed candidate option (e.g. "[word]"); the first
                 line of each group also carries the full sentence, whose
                 first whitespace-separated cell is the question number.

    Returns a list of (best_option_1, best_option_2) tuples, one per
    question — the winners under the two scoring heuristics, or
    (None, None) when no option could be scored (stemming path only).
    """
    # Raw string avoids invalid escape sequences ('\[' , '\-'); compiled
    # once instead of re-parsed on every line of the file.
    option_pattern = re.compile(r"\[([\d\w'\-,]+)\]")
    predictions = []
    options = []  # the candidate options collected for the current sentence
    words_in_sentence = []
    with open(test_data) as f:
        for i, line in enumerate(f):
            # NOTE(review): assumes every line contains a bracketed option;
            # a malformed line raises AttributeError here (unchanged).
            match = option_pattern.search(line)
            option = match.group(1)
            if i % OPTIONS_PER_SENTENCE == 0:
                # First line of a new question: reset the option list and
                # extract the sentence words.
                options = [option]
                line = line.replace("[%s]" % option, "")  # remove fill word from sentence
                # start from index 1 since 1st cell contains the question number
                words_in_sentence = line.split()[1:]
            elif i % OPTIONS_PER_SENTENCE == OPTIONS_PER_SENTENCE - 1:
                # Last option of the group: score and record a prediction.
                # (Was a hard-coded `== 4`; expressed via the constant so the
                # two modulus tests stay consistent.)
                if not STEMMING:
                    options.append(option)
                    best_option_1, best_option_2 = get_best_option(options, words_in_sentence)
                    predictions.append((best_option_1, best_option_2))
                else:
                    options.append(option)
                    # Score on stemmed forms, but report the original surface
                    # forms by mapping the winners back through their index.
                    stemmed_options = [stem(option) for option in options]
                    stemmed_words_in_sentence = [stem(word) for word in words_in_sentence]
                    best_option_1, best_option_2 = get_best_option(
                        stemmed_options, stemmed_words_in_sentence)
                    if best_option_1 and best_option_2:
                        best_option_index_1 = stemmed_options.index(best_option_1)
                        best_option_index_2 = stemmed_options.index(best_option_2)
                        predictions.append((options[best_option_index_1],
                                            options[best_option_index_2]))
                    else:
                        predictions.append((None, None))
            else:
                # Middle line of the group: just collect the option.
                options.append(option)
    return predictions
# NOTE(review): this is a byte-for-byte duplicate of the get_predictions
# defined just above; being second, it silently shadows the earlier
# definition at import time. One of the two should be deleted — confirm
# which formatting the project keeps.
def get_predictions(test_data):
    """compute predictions for the test set given as argument using the two different scoring heuristics """
    with open(test_data) as f:
        i = 0
        predictions = []
        options = [
        ]  # stores the different options possible for a single sentence
        for line in f:
            # Every line is expected to contain one bracketed option, e.g.
            # "[word]" — match.group(1) raises AttributeError otherwise.
            match = re.search('\[([\d\w\'\-,]+)\]', line)
            option = match.group(1)
            if i % OPTIONS_PER_SENTENCE == 0:
                # First line of a new question: reset the option list and
                # extract the sentence words from this line.
                options = [option]
                line = line.replace("[%s]" % option,
                                    "")  # remove fill word from sentence
                words_in_sentence = line.split(
                )[1:]  # start from index 1 since 1st cell contains the question number
            elif i % OPTIONS_PER_SENTENCE == 4:
                # Last option of the group (presumably OPTIONS_PER_SENTENCE
                # is 5 — TODO confirm): score the collected options.
                if not STEMMING:
                    options.append(option)
                    best_option_1, best_option_2 = get_best_option(
                        options, words_in_sentence)
                    predictions.append((best_option_1, best_option_2))
                else:
                    options.append(option)
                    # Score on stemmed forms, then map the winning stems back
                    # to the original surface forms via their index.
                    stemmed_options = [stem(option) for option in options]
                    stemmed_words_in_sentence = [
                        stem(word) for word in words_in_sentence
                    ]
                    best_option_1, best_option_2 = get_best_option(
                        stemmed_options, stemmed_words_in_sentence)
                    if best_option_1 and best_option_2:
                        best_option_index_1 = stemmed_options.index(
                            best_option_1)
                        best_option_index_2 = stemmed_options.index(
                            best_option_2)
                        predictions.append((options[best_option_index_1],
                                            options[best_option_index_2]))
                    else:
                        # No scorable option under the stemming heuristic.
                        predictions.append((None, None))
            else:
                # Middle line of the group: just collect the option.
                options.append(option)
            i += 1
        return predictions
def filter_stem(input_path, output_path):
    """Rewrite a text file word by word, keeping only each word's stem.

    Keyword arguments:
    input_path -- input file path
    output_path -- output file path
    """
    with open(input_path) as inp, open(output_path, 'w') as out:
        for line in inp:
            stemmed_words = (stem(word) for word in line.split())
            out.write(" ".join(stemmed_words) + '\n')
def term_normalize(term):
    """Normalize a term: drop non-alphabetic characters, lowercase, stem."""
    letters_only = ''.join(filter(str.isalpha, term))
    return stem(letters_only.lower())