def remove_stopwords(data):
    """Filter, clean, stem, and lowercase the word tokens in *data*.

    Keeps only purely alphabetic tokens longer than 2 characters that are
    not marked as stopwords in the module-level ``stp`` mapping (a word is
    treated as a stopword when ``stp[word] == 1``).  Each surviving word is
    stemmed with the module-level ``Stemmer`` and lowercased.

    Parameters:
        data: iterable of word tokens (strings).

    Returns:
        list[str]: stemmed, lowercased, non-stopword tokens.
    """
    table = str.maketrans('', '', string.punctuation)
    result = []
    for word in data:
        if not word.isalpha():
            continue
        # NOTE(review): a word that passed isalpha() contains no whitespace
        # or punctuation, so strip()/translate() are effectively no-ops;
        # kept in case the isalpha() guard is ever relaxed upstream.
        word = word.strip().translate(table).strip()
        if len(word) <= 2:
            continue
        # The original appended in BOTH the KeyError branch and the
        # stp[word] != 1 branch; dict.get collapses the two into one
        # condition (a missing key yields None, and None != 1).
        if stp.get(word) != 1:
            result.append(str(Stemmer.stem(word)).lower())
    return result
def porterStemmer(string):
    """Return *string* stemmed by the module-level ``Stemmer``.

    Parameters:
        string: the text to stem.  (NOTE: the parameter name shadows the
            stdlib ``string`` module inside this function; kept unchanged
            so keyword callers are not broken.)

    Returns:
        The result of ``Stemmer.stem(string)``.

    Note:
        The original docstring claimed an optional per-word stemmer
        function parameter; no such parameter exists in the signature,
        so that claim has been removed.
    """
    return Stemmer.stem(string)
s = re.sub(r'[.,!?;:{}[]()-_]', '', word) # с помощью регулярных выражений удаляем знаки препинания unsymboled.append(s) listed = [s.split(" ") for s in unsymboled] # разделяем предложения на отдельные слова new = [] for sentence in listed: s = [i for i in sentence if i not in stop_words_list] # удаляем стоп-символы new.append(s) result = [] for sentence in new: s = [_stemmer.stem(i) for i in sentence] # производится стемминг result.append(s) print(result) # преобразование массива result в строку для удаления уникальных вхождений text = [" ".join(i) for i in result] text = " ".join(text) words = text.split(" ") #print(words) # Все слова по отдельности #print(text) # Сам текст newtext = '' for word in words: i = text.count(word)
def stem_words(tokens):
    """Stem every token in *tokens*.

    Instantiates the module-level ``Stemmer`` class and applies its
    ``stem`` method to each token in order.

    Parameters:
        tokens: iterable of token strings.

    Returns:
        list: one stemmed value per input token.
    """
    porter = Stemmer()
    return [porter.stem(word) for word in tokens]
def mainfunctioncodestem(String):
    """Tokenize a code string and return its stemmed token structure.

    Trims surrounding whitespace from *String*, splits it into tokens
    via the project tokenizer, then passes the token list to
    ``Stemmer.stem``.

    Parameters:
        String: raw code text.  (Name kept as-is for keyword callers,
            despite shadowing the conventional ``str`` naming.)

    Returns:
        Whatever ``Stemmer.stem`` produces for the token list.
    """
    trimmed_text = String.strip()
    tokens = Tokenizer.ClassTokenizer.code_tokenizer(trimmed_text)
    return Stemmer.stem(tokens)