Python Stemmer.stemWord примеры использования

Язык программирования: Python

Пространство имен/Пакет: stemmer

Класс/Тип: Stemmer

Метод/Функция: stemWord

Примеров на hotexamples.com: 1

Python Stemmer.stemWord - 1 пример найден. Это лучшие примеры Python кода для stemmer.Stemmer.stemWord, полученные из open source проектов. Вы можете ставить оценку каждому примеру, чтобы помочь нам улучшить качество примеров.

Основные методы

Показать Скрыть

Stemmer(30)

stem_term(15)

stem(8)

find_basic_form(2)

normalize_list(2)

get_stems(1)

lower(1)

m(1)

ngramStemmer(1)

stemWord(1)

stem_text(1)

stem_words(1)

upper(1)

Пример #1

Показать файл

        string = re.split(
            ' |-|\n|\u00e3|\u2019|\u201c|\u201d|\u2014|\u2018|\u00a9|\u00af|\u00aa|\u00b4|\u00a7|\u00a8',
            f.read())

        location = 0

        for word in string:

            word = word.lower()
            # Stripping word and removing " ' : ; - _ # + @ ( ) / ? ~ ` [ ] { } =
            word = word.strip(
                ',|!|.|"|;|:|-|_|#|+|@|)|(|/|?|~|`|[|]|{|}|=|\u00e3')

            if word not in stopWords:  # If word is not a stopword

                word = stemmer.stemWord(word)  # Stemming

                # ========= Building Inverted Index =========
                # If the word already exists in the II, Append the word's document number in the II.
                # If the word does not exist in the II, add the word as a key and also add the document number.
                if word in invertedIndex:
                    if file.split('.')[0] not in invertedIndex[word]:
                        invertedIndex[word].append(file.split('.')[0])
                else:
                    invertedIndex[word] = [file.split('.')[0]]

            # ========= Building Positional Index =========
            # If the word already exists in the PI, Append the word's document number and its position in the PI.
            # If the word does not exist in the OI, add the word as a key and also add the document number and the position of the word.
            if word in positionalIndex:
                if word not in stopWords: