Python PickledCorpusReader.categories usage examples

Programming language: Python

Namespace/Package: reader

Class/Type: PickledCorpusReader

Method/Function: categories

Examples on hotexamples.com: 3

Python PickledCorpusReader.categories - 3 examples found. These are the top-rated Python code examples for reader.PickledCorpusReader.categories, taken from open source projects.

Main methods

PickledCorpusReader(16)

docs(6)

fileids(6)

words(3)

categories(2)

sents(2)

Example #1

File: info.py Project: yokeyong/atap

from reader import PickledCorpusReader

reader = PickledCorpusReader('../corpus')

for category in reader.categories():
    # Count the documents and the word tokens that belong to this category.
    n_docs = len(reader.fileids(categories=[category]))
    n_words = sum(1 for word in reader.words(categories=[category]))

    print("- '{}' contains {:,} docs and {:,} words".format(category, n_docs, n_words))

Example #2

from reader import PickledCorpusReader

reader = PickledCorpusReader('../corpus')

for category in reader.categories():
    # Count the documents and the word tokens that belong to this category.
    n_docs = len(reader.fileids(categories=[category]))
    n_words = sum(1 for word in reader.words(categories=[category]))

    print("- '{}' contains {:,} docs and {:,} words".format(
        category, n_docs, n_words))

Example #3

File: splits.py Project: AmalfiTrader/Text-Analysis

from sklearn.model_selection import train_test_split as tts
from reader import PickledCorpusReader

reader = PickledCorpusReader('../corpus')

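# Restrict the dataset to the file ids filed under these category labels.
labels = ["books", "cinema", "cooking", "gaming", "sports", "tech"]
docs = reader.fileids(categories=labels)
# Load each document and take its first category as the target label.
X = list(reader.docs(fileids=docs))
y = [reader.categories(fileids=[fileid])[0] for fileid in docs]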
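
The snippets on this page need the pickled corpus on disk and the project's reader module, so they cannot be run in isolation. The sketch below is a rough, hypothetical stand-in (ToyCorpusReader is not the real PickledCorpusReader) that mimics only the calls used here: categories() with no arguments, categories(fileids=[...]), fileids(categories=[...]) and words(categories=[...]). It assumes the category is the leading directory of each file id, which may not match how the real reader resolves categories.

```python
# Hypothetical in-memory stand-in; NOT the real PickledCorpusReader.
class ToyCorpusReader:
    def __init__(self, docs_by_fileid):
        # Maps a file id such as "books/a.pickle" to a list of word tokens.
        self._docs = docs_by_fileid

    @staticmethod
    def _category_of(fileid):
        # Assumption: the category is the leading directory of the file id.
        return fileid.split("/", 1)[0]

    def fileids(self, categories=None):
        """Return sorted file ids, optionally restricted to some categories."""
        ids = sorted(self._docs)
        if categories is None:
            return ids
        wanted = set(categories)
        return [f for f in ids if self._category_of(f) in wanted]

    def categories(self, fileids=None):
        """Return sorted category names, optionally only for the given files."""
        if fileids is None:
            fileids = self._docs
        return sorted({self._category_of(f) for f in fileids})

    def words(self, categories=None):
        """Yield every token from the selected documents."""
        for fileid in self.fileids(categories=categories):
            yield from self._docs[fileid]


reader = ToyCorpusReader({
    "books/a.pickle": ["a", "long", "novel"],
    "books/b.pickle": ["short", "story"],
    "tech/c.pickle": ["new", "phone"],
})

# Per-category (document count, word count), as in the first two examples.
counts = {
    category: (
        len(reader.fileids(categories=[category])),
        sum(1 for _ in reader.words(categories=[category])),
    )
    for category in reader.categories()
}

# One label per document, aligned with the file id order, as in the third example.
docs = reader.fileids(categories=["books", "tech"])
y = [reader.categories(fileids=[fileid])[0] for fileid in docs]
```

Because y is built by iterating over docs in order, each label lines up with the document at the same index, which is what a later train/test split relies on.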