Exemplos de TaggedCorpusReader.sents em Python

Linguagem de programação: Python

Espaço para nome / nome do pacote: nltk.corpus.reader.tagged

Classe / Tipo: TaggedCorpusReader

Método / Função: sents

Exemplos em hotexamples.com: 2

TaggedCorpusReader.sents em Python - 2 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de nltk.corpus.reader.tagged.TaggedCorpusReader.sents em Python extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

TaggedCorpusReader(7)

__init__(6)

words(3)

fileids(2)

sents(2)

tagged_sents(2)

tagged_words(2)

paras(1)

Métodos Frequentes

TaggedCorpusReader (7)

__init__ (6)

words (3)

fileids (2)

sents (2)

tagged_sents (2)

tagged_words (2)

paras (1)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: solution1.py Projeto: aparnamani/nested-syntatic-constituents

def main(): #''' Accessing Folder''' dirpath = str( sys.argv[1] ) #sys.argv[0] is the name of the python program, sys.arg[1] is the directory path folder = nltk.data.find(dirpath) #''' Reading Corpus files ''' corpus = TaggedCorpusReader(folder, '.*\.prd') #''' Extracting sentences in Corpus files ''' corpusSents = corpus.sents() #''' Splitting notes & Combining elements ''' corpusElems = [] for corpusSent in corpusSents: for elem in corpusSent: corpusElems.append(elem) solution = TallySolution(corpusElems) solution.countS() solution.countNP() solution.countVP() solution.countDVP() solution.countIVP()

Exemplo n.º 2

0

Exibir arquivo

Arquivo: creacionCorpus.py Projeto: MariaIsabelLL/Python_NLTK

loc = '/Users/rmoura/nltk_data/corpora/rai/textoSimples/' corpus1 = PlaintextCorpusReader(loc, '.*\.txt') print(corpus1.fileids()) print(corpus1.sents()) print(corpus1.words()) # Corpus texto etiquetado from nltk.corpus.reader.tagged import TaggedCorpusReader loc = '/Users/rmoura/nltk_data/corpora/rai/textoEtiquetas/' corpus2 = TaggedCorpusReader(loc, '.*\.txt') print(corpus2.fileids()) print(corpus2.words()) print("Palavras etiquetadas: ", corpus2.tagged_words()) print(corpus2.tagged_words('003.txt')) print("Sentencas diretas:") for s in corpus2.sents(): print(' '.join(s)) from nltk.corpus.reader import CategorizedPlaintextCorpusReader loc = '/Users/rmoura/nltk_data/corpora/rai/textoCategorias/' corpus3 = CategorizedPlaintextCorpusReader(loc, '.*\.txt', cat_file="categorias.txt") print(corpus3.fileids()) print(corpus3.categories()) print(corpus3.words(categories='brasnam')) # Definicao de stopwords stopwords = nltk.corpus.stopwords.words('portuguese') fd = nltk.FreqDist(w.lower() for w in corpus3.words()) fd1 = nltk.FreqDist(w.lower() for w in corpus3.words() if w.isalpha() and w not in stopwords)