Example #1
def tagged_words(self, fileids=None, categories=None):
    return ConllCorpusReader.tagged_words(
        self, self._resolve(fileids, categories))
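This method normally lives in a reader class that mixes CategorizedCorpusReader into ConllCorpusReader, where self._resolve turns a categories argument into the matching file ids. A minimal sketch of such a class, with a hypothetical class name and category pattern that do not appear in the original example:

from nltk.corpus.reader import CategorizedCorpusReader, ConllCorpusReader


class CategorizedConllReader(CategorizedCorpusReader, ConllCorpusReader):
    """Hypothetical categorized CoNLL reader; categories come from cat_pattern."""

    def __init__(self, *args, **kwargs):
        # CategorizedCorpusReader pops its own keyword arguments
        # (cat_pattern, cat_map, cat_file) before ConllCorpusReader sees them.
        CategorizedCorpusReader.__init__(self, kwargs)
        ConllCorpusReader.__init__(self, *args, **kwargs)

    def tagged_words(self, fileids=None, categories=None):
        # _resolve maps a categories argument to the matching fileids.
        return ConllCorpusReader.tagged_words(
            self, self._resolve(fileids, categories))

With a reader like this, reader.tagged_words(categories='news') reads only the files that the category pattern maps to 'news'.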
Example #2
from __future__ import division
from nltk.corpus.reader import ConllCorpusReader
from nltk.probability import FreqDist, DictionaryProbDist, LaplaceProbDist, SimpleGoodTuringProbDist, MLEProbDist

conllreader = ConllCorpusReader(".", "de-train.tt", ('words', 'pos'))  # load the training corpus (word, POS) from file
states = ('VERB', 'NOUN', 'PRON', 'ADJ', 'ADV', 'ADP', 'CONJ', 'DET', 'NUM', 'PRT', 'X', '.')  # list of 12 POS tags
sentslen = len(conllreader.tagged_sents())  # getting number of sentences

tagfdist = FreqDist(pair[1] for pair in conllreader.tagged_words())   # frequency of each POS tag

firsttagfdist = FreqDist(sent[0][1] for sent in conllreader.tagged_sents())  # frequency of sentence-initial tags
A0j = DictionaryProbDist({k: x / sentslen for k, x in firsttagfdist.items()})
A0jLap = LaplaceProbDist(firsttagfdist)
A0jGT = SimpleGoodTuringProbDist(firsttagfdist)
A0jMLE = MLEProbDist(firsttagfdist)

# build (tag_i, tag_i+1) pairs over the whole corpus for transition counts
TagPair = []
words = conllreader.tagged_words()
for i in range(0, len(words) - 1):
    TagPair.append((words[i][1], words[i + 1][1]))

TagPairfdist = FreqDist(TagPair)
Aij = DictionaryProbDist({k: x / tagfdist[k[0]] for k, x in TagPairfdist.items()})
AijLap = LaplaceProbDist(TagPairfdist)
AijGT = SimpleGoodTuringProbDist(TagPairfdist)
AijMLE = MLEProbDist(TagPairfdist)

TagWordfdist = FreqDist(conllreader.tagged_words())  # frequency of (word, tag) pairs for emission counts
Biw = DictionaryProbDist({k: x / tagfdist[k[1]] for k, x in TagWordfdist.items()})
BiwLap = LaplaceProbDist(TagWordfdist)
BiwGT = SimpleGoodTuringProbDist(TagWordfdist)
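Taken together, these counts estimate the three parameter sets of an HMM tagger over the 12 universal POS tags: A0j* are initial-tag probabilities, Aij* are tag-to-tag transition probabilities, and Biw* are word-given-tag emission probabilities. The DictionaryProbDist variants divide each count by the appropriate marginal to give conditional probabilities, while the Laplace, Good-Turing, and MLE variants are smoothed distributions built directly over the raw counts. As a rough illustration of how the resulting distributions can be queried (the tag and word values below are placeholders, not taken from the corpus):

# probability of NOUN as a sentence-initial tag, relative-frequency vs. Laplace-smoothed counts
print(A0j.prob('NOUN'), A0jLap.prob('NOUN'))

# P(next tag = VERB | current tag = NOUN) from the conditional transition estimate
print(Aij.prob(('NOUN', 'VERB')))

# P(word | tag) from the conditional emission estimate; the Good-Turing
# distribution over (word, tag) counts also reserves mass for unseen pairs
print(Biw.prob(('Haus', 'NOUN')), BiwGT.prob(('Haus', 'NOUN')))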
Example #3
def tagged_words(self, fileids=None, categories=None):
    return ConllCorpusReader.tagged_words(self, self._resolve(fileids, categories))
Example #4
from nltk.corpus.reader import ConllCorpusReader

a = {}  # maps each noun to the list of adjectives found next to it


## Function to add an adjective to a noun key
def add_adj(noun_param, adj_param):
    if noun_param in a:
        a[noun_param].append(adj_param)
    else:
        a[noun_param] = [adj_param]


filedir = '/Users/fnascime/Documents/Sicily_Project/texts/'
filename = 'ilgattopardo_prima'

mycorpus = ConllCorpusReader(filedir, filename + '.conll',
                             ('ignore', 'words', 'ignore', 'pos', 'ignore',
                              'ignore', 'ignore', 'ignore'))

words = mycorpus.tagged_words()
list_len = len(words)

## Loop through the file and retrieve adjectives directly associated with nouns (adjunct words)
for i in range(list_len):

    if (words[i][1] == 'S'):
        if ((i > 0) and (words[i - 1][1] == 'A')):
            add_adj(words[i][0], words[i - 1][0])
        elif ((i < list_len - 1) and (words[i + 1][1] == 'A')):
            add_adj(words[i][0], words[i + 1][0])

## Loop through the collected nouns and find the ones with the most adjectives

nouns_counting = len(a)
adj_counting = 0
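The excerpt stops just as the counting begins. One plausible way to finish it, assuming the goal is to report the nouns carrying the most adjectives (every name below beyond a, nouns_counting, and adj_counting is mine, not from the original):

# find the largest number of adjectives attached to any single noun
for noun, adjectives in a.items():
    if len(adjectives) > adj_counting:
        adj_counting = len(adjectives)

# collect the noun(s) that reach that maximum
top_nouns = [noun for noun, adjectives in a.items()
             if len(adjectives) == adj_counting]

print("Nouns found:", nouns_counting)
print("Most adjectives for a single noun:", adj_counting)
print("Noun(s) with the most adjectives:", top_nouns)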