Python MBSP.chunk Exemples

Langage de programmation: Python

Class/Type: MBSP

Méthode/Fonction: chunk

Exemples au hotexamples.com: 2

Python MBSP.chunk - 2 exemples trouvés. Ce sont les exemples réels les mieux notés de MBSP.chunk extraits de projets open source. Vous pouvez noter les exemples pour nous aider à en améliorer la qualité.

Méthodes fréquemment utilisées

Afficher Cacher

parse(7)

Mbt(2)

TokenString(2)

chunk(2)

lemmatize(2)

split(2)

Sentence(1)

Server(1)

pprint(1)

start(1)

tag(1)

tokenize(1)

xml(1)

Méthodes fréquemment utilisées

parse (7)

Mbt (2)

TokenString (2)

chunk (2)

lemmatize (2)

split (2)

Sentence (1)

Server (1)

pprint (1)

start (1)

Méthodes fréquemment utilisées

tag (1)

tokenize (1)

xml (1)

Associées

get_version_names

should_stream

write_blade_surf

TabLabel

Read

get_possible_pig_config_from

function_gate

xmlToTag

write_png

uninstall_plugins

Related in langs

SAMBA_HAVE_POSIX_ACLS (PHP)

Amazon_EC2_Model_DisassociateAddressResponse (PHP)

IQuestionBox (C#)

PlayerInputMapping (C#)

LaplaceBil_x (C++)

utf8_test (C++)

GetMinSeq (Go)

New (Go)

XmlEvent (Java)

DataPair (Java)

Exemple #1

0

Afficher le fichier

Fichier : PartsOfSpeech-MBSP-annotate.py Projet : annabonazzi/NLP

text = str(text).replace('\x00 ','').replace('\xef\xbf\xbd','') text = str(text).replace('\xf7','').replace('\xc3\xba','').replace('\xb6','').replace('\xa9','').replace('\xe2\x99\xaa','') text = str(text).replace('\xc3\xaf','').replace('\x5c','').replace('\xf1','').replace('\xe1','').replace('\xe7','').replace('\xfa','') text = str(text).replace('\xf3','').replace('\xed','').replace('\xe9','').replace('\xe0','').replace('\xae','').replace('\xc2','') text = str(text).replace('\xc3','').replace('\xa2','').replace('\xbf','') if text.isupper(): text = text.lower() # print text except IndexError: print line continue # G. Remove clearly wrong unicode characters -- BOM, NULL (only utf8 hex works) line = str(line).replace('\x00 ','').replace('\xef\xbf\xbd','') print line, # H. Parts of speech with MBSP -- resplit the text if needed try: pos = MBSP.chunk(text, tokenize=True, lemmata=True) for pos in pos.splitlines(): pos = str(pos).replace(' ','|') print "".join([field[0],"|",field[1],"|POS_01","|",pos]) except (UnicodeDecodeError, UnicodeEncodeError, IndexError, AssertionError): # Tag failed UTF-8 lines NA to enable repair print "".join([field[0],"|",field[1],"|POS_01","|NA"]) continue # I. Close the file fp.close() # EOF

Exemple #2

0

Afficher le fichier

Fichier : crf.py Projet : yuancz/scientific-summ

def _chunk_MBSP(self, txt): chunked = MBSP.chunk(txt) return unicode(chunked)