Python word_tokenize Examples

Programming language: Python

Namespace/Package: txttk.nlptools

Method/Function: word_tokenize

Examples at hotexamples.com: 9

Python word_tokenize - 9 examples found. These are the top-rated real-world examples of txttk.nlptools.word_tokenize extracted from open source projects.

Example #1

File: test_nlptools.py Project: jeroyang/txttk

def test_word_tokenzie(self):
    # Whitespace and punctuation come back as separate tokens;
    # the decimal "2.1" and the date "2013-11-11" stay whole.
    sentence = "A 2.1 cm tumor (right tongue) noted on 2013-11-11."
    wanted = [
        "A", " ", "2.1", " ", "cm", " ", "tumor", " ",
        "(", "right", " ", "tongue", ")", " ",
        "noted", " ", "on", " ", "2013-11-11", ".",
    ]
    self.assertEqual(list(nlptools.word_tokenize(sentence)), wanted)
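The expected token list in this test suggests a lossless tokenizer: nothing is discarded, spaces are tokens in their own right, and numbers or dates are kept in one piece. The following is a minimal sketch of how such behavior could be obtained with a regular expression. It is not the txttk implementation; the names `TOKEN_PATTERN` and `sketch_word_tokenize` are hypothetical, and the pattern is only an assumption that happens to reproduce the outputs shown in these tests.

```python
import re

# Hypothetical pattern, not taken from txttk. Alternatives are tried left to right.
TOKEN_PATTERN = re.compile(
    r"""
    [-+]?\d+(?:[.,\-]\d+)*   # signed numbers and dates kept whole
    | \w+                    # runs of letters, digits, underscores
    | \s                     # exactly one whitespace character per token
    | [^\w\s]                # any other single character (punctuation)
    """,
    re.VERBOSE,
)


def sketch_word_tokenize(text):
    """Yield tokens so that ''.join(tokens) == text (assumed behavior)."""
    for match in TOKEN_PATTERN.finditer(text):
        yield match.group(0)


print(list(sketch_word_tokenize("-999 1,234,000 3.1415")))
# ['-999', ' ', '1,234,000', ' ', '3.1415']
```

Because the last three alternatives together cover every possible character, the matches leave no gaps, which is what makes the round trip back to the original string possible.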

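Example #2

File: corpus.py Project: jeroyang/txttk

def normalize_sent(text):
    # normalize() and is_title() are defined elsewhere in corpus.py.
    output = []
    tokens = list(word_tokenize(text))
    if is_title(text):
        # Title-style text: normalize every token.
        for token in tokens:
            output.append(normalize(token))
    else:
        # Ordinary sentence: normalize only the first token.
        output.append(normalize(tokens[0]))
        output.extend(tokens[1:])
    # The tokens include whitespace, so a plain join rebuilds the text.
    return ''.join(output)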

Example #3

File: corpus.py Project: jeroyang/txttk

import string

def is_title(text):
    # stop_words is defined elsewhere in corpus.py.
    tokens = word_tokenize(text)
    bol_list = []
    for i, token in enumerate(tokens):
        if i == 0:
            # The first token always passes.
            bol_list.append(True)
        elif token.lower() in stop_words:
            # Stop words may stay lowercase in a title.
            bol_list.append(True)
        elif token[0] not in string.ascii_lowercase:
            # Capitalized words, digits, spaces, and punctuation pass.
            bol_list.append(True)
        else:
            bol_list.append(False)
    return all(bol_list)
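To make the rule in is_title easier to try out, here is a self-contained sketch of the same check. The stop-word set `STOP_WORDS` and the stand-in tokenizer `simple_tokenize` are assumptions introduced for illustration only; the real function relies on txttk's own word_tokenize and on a stop_words list defined elsewhere in corpus.py.

```python
import re
import string

# Hypothetical stop-word list; the real one lives elsewhere in corpus.py.
STOP_WORDS = {"a", "an", "and", "in", "of", "on", "the"}


def simple_tokenize(text):
    # Stand-in for word_tokenize: words, single whitespace chars, punctuation.
    return re.findall(r"\w+|\s|[^\w\s]", text)


def is_title(text):
    """True if every token after the first is a stop word or does not
    start with a lowercase ASCII letter."""
    tokens = simple_tokenize(text)
    return all(
        i == 0
        or token.lower() in STOP_WORDS
        or token[0] not in string.ascii_lowercase
        for i, token in enumerate(tokens)
    )


print(is_title("The Art of War"))  # True
print(is_title("The art of war"))  # False
```

Whitespace tokens pass the check because a space is not a lowercase letter, so only a lowercase non-stop word such as "art" makes the result False.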

Example #4

File: test_nlptools.py Project: jeroyang/txttk

def test_word_tokenize_intergration(self):
    # Round trip: joining the tokens must reproduce each sentence exactly.
    for sent in self.sentences:
        self.assertEqual(''.join(nlptools.word_tokenize(sent)), sent)
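The round-trip property checked by this test can be written as a small helper that accepts any tokenizer. This is a sketch under stated assumptions: `is_lossless`, `keep_everything`, and the sample sentences are hypothetical and are not part of txttk, whose test iterates over its own self.sentences.

```python
import re


def is_lossless(tokenize, sentences):
    """Return True if joining the tokens reproduces every sentence exactly."""
    return all("".join(tokenize(s)) == s for s in sentences)


def keep_everything(text):
    # Stand-in tokenizer: every character lands in some token.
    return re.findall(r"\w+|\s|[^\w\s]", text)


# Hypothetical sample data.
samples = ["A 2.1 cm tumor (right tongue).", "Two  spaces\tand a tab."]

print(is_lossless(keep_everything, samples))  # True
print(is_lossless(str.split, samples))        # False: split() drops whitespace
```

A tokenizer that discards whitespace, such as str.split, fails the check, which is exactly the kind of regression this style of test is meant to catch.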

Example #5

File: test_nlptools.py Project: jeroyang/txttk

def test_word_tokenzie2(self):
    # Signed integers, thousands separators, and decimals stay whole.
    sentence = '-999 1,234,000 3.1415'
    wanted = ['-999', ' ', '1,234,000', ' ', '3.1415']
    self.assertEqual(list(nlptools.word_tokenize(sentence)), wanted)

Example #9

File: test_nlptools.py Project: Wkryst/txttk

def test_word_tokenzie(self):
    # Each space and each parenthesis is its own token; "2.1" and "3.3" stay whole.
    sentence = 'A 2.1 x 3.3 cm tumor arising from the tongue base (right side) is noted.'
    wanted = [
        'A', ' ', '2.1', ' ', 'x', ' ', '3.3', ' ', 'cm', ' ', 'tumor', ' ',
        'arising', ' ', 'from', ' ', 'the', ' ', 'tongue', ' ', 'base', ' ',
        '(', 'right', ' ', 'side', ')', ' ', 'is', ' ', 'noted', '.',
    ]
    self.assertEqual(list(nlptools.word_tokenize(sentence)), wanted)