Example no. 1
import random

import nltk
import twokenize  # ARK Twitter tokenizer, as used in the original snippet


def train(file):
    """Train a Naive Bayes classifier on single-token features from a labelled tweet CSV."""
    features_list = []

    with open(file) as f:
        for line in f:
            line = line.rstrip().lower()
            # Each line is a comma-separated annotation record for one tweet.
            HITID, tweet, W1, A1, W2, A2, Agmt, label, date = line.split(',')
            tokens = twokenize.tokenize(tweet)
            if "no_agreement" in label: continue
            print(label)
            # Feature extraction: one {'token': word} feature per usable token.
            # ok_word() is a project helper assumed to be defined elsewhere.
            for token in tokens:
                if not ok_word(token): continue
                features_list.append(({'token': token}, label))

    # Shuffle and train once, after every line has been read.
    random.shuffle(features_list)
    classifier = nltk.NaiveBayesClassifier.train(features_list)

    classifier.show_most_informative_features(100)
    # Accuracy on the training data itself, as in the original snippet.
    print(nltk.classify.accuracy(classifier, features_list))
    return classifier
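A minimal usage sketch follows; the CSV path is hypothetical (the original never names its input file), and twokenize and ok_word are assumed to be importable from the surrounding project:

# Hypothetical path to the labelled Romney tweet CSV; adjust to the real corpus.
classifier = train('romney_tweets.csv')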
Example no. 2
from collections import Counter


def test(file):
    """Label each tweet by a majority vote of per-token guesses and report accuracy."""
    total_guesses = 0.0
    total_correct = 0.0

    with open(file) as f:
        for line in f:
            # Lowercase to match the preprocessing used during training.
            line = line.rstrip().lower()
            HITID, tweet, W1, A1, W2, A2, Agmt, label, date = line.split(',')
            tokens = twokenize.tokenize(tweet)
            if "no_agreement" in label: continue
            # Each usable token casts one vote for the label the classifier guesses.
            # 'classifier' is assumed to be the trained module-level classifier from train().
            votes = Counter()
            for token in tokens:
                if not ok_word(token): continue
                feature = {'token': token}
                guess = classifier.classify(feature)
                votes[guess] += 1
            if not votes: continue  # no usable tokens in this tweet
            [(pred_label, count)] = votes.most_common(1)
            if pred_label == label: total_correct += 1.0
            total_guesses += 1.0

    print(total_guesses, total_correct)
    print(total_correct / total_guesses)
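A hedged end-to-end sketch, assuming train() has already produced the module-level classifier that test() reads; both file names below are hypothetical:

classifier = train('romney_tweets_train.csv')   # hypothetical training split
test('romney_tweets_heldout.csv')               # hypothetical held-out split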