# NOTE: this section assumes the surrounding module defines (or imports)
# read_vocabulary, tokenize_files, RNNExtended, SkipGram and the constants
# MAX_VOCAB_SIZE, HIDDEN_LAYER_SIZE, WINDOW_SIZE, NUM_ITER, MAX_SENTENCES.
import itertools
from timeit import default_timer as timer


def testRNN(vocabulary_file, training_dir):
    print("Reading vocabulary " + vocabulary_file + "...")
    words, dictionary = read_vocabulary(vocabulary_file, MAX_VOCAB_SIZE)

    print("Reading sentences and training RNN...")
    start = timer()
    rnn = RNNExtended(len(words), HIDDEN_LAYER_SIZE)
    num_words = 0
    for i in range(NUM_ITER):
        sentences = tokenize_files(dictionary, training_dir)
        for sentence in itertools.islice(sentences, MAX_SENTENCES):
            # TODO: create a context window for each sentence?
            rnn.train(sentence)
            num_words += len(sentence)
        print("Iteration " + str(i + 1) + "/" + str(NUM_ITER) +
              " finished (" + str(num_words) + " words)")
        num_words = 0
    print("- Took %.2f sec" % (timer() - start))
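# A minimal sketch of the context windowing hinted at by the TODO above: it
# yields a fixed-size slice of word indices around each position of a
# sentence, clipped at the sentence boundaries. The helper name
# (context_windows) and its exact behaviour are assumptions for illustration,
# not part of the original code.
def context_windows(sentence, window_size):
    """Yield a window of up to 2 * window_size + 1 indices per position."""
    for pos in range(len(sentence)):
        lo = max(0, pos - window_size)
        hi = min(len(sentence), pos + window_size + 1)
        yield sentence[lo:hi]

# Hypothetical usage inside the training loop:
#     for window in context_windows(sentence, WINDOW_SIZE):
#         rnn.train(window)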
def testSkipGram(vocabulary_file, training_dir):
    print("Reading vocabulary " + vocabulary_file + "...")
    words, dictionary = read_vocabulary(vocabulary_file, MAX_VOCAB_SIZE)

    print("Reading sentences and training SkipGram...")
    start = timer()
    skip_gram = SkipGram(len(words), WINDOW_SIZE, HIDDEN_LAYER_SIZE)
    num_words = 0
    last_sentence = None
    for i in range(NUM_ITER):
        sentences = tokenize_files(dictionary, training_dir)
        for sentence in itertools.islice(sentences, MAX_SENTENCES):
            last_sentence = sentence
            skip_gram.train(sentence)
            num_words += len(sentence)
        # Re-feed the last trained sentence to get a log-likelihood estimate
        # (assumes the training directory yields at least one sentence).
        ll = skip_gram.train(last_sentence, compute_ll=True)
        print("Iteration " + str(i + 1) + "/" + str(NUM_ITER) +
              " finished (" + str(num_words) + " words)")
        print("Log-likelihood: " + str(ll))
        num_words = 0
    print("- Took %.2f sec" % (timer() - start))
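# A minimal sketch of how these smoke tests might be invoked from the command
# line. The default paths ("vocab.txt", "data/") are assumptions for
# illustration, not part of the original code.
if __name__ == "__main__":
    import sys

    vocabulary_file = sys.argv[1] if len(sys.argv) > 1 else "vocab.txt"
    training_dir = sys.argv[2] if len(sys.argv) > 2 else "data/"
    testRNN(vocabulary_file, training_dir)
    testSkipGram(vocabulary_file, training_dir)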