def main():
    # First rebuild the model exactly as it was trained, using the training data's sizes and vocabulary
    training_data = DataReader(training_data_filepath)

    vocab = training_data.vocab

    # Read the list of words from the corpus
    words = training_data.get_words()

    # Get the pretrained word vectors
    word_to_index, embed_dict = get_pretrained_word_indexes(pretrained_filepath)

    # Update word_to_index and vocabulary
    word_to_index, vocab = update_word_indexes_vocab(word_to_index, vocab)

    # Get the numpy matrix containing the pretrained word vectors,
    # with randomly initialized vectors for corpus words missing from the pretrained set
    word_embeddings = get_embeddings_matrix(word_to_index, embed_dict, WORD_EMBEDDINGS_DIMENSION)

    model = NGramLanguageModeler(len(vocab), WORD_EMBEDDINGS_DIMENSION, CONTEXT_SIZE, word_embeddings)
    model.load_state_dict(torch.load("AWS_model.pt"))

    test_data = DataReader(test_data_filepath, read_limit=READ_LIMIT)

    evaluate_model(model, test_data, word_to_index)
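
# evaluate_model is defined elsewhere in this project; a minimal sketch of one plausible
# implementation, assuming it extracts n-grams from the test data the same way training
# does and reports the average negative log-likelihood plus perplexity (an assumption,
# not the author's code):
import math

def evaluate_model_sketch(model, test_data, word_to_index):
    trigrams = extract_list_of_ngrams(test_data.get_words(), CONTEXT_SIZE + 1)
    loss_function = nn.NLLLoss()
    total_loss = 0.0
    model.eval()
    with torch.no_grad():
        for context, target in trigrams:
            context_idxs = torch.tensor([word_to_index[w] for w in context], dtype=torch.long)
            log_probs = model(context_idxs)
            total_loss += loss_function(log_probs, torch.tensor([word_to_index[target]])).item()
    avg_nll = total_loss / len(trigrams)
    print("Average NLL: %.4f, perplexity: %.2f" % (avg_nll, math.exp(avg_nll)))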
Example n. 2
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim


class NGramLanguageModeler(nn.Module):
    def __init__(self, vocab_size, embedding_dim, context_size, word_embeddings):
        super(NGramLanguageModeler, self).__init__()
        self.embeddings = nn.Embedding(vocab_size, embedding_dim)
        self.embeddings.weight.data.copy_(torch.from_numpy(word_embeddings))
        self.linear1 = nn.Linear(context_size * embedding_dim, vocab_size)
        self.embeddings.weight.requires_grad = False  # Do not train the pre-calculated embeddings

    def forward(self, inputs):
        embeds = self.embeddings(inputs).view((1, -1))
        out = torch.tanh(self.linear1(embeds))
        log_probs = F.log_softmax(out, dim=1)
        return log_probs
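
# Quick shape check (illustrative only, not part of the original example; the toy sizes
# below are assumptions): view((1, -1)) flattens the context embeddings into a single
# row so that linear1, which expects context_size * embedding_dim inputs, can consume it.
import numpy as np

_demo_model = NGramLanguageModeler(vocab_size=10, embedding_dim=50, context_size=2,
                                   word_embeddings=np.zeros((10, 50), dtype=np.float32))
_demo_out = _demo_model(torch.tensor([1, 2], dtype=torch.long))
print(_demo_out.shape)  # torch.Size([1, 10]) -- one row of log-probabilities over the vocabulary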

# Read corpus and compile the vocabulary
training_data = DataReader(training_data_filepath, read_limit=READ_LIMIT)
vocab = training_data.vocab
# Build a list of trigrams
words = training_data.get_words()
trigrams = extract_list_of_ngrams(words, CONTEXT_SIZE + 1)
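
# extract_list_of_ngrams is defined elsewhere in this project; a plausible sketch of its
# behavior, assuming it returns (context, target) pairs where the context holds the
# n-1 preceding words:
def extract_list_of_ngrams_sketch(words, n):
    # For ["the", "quick", "brown", "fox"] and n == 3 this yields
    # (["the", "quick"], "brown") and (["quick", "brown"], "fox").
    return [(words[i:i + n - 1], words[i + n - 1]) for i in range(len(words) - n + 1)]
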
# Get the pretrained word vectors
word_to_ix, embed_dict = get_pretrained_word_indexes(pretrained_filepath)
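
# get_pretrained_word_indexes is also project code; a sketch under the assumption that
# the pretrained file is in GloVe-style text format, one "word v1 v2 ... vN" per line:
def get_pretrained_word_indexes_sketch(filepath):
    word_to_ix, embed_dict = {}, {}
    with open(filepath, encoding="utf-8") as f:
        for i, line in enumerate(f):
            word, *values = line.rstrip().split(" ")
            word_to_ix[word] = i
            embed_dict[word] = np.array(values, dtype=np.float32)
    return word_to_ix, embed_dict
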
# Update word_to_ix and vocabulary
word_to_ix, vocab = update_word_indexes_vocab(word_to_ix, vocab)
# Get the numpy matrix containing the pretrained word vectors
# with randomly initialized unknown words from the corpus
word_embeddings = get_embeddings_matrix(word_to_ix, embed_dict, EMBEDDING_DIM)
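
# get_embeddings_matrix is project code as well; a sketch matching the comment above:
# pretrained vectors where available, small random vectors for corpus words missing
# from the pretrained set.
def get_embeddings_matrix_sketch(word_to_ix, embed_dict, embedding_dim):
    matrix = np.random.uniform(-0.1, 0.1, (len(word_to_ix), embedding_dim)).astype(np.float32)
    for word, ix in word_to_ix.items():
        if word in embed_dict:
            matrix[ix] = embed_dict[word]
    return matrix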

losses = []
loss_function = nn.NLLLoss()
model = NGramLanguageModeler(len(vocab), EMBEDDING_DIM, CONTEXT_SIZE, word_embeddings)
optimizer = optim.SGD(filter(lambda p: p.requires_grad, model.parameters()), lr=LEARNING_RATE)

epoch_times = []
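
# The training loop itself is not part of this excerpt; a sketch of how the pieces set up
# above are typically wired together (NUM_EPOCHS and the per-epoch timing are assumptions,
# not the author's code):
import time

for epoch in range(NUM_EPOCHS):
    start = time.time()
    total_loss = 0.0
    for context, target in trigrams:
        # Look up the indexes of the context words and run a forward pass
        context_idxs = torch.tensor([word_to_ix[w] for w in context], dtype=torch.long)
        model.zero_grad()
        log_probs = model(context_idxs)
        loss = loss_function(log_probs, torch.tensor([word_to_ix[target]], dtype=torch.long))
        loss.backward()
        optimizer.step()
        total_loss += loss.item()
    losses.append(total_loss)
    epoch_times.append(time.time() - start)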