Python DataReader.get_wordsの例

プログラミング言語: Python

名前空間/パッケージ名: data_reader

クラス/型: DataReader

メソッド/関数: get_words

hotexamples.comのコード掲載数: 2

Python DataReader.get_words - 2件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのdata_reader.DataReader.get_wordsの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

DataReader(30)

get_all_data(13)

get_pandas_df(11)

data_generator(10)

get_vocab(8)

get_region_labels(6)

get_next_pic_id(5)

get_pic_qa(5)

createHostsFromFile(4)

generate_output_goals(4)

get_instance(4)

get_train_batch(4)

cnv_data_reader_pipeline(3)

combine_outcome_data(3)

createInstancesFromFile(3)

dequeue(3)

get_segmented_data(2)

get_gene_data(2)

get_table_int(2)

getInstance(2)

get_words(2)

getDataArray(2)

generate_score(2)

get_data(2)

__init__(2)

class_to_name(2)

generateXYPairs(1)

append_files(1)

batch_generator_train(1)

batch_generator_test(1)

get_sentence_coordinates(1)

get_sequences(1)

get_t(1)

get_table(1)

get_table_float(1)

_make_data(1)

get_testing_data(1)

get_pos(1)

get_training_data(1)

get_tsdata(1)

get_vocab_size(1)

GetNextBatch(1)

get_x_values(1)

parse(1)

parse_data(1)

get_probe_to_gene_table(1)

get_pic_data(1)

batcher(1)

get_description(1)

getAllData(1)

コード例 #1

ファイルを表示

ファイル: evaluate_model.py プロジェクト: HaukurPall/NLP-1-Project-HSS

def main():
  # First load the models as it was, that is loading it with the training sizes and vocab
  training_data = DataReader(training_data_filepath)

  vocab = training_data.vocab

  # Build a list of trigrams
  words = training_data.get_words()

  # Get the pretrained word vectors
  word_to_index, embed_dict = get_pretrained_word_indexes(pretrained_filepath)

  # Update word_to_index and vocabulary
  word_to_index, vocab = update_word_indexes_vocab(word_to_index, vocab)

  # Get the numpy matrix containing the pretrained word vectors
  # with randomly initialized unknown words from the corpus
  word_embeddings = get_embeddings_matrix(word_to_index, embed_dict, WORD_EMBEDDINGS_DIMENSION)

  model = NGramLanguageModeler(len(vocab), 50, CONTEXT_SIZE, word_embeddings)
  model.load_state_dict(torch.load("AWS_model.pt"))

  test_data = DataReader(test_data_filepath, read_limit=READ_LIMIT)

  evaluate_model(model, test_data, word_to_index)

コード例 #2

ファイルを表示

        self.embeddings = nn.Embedding(vocab_size, embedding_dim)
        self.embeddings.weight.data.copy_(torch.from_numpy(word_embeddings))
        self.linear1 = nn.Linear(context_size * embedding_dim, vocab_size)
        self.embeddings.weight.requires_grad = False # Do not train the pre-calculated embeddings

    def forward(self, inputs):
        embeds = self.embeddings(inputs).view((1, -1))
        out = F.tanh(self.linear1(embeds))
        log_probs = F.log_softmax(out)
        return log_probs

# Read corpus and compile the vocabulary
training_data = DataReader(training_data_filepath, read_limit=READ_LIMIT)
vocab = training_data.vocab
# Build a list of trigrams
words = training_data.get_words()
trigrams = extract_list_of_ngrams(words, CONTEXT_SIZE + 1)
# Get the pretrained word vectors
word_to_ix, embed_dict = get_pretrained_word_indexes(pretrained_filepath)
# Update word_to_ix and vocabulary
word_to_ix, vocab = update_word_indexes_vocab(word_to_ix, vocab)
# Get the numpy matrix containing the pretrained word vectors
# with randomly initialized unknown words from the corpus
word_embeddings = get_embeddings_matrix(word_to_ix, embed_dict, EMBEDDING_DIM)

losses = []
loss_function = nn.NLLLoss()
model = NGramLanguageModeler(len(vocab), EMBEDDING_DIM, CONTEXT_SIZE, word_embeddings)
optimizer = optim.SGD(filter(lambda p: p.requires_grad, model.parameters()), lr=LEARNING_RATE)

epoch_times = []