def initialize(self, emb_path, vocab_path):
    """Initialize the lookup-table rows from pre-trained word embeddings.

    For every row index i of self.emb_matrix, looks up the corresponding
    vocabulary word and, if the embedding reader has a vector for it,
    overwrites that row with the pre-trained vector. Rows for words not
    covered by the embeddings file are left untouched.

    Args:
        emb_path: path to the pre-trained word-embeddings file.
        vocab_path: path to the vocabulary file (id -> word mapping).
    """
    L.info('Initializing lookup table')
    vm = VocabManager(vocab_path)
    w2v = W2VEmbReader(emb_path)
    U.xassert(w2v.get_emb_dim() == self.emb_matrix.shape[1], 'The embeddings dimension does not match with the given word embeddings')
    for i in range(self.emb_matrix.shape[0]):
        vec = w2v.get_emb_given_word(vm.get_word_given_id(i))
        # BUGFIX: was `if vec:`. Truth-testing is wrong here — it raises
        # ValueError if the reader returns a numpy array, and it silently
        # skips a legitimate all-zero embedding. Only a missing word
        # (presumably signalled by None — confirm against W2VEmbReader)
        # should be skipped.
        if vec is not None:
            self.emb_matrix[i] = vec
def initialize(self, emb_path, vocab_path):
    """Load pre-trained word embeddings into this lookup table.

    Each row of self.emb_matrix whose vocabulary word has a pre-trained
    vector is replaced by that vector; all other rows keep their current
    (random) initialization.

    Args:
        emb_path: path to the pre-trained word-embeddings file.
        vocab_path: path to the vocabulary file (id -> word mapping).
    """
    L.info('Initializing lookup table')
    vocab = VocabManager(vocab_path)
    reader = W2VEmbReader(emb_path)
    U.xassert(
        reader.get_emb_dim() == self.emb_matrix.shape[1],
        'The embeddings dimension does not match with the given word embeddings'
    )
    for row in range(self.emb_matrix.shape[0]):
        vec = reader.get_emb_given_word(vocab.get_word_given_id(row))
        # BUGFIX: replaced `if vec:` with an explicit None check.
        # `if vec:` raises ValueError for a numpy-array return and would
        # also drop a valid all-zero vector; only a missing embedding
        # (None return — TODO confirm against W2VEmbReader) is skipped.
        if vec is not None:
            self.emb_matrix[row] = vec
#
from dlm.io.ngramsReader import NgramsReader
from dlm.io.vocabReader import VocabManager

# Build the test set of n-grams and the vocabulary used to map between
# word ids and surface forms.
testset = NgramsReader(dataset_path=args.input_path, ngram_size=classifier.ngram_size, vocab_path=args.vocab_path)
vocab = VocabManager(args.vocab_path)

## Loading restricted vocab
# If a restricted vocabulary file is given, classification is limited to
# the ids of the words it lists (one word per line).
restricted_ids = []
restricted_vocab = []
if args.restricted_vocab_path:
    with open(args.restricted_vocab_path) as restricted_vocab_file:
        # Idiom: build the word list with a comprehension instead of an
        # explicit append loop.
        restricted_vocab = [line.strip() for line in restricted_vocab_file]
    restricted_ids = vocab.get_ids_given_word_list(restricted_vocab)

#########################
## Compiling theano function
#

evaluator = eval.Evaluator(testset, classifier)

# Classify every sample and write the predicted word, one per line.
# NOTE: xrange — this file targets Python 2.
if args.output_path:
    with open(args.output_path, "w") as output:
        for i in xrange(testset._get_num_samples()):
            out = evaluator.get_class(i, restricted_ids)
            output.write(vocab.get_word_given_id(out) + "\n")
#
from dlm.io.ngramsReader import NgramsReader
from dlm.io.vocabReader import VocabManager

# Test n-grams plus the id<->word vocabulary for decoding predictions.
testset = NgramsReader(dataset_path=args.input_path, ngram_size=classifier.ngram_size, vocab_path=args.vocab_path)
vocab = VocabManager(args.vocab_path)

## Loading restricted vocab
# Optionally restrict classification to the ids of the words listed
# (one per line) in the restricted-vocab file.
restricted_ids = []
restricted_vocab = []
if args.restricted_vocab_path:
    with open(args.restricted_vocab_path) as restricted_vocab_file:
        # Idiom: comprehension replaces the manual append loop.
        restricted_vocab = [line.strip() for line in restricted_vocab_file]
    restricted_ids = vocab.get_ids_given_word_list(restricted_vocab)

#########################
## Compiling theano function
#

evaluator = eval.Evaluator(testset, classifier)

# Emit the predicted word for every test sample, one per line.
# NOTE: xrange — Python 2 codebase.
if args.output_path:
    with open(args.output_path, 'w') as output:
        for i in xrange(testset._get_num_samples()):
            out = evaluator.get_class(i, restricted_ids)
            output.write(vocab.get_word_given_id(out) + '\n')