def load_embeddings(self, src_embeddings, tgt_embeddings, vocabulary: Vocabulary,
                    embedding_dim: int = 300):
    """Initialize encoder/decoder embedding matrices from pretrained aligned vectors.

    Each vocabulary index is looked up in the embedding model matching its
    language ("src" or "tgt"), trying the exact word first and then its
    lowercase form. Words not found keep a small random initialization.

    :param src_embeddings: gensim-style model exposing a ``wv`` mapping for source words
    :param tgt_embeddings: gensim-style model exposing a ``wv`` mapping for target words
    :param vocabulary: joint vocabulary with per-index word and language lookup
    :param embedding_dim: dimensionality of the pretrained vectors (default 300,
        matching the previous hard-coded value)
    """
    # Small random fallback (randn / 10) so missing words get a mild init
    # that pretrained vectors clearly dominate.
    aligned_embeddings = torch.div(torch.randn(vocabulary.size(), embedding_dim), 10)
    # One lookup table instead of duplicated src/tgt branches.
    vectors_by_language = {"src": src_embeddings.wv, "tgt": tgt_embeddings.wv}
    found_count = 0
    for i in range(len(vocabulary.index2word)):
        word = vocabulary.get_word(i)
        vectors = vectors_by_language.get(vocabulary.get_language(i))
        if vectors is None:
            # Special tokens / unknown language: keep the random init.
            continue
        # Exact-case match takes priority over the lowercase fallback.
        if word in vectors:
            aligned_embeddings[i] = torch.FloatTensor(vectors[word])
            found_count += 1
        elif word.lower() in vectors:
            aligned_embeddings[i] = torch.FloatTensor(vectors[word.lower()])
            found_count += 1
    # Lazy %-style args: the message is only formatted if INFO is enabled.
    logger.info("Embeddings filled: %s of %s", found_count, vocabulary.size())
    # Preserve the current trainability setting of the embedding layer.
    enable_training = self.encoder.embedding.weight.requires_grad
    # NOTE: both parameters wrap the SAME tensor, so encoder and decoder
    # embeddings share storage (tied weights) — intentional, as in the
    # original implementation.
    self.encoder.embedding.weight = nn.Parameter(
        aligned_embeddings, requires_grad=enable_training)
    self.decoder.embedding.weight = nn.Parameter(
        aligned_embeddings, requires_grad=enable_training)
def __init__(self, vocabulary: Vocabulary, use_cuda):
    """Build a summed NLL loss criterion that ignores padding positions.

    :param vocabulary: joint vocabulary providing the class count and the
        pad-token indices for both languages
    :param use_cuda: when True, move the per-class weights to the GPU
    """
    # Weight 1 for every class, 0 for both pad tokens so padding
    # positions contribute nothing to the loss.
    weight = torch.ones(vocabulary.size())
    weight[vocabulary.get_pad("src")] = 0
    weight[vocabulary.get_pad("tgt")] = 0
    weight = weight.cuda() if use_cuda else weight
    # reduction="sum" is the modern equivalent of the removed
    # size_average=False argument (summed, not averaged, loss).
    self.criterion = nn.NLLLoss(weight, reduction="sum")