def test_embedding_vocab_extension_without_stored_namespace(self):
    vocab = Vocabulary()
    vocab.add_token_to_namespace("word1", "tokens_a")
    vocab.add_token_to_namespace("word2", "tokens_a")
    embedding_params = Params({"vocab_namespace": "tokens_a", "embedding_dim": 10})
    embedder = Embedding.from_params(embedding_params, vocab=vocab)

    # Previously saved models won't have a _vocab_namespace attribute; force it to None.
    embedder._vocab_namespace = None
    original_weight = embedder.weight

    # Two tokens plus the padding and OOV entries.
    assert original_weight.shape[0] == 4

    extension_counter = {"tokens_a": {"word3": 1}}
    vocab._extend(extension_counter)

    embedder.extend_vocab(vocab, "tokens_a")  # specified namespace

    extended_weight = embedder.weight
    assert extended_weight.shape[0] == 5
    assert torch.all(extended_weight[:4, :] == original_weight[:4, :])
def test_embedding_vocab_extension_with_default_namespace(self):
    vocab = Vocabulary()
    vocab.add_token_to_namespace("word1")
    vocab.add_token_to_namespace("word2")
    embedding_params = Params({"vocab_namespace": "tokens", "embedding_dim": 10})
    embedder = Embedding.from_params(embedding_params, vocab=vocab)
    original_weight = embedder.weight

    # Two tokens plus the padding and OOV entries.
    assert original_weight.shape[0] == 4

    extension_counter = {"tokens": {"word3": 1}}
    vocab._extend(extension_counter)

    embedder.extend_vocab(vocab)  # default namespace

    extended_weight = embedder.weight
    assert extended_weight.shape[0] == 5
    assert torch.all(extended_weight[:4, :] == original_weight[:4, :])
def test_embedding_vocab_extension_with_specified_namespace(self):
    vocab = Vocabulary()
    vocab.add_token_to_namespace("word1", "tokens_a")
    vocab.add_token_to_namespace("word2", "tokens_a")
    embedding_params = Params({"vocab_namespace": "tokens_a", "embedding_dim": 10})
    embedder = Embedding.from_params(embedding_params, vocab=vocab)
    original_weight = embedder.weight

    assert original_weight.shape[0] == 4

    extension_counter = {"tokens_a": {"word3": 1}}
    vocab._extend(extension_counter)

    embedder.extend_vocab(vocab, "tokens_a")  # specified namespace

    extended_weight = embedder.weight
    assert extended_weight.shape[0] == 5
    assert torch.all(extended_weight[:4, :] == original_weight[:4, :])
def add_env_tokens_to_vocab(vocab: Vocabulary,
                            actions: List[Union[str, int]] = None,
                            stack_states: Enum = None,
                            exec_states: Enum = None) -> Vocabulary:
    """Extend ``vocab`` with environment tokens.

    ``actions`` is a list of action names or ids; ``stack_states`` and
    ``exec_states`` are Enum classes whose member names become tokens.
    """
    # Normalize each optional input into a list of token strings.
    actions = [str(action) for action in actions] if actions else []
    stack_states = [state.name for state in stack_states] if stack_states else []
    exec_states = [state.name for state in exec_states] if exec_states else []

    # One namespace per token type, with a count of 1 per token.
    extra_vocab_counter = {
        "stack": OrderedDict({state: 1 for state in stack_states}),
        "exec": OrderedDict({state: 1 for state in exec_states}),
        "action": OrderedDict({action: 1 for action in actions}),
    }
    # Mark the new namespaces as non-padded so they get no padding/OOV entries.
    vocab._extend(extra_vocab_counter,
                  non_padded_namespaces=["stack", "exec", "action"])
    return vocab
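
# A minimal usage sketch for add_env_tokens_to_vocab. The _StackState enum and
# the action list below are hypothetical stand-ins for illustration only; they
# are not part of the real environment definitions.
class _StackState(Enum):
    EMPTY = 0
    FULL = 1

def test_add_env_tokens_to_vocab_sketch():
    vocab = add_env_tokens_to_vocab(Vocabulary(),
                                    actions=["PUSH", 0],
                                    stack_states=_StackState)
    # Non-padded namespaces hold exactly the added tokens, with no padding/OOV.
    assert vocab.get_vocab_size("action") == 2
    assert vocab.get_vocab_size("stack") == 2
    assert "EMPTY" in vocab.get_token_to_index_vocabulary("stack")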