Example #1
    @classmethod
    def setup_task(cls, args, **kwargs):
        """Setup the task."""
        # Build a synthetic dictionary of `args.dict_size` dummy word types.
        dictionary = Dictionary()
        for i in range(args.dict_size):
            dictionary.add_symbol("word{}".format(i))
        logger.info("dictionary: {} types".format(len(dictionary)))

        # Give the position limits headroom beyond the raw dummy sequence lengths.
        args.max_source_positions = args.src_len + dictionary.pad() + 2
        args.max_target_positions = args.tgt_len + dictionary.pad() + 2

        return cls(args, dictionary)
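A minimal usage sketch for this classmethod, assuming it lives on a benchmark task class (called DummyMTTask here for illustration) and that args carries the dict_size, src_len, and tgt_len fields it reads; none of these names are confirmed by the snippet itself.

# Hedged sketch: DummyMTTask and the argparse fields are assumptions
# inferred from the method above, not names confirmed on this page.
import argparse

args = argparse.Namespace(dict_size=1000, src_len=30, tgt_len=30)
task = DummyMTTask.setup_task(args)
print(args.max_source_positions, args.max_target_positions)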
Example #2
import logging

import torch

from fairseq.data import Dictionary
from fairseq.tasks import FairseqTask

# DummyLMConfig and DummyDataset are defined alongside this task in fairseq's
# benchmark code and are not shown in this snippet.

logger = logging.getLogger(__name__)


class DummyLMTask(FairseqTask):
    def __init__(self, cfg: DummyLMConfig):
        super().__init__(cfg)

        # load dictionary
        self.dictionary = Dictionary()
        for i in range(cfg.dict_size):
            self.dictionary.add_symbol("word{}".format(i))
        self.dictionary.pad_to_multiple_(8)  # often faster if divisible by 8
        logger.info("dictionary: {} types".format(len(self.dictionary)))

        # Dummy token ids start just above the pad index so they never collide
        # with the special symbols; the target is the input shifted by one.
        seq = torch.arange(cfg.tokens_per_sample + 1) + self.dictionary.pad() + 1

        self.dummy_src = seq[:-1]
        self.dummy_tgt = seq[1:]

    def load_dataset(self, split, epoch=1, combine=False, **kwargs):
        """Load a given dataset split.
        Args:
            split (str): name of the split (e.g., train, valid, test)
        """
        # Use the configured batch size, or derive one from the token budget.
        if self.cfg.batch_size is not None:
            bsz = self.cfg.batch_size
        else:
            bsz = max(1, self.cfg.max_tokens // self.cfg.tokens_per_sample)
        self.datasets[split] = DummyDataset(
            {
                "id": 1,
                "net_input": {
                    "src_tokens": torch.stack([self.dummy_src for _ in range(bsz)]),
                    "src_lengths": torch.full(
                        (bsz,), self.cfg.tokens_per_sample, dtype=torch.long
                    ),
                },
                "target": torch.stack([self.dummy_tgt for _ in range(bsz)]),
                "nsentences": bsz,
                "ntokens": bsz * self.cfg.tokens_per_sample,
            },
            num_items=self.cfg.dataset_size,
            item_size=self.cfg.tokens_per_sample,
        )

    @property
    def source_dictionary(self):
        return self.dictionary

    @property
    def target_dictionary(self):
        return self.dictionary
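A minimal usage sketch for the task above, assuming DummyLMConfig accepts the fields the task reads (dict_size, tokens_per_sample, batch_size, dataset_size) as keyword arguments; the values are illustrative only.

# Hedged sketch: the DummyLMConfig keyword arguments below are assumptions
# based on the fields read in __init__ and load_dataset, not a confirmed API.
cfg = DummyLMConfig(
    dict_size=50000,
    tokens_per_sample=512,
    batch_size=8,
    dataset_size=100,
)
task = DummyLMTask(cfg)
task.load_dataset("train")
print(len(task.source_dictionary))   # dictionary size, padded to a multiple of 8
print(task.datasets["train"])        # the DummyDataset built in load_dataset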