Python Vocabulary.add_special_symbol示例

编程语言: Python

命名空间/包名称: mltoolkit.mldp.utils.tools

类/类型: Vocabulary

方法/功能: add_special_symbol

hotexamples.com的示例: 2

Python Vocabulary.add_special_symbol - 已找到2个示例。这些是从开源项目中提取的最受好评的mltoolkit.mldp.utils.tools.Vocabulary.add_special_symbol现实Python示例。您可以评价示例，以帮助我们提高示例质量。

常用方法

显示隐藏

Vocabulary(13)

add_special_symbol(2)

示例#1

显示文件

文件： create_vocabulary.py 项目： yugaljain1999/Copycat-abstractive-opinion-summarizer

def create_vocabulary(vocab_fp, data_path, sep='\t'):
    """Creates a word vocabulary using a simple pipeline."""
    vocab_pipeline = assemble_vocab_pipeline(text_fname=InpDataF.REV_TEXT,
                                             sep=sep)
    words_vocab = Vocabulary(vocab_pipeline, name_prefix="words")
    # adding special symbols before creating vocab, so they would appear on top
    for st in VOCAB_DEFAULT_SYMBOLS:
        if st not in words_vocab:
            words_vocab.add_special_symbol(st)

    words_vocab.create(data_source={'data_path': data_path},
                       data_fnames=InpDataF.REV_TEXT)
    words_vocab.write(vocab_fp, sep=' ')

示例#2

显示文件

文件： run_workflow.py 项目： abrazinskas/Copycat-abstractive-opinion-summarizer

                          r=run_hp.c_r,
                          max_val=run_hp.c_kl_ann_max_val)
z_kl_ann = KlCycAnnealing(t=run_hp.z_kl_ann_batches,
                          m=run_hp.z_m,
                          r=run_hp.c_r,
                          max_val=run_hp.z_kl_ann_max_val)

#   PIPELINES AND VOCAB   #

vocab_pipeline = assemble_vocab_pipeline(text_fname=InpDataF.REV_TEXT)
word_vocab = Vocabulary(vocab_pipeline, name_prefix="word")

# adding special symbols before creating vocab, so they would appear on top
for st in VOCAB_DEFAULT_SYMBOLS:
    if st not in word_vocab:
        word_vocab.add_special_symbol(st)

word_vocab.load_or_create(run_hp.words_vocab_fp,
                          data_source=vocab_data_source,
                          max_size=model_hp.ext_vocab_size,
                          sep=' ',
                          data_fnames=InpDataF.REV_TEXT)

word_vocab.write(comb_paths(run_hp.output_path, "word_vocab.txt"), sep=' ')

train_pipeline = assemble_train_pipeline(
    word_vocab,
    max_groups_per_batch=run_hp.train_max_groups_per_batch,
    min_revs_per_group=run_hp.max_rev_per_group,
    max_revs_per_group=run_hp.max_rev_per_group,
    seed=None,