Python read_vocabularyの例

プログラミング言語: Python

名前空間/パッケージ名: subwordnmt.apply_bpe

メソッド/関数: read_vocabulary

hotexamples.comのコード掲載数: 2

Python read_vocabulary - 2件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのsubwordnmt.apply_bpe.read_vocabularyの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

ファイル: generate_paraphrases.py プロジェクト: adymaharana/BayesAugment

    net.eval()

    # load parse generator network
    parse_args = parse_model['config_args']
    parse_net = ParseNet(parse_args.d_nt, parse_args.d_hid, len(parse_gen_voc))
    if args.gpu >= 0:
        parse_net.cuda()
    parse_net.load_state_dict(parse_model['state_dict'])
    parse_net.eval()

    # encode templates
    template_lens = [len(x.split()) for x in templates]
    np_templates = np.zeros((len(templates), max(template_lens)), dtype='int32')
    for z, template in enumerate(templates):
        np_templates[z, :template_lens[z]] = [parse_gen_voc[w] for w in templates[z].split()]
    if args.gpu >= 0:
        tp_templates = Variable(torch.from_numpy(np_templates).long().cuda())
        tp_template_lens = torch.from_numpy(np.array(template_lens, dtype='int32')).long().cuda()
    else:
        tp_templates = Variable(torch.from_numpy(np_templates).long())
        tp_template_lens = torch.from_numpy(np.array(template_lens, dtype='int32')).long()

    # instantiate BPE segmenter
    bpe_codes = codecs.open(args.bpe_codes, encoding='utf-8')
    bpe_vocab = codecs.open(args.bpe_vocab, encoding='utf-8')
    bpe_vocab = read_vocabulary(bpe_vocab, args.bpe_vocab_thresh)
    bpe = BPE(bpe_codes, '@@', bpe_vocab, None)

    # paraphrase the sst!
    encode_data(out_file=args.out_file)

コード例 #2

ファイルを表示

ファイル: generate_paraphrases.py プロジェクト: karansingla06/ICE-NER-NLP

parse_net.eval()
# encode templates
template_lens = [len(x.split()) for x in templates]
np_templates = np.zeros((len(templates), max(template_lens)), dtype='int32')
for z, template in enumerate(templates):
    np_templates[z, :template_lens[z]] = [
        parse_gen_voc[w] for w in templates[z].split()
    ]
tp_templates = Variable(torch.from_numpy(np_templates).long().cuda())
tp_template_lens = torch.from_numpy(np.array(template_lens,
                                             dtype='int32')).long().cuda()

# instantiate BPE segmenter
bpe_codes = codecs.open(bpe_codes, encoding='utf-8')
bpe_vocab = codecs.open(bpe_vocab, encoding='utf-8')
bpe_vocab = read_vocabulary(bpe_vocab, bpe_vocab_thresh)
bpe = BPE(bpe_codes, '@@', bpe_vocab, None)


def reverse_bpe(sent):
    x = []
    cache = ''

    for w in sent:
        if w.endswith('@@'):
            cache += w.replace('@@', '')
        elif cache != '':
            x.append(cache + w)
            cache = ''
        else:
            x.append(w)