# Preprocessing pipeline: parse the raw train/test files to JSON, build and
# prune vocabularies from the training split, numericalize both splits, and
# persist the vocabs (pickle) plus numericalized data (JSON) to disk.
nlp = English()

# Parse raw files into JSON-serializable records.
# FIX: json.dump writes str, so these files must be opened in text mode
# ('w'); the original 'wb' raises TypeError under Python 3.
with open('train.json', 'w') as f:
    json.dump(parse_file(train_file, nlp), f, indent=2)
with open('test.json', 'w') as f:
    json.dump(parse_file(test_file, nlp), f, indent=2)

logging.info('starting numericalization')

word_vocab = SennaVocab()
rel_vocab = Vocab()

with open('train.json') as f:
    train = json.load(f)
with open('test.json') as f:
    test = json.load(f)

# First pass over TRAIN ONLY with add=True: populates both vocabularies.
numericalize(train, word_vocab, rel_vocab, add=True)
# Drop words seen fewer than 2 times, then order both vocabs by frequency.
word_vocab = word_vocab.prune_rares(cutoff=2)
word_vocab = word_vocab.sort_by_decreasing_count()
rel_vocab = rel_vocab.sort_by_decreasing_count()

# Second pass with add=False: map tokens to indices using the frozen vocabs
# (unknown test-set tokens are handled by the vocab, not added).
train = numericalize(train, word_vocab, rel_vocab, add=False)
test = numericalize(test, word_vocab, rel_vocab, add=False)

# Pickle requires binary mode; JSON requires text mode.
with open('vocab.pkl', 'wb') as f:
    pkl.dump({'word': word_vocab, 'rel': rel_vocab}, f)
with open('trainXY.json', 'w') as f:
    json.dump(train, f, indent=2)
with open('testXY.json', 'w') as f:
    json.dump(test, f, indent=2)