def _make_edge_dataset(vocab, input_prefix, output_prefix, lang, num_workers, output_text_file):
    """Binarize a graph-edge file and dump the resulting token tensors as JSON lines.

    Reads ``input_prefix`` (with ``.lang`` appended when *lang* is not None),
    binarizes every line with *vocab* via ``Binarizer.binarize_graph``, writes
    each resulting tensor to *output_text_file* as a JSON list (one per line),
    and prints summary statistics.

    Args:
        vocab: dictionary object providing ``__len__`` and ``unk_word``.
        input_prefix: path prefix of the raw input file.
        output_prefix: unused here; kept for interface parity with sibling
            ``_make_*_dataset`` helpers.  # NOTE(review): confirm callers
        lang: optional language suffix appended to the input path.
        num_workers: unused; binarization runs single-process in this variant.
        output_text_file: destination path for the JSON-lines dump.
    """
    print("| [{}] Dictionary: {} types".format(lang, len(vocab) - 1))
    n_seq_tok = [0, 0]
    replaced = Counter()

    def merge_result(worker_result):
        # Fold one worker's stats into the shared accumulators.
        replaced.update(worker_result["replaced"])
        n_seq_tok[0] += worker_result["nseq"]
        n_seq_tok[1] += worker_result["ntok"]

    input_file = "{}{}".format(input_prefix, ("." + lang) if lang is not None else "")
    ds = []
    merge_result(
        Binarizer.binarize_graph(input_file, vocab, lambda t: ds.append(t)))

    import json
    with open(output_text_file, 'w') as f:
        for line in ds:
            # Each item is a tensor; serialize its values as a JSON list.
            f.write(json.dumps(line.numpy().tolist()) + '\n')

    # Guard against an empty input file: the original code divided by
    # n_seq_tok[1] unconditionally and raised ZeroDivisionError on 0 tokens.
    pct_replaced = (100 * sum(replaced.values()) / n_seq_tok[1]) if n_seq_tok[1] else 0.0
    print("| [{}] {}: {} sents, {} tokens, {:.3}% replaced by {}".format(
        lang,
        input_file,
        n_seq_tok[0],
        n_seq_tok[1],
        pct_replaced,
        vocab.unk_word,
    ))
def binarize_graph(args, filename, vocab, output_prefix, lang, offset, end, append_eos=True):
    """Binarize the [offset, end) slice of *filename* into an indexed dataset shard.

    Builds the ``.bin`` file via the dataset builder selected by
    ``args.dataset_impl``, feeds every binarized tensor straight into it,
    then finalizes the matching ``.idx`` file.

    Returns:
        The stats dict produced by ``Binarizer.binarize_graph``.
    """
    bin_path = dataset_dest_file(args, output_prefix, lang, "bin")
    builder = indexed_dataset.make_builder(bin_path, impl=args.dataset_impl)
    # Pass the builder's bound add_item directly as the consumer callback.
    result = Binarizer.binarize_graph(
        filename,
        vocab,
        builder.add_item,
        append_eos=append_eos,
        offset=offset,
        end=end,
    )
    builder.finalize(dataset_dest_file(args, output_prefix, lang, "idx"))
    return result