Python wordSentenceData 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: lm_data

메소드/함수: wordSentenceData

hotexamples.com에서의 예제들: 2

Python wordSentenceData - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 lm_data.wordSentenceData에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: main.py 프로젝트: wuxiangli91/tf-lm

            print('Sentence-level data')

            if 'rescore' in config and 'bidirectional' in config:
                raise NotImplementedError(
                    "Rescoring with a bidirectional model is not (yet) implemented."
                )
            elif 'predict_next' in config and 'bidirectional' in config:
                raise NotImplementedError(
                    "Predicting the next word with a bidirectional model is not (yet) implemented."
                )
            elif 'debug2' in config and 'bidirectional' in config:
                raise NotImplementedError(
                    "Generating a debug2 file with a bidirectional model is not (yet) implemented."
                )

            data = lm_data.wordSentenceData(config, eval_config, TRAIN, VALID,
                                            TEST)
            all_data, vocab_size, total_length, seq_lengths = data.get_data()

            # set num_steps = total length of each (padded) sentence
            config['num_steps'] = total_length

            print('Write max length of sentence to {0}max_length'.format(
                config['save_path']))

            # write maximum sentence length to file
            max_length_f = io.open('{0}max_length'.format(config['save_path']),
                                   'w')
            max_length_f.write(u'{0}\n'.format(total_length))
            max_length_f.close()

    # rescoring with non-sentence-level LMs: prepare data sentence-level

예제 #2

파일 보기

파일: main.py 프로젝트: flovera1/tf-languagemodel

    # word-level training, on sentence level (sentences are padded until maximum sentence length)
    elif 'per_sentence' in config:

        if 'rescore' in config:
            max_length = int(
                open('{0}max_length'.format(
                    config['trained_model'])).readlines()[0].strip())
            # set num_steps = total length of each (padded) sentence
            config['num_steps'] = max_length

            data = lm_data.wordSentenceDataRescore(config, eval_config)
            all_data, vocab_size, _ = data.get_data()

        else:
            data = lm_data.wordSentenceData(config, eval_config)
            all_data, vocab_size, total_length, seq_lengths = data.get_data()

            # set num_steps = total length of each (padded) sentence
            config['num_steps'] = total_length

            print('Write max length of sentence to {0}max_length'.format(
                config['save_path']))

            # write maximum sentence length to file
            max_length_f = open('{0}max_length'.format(config['save_path']),
                                'w')
            max_length_f.write('{0}\n'.format(total_length))
            max_length_f.close()

    # rescoring with non-sentence-level LMs