Python generate_tf_record_from_json_file示例

编程语言: Python

命名空间/包名称: official.nlp.data.squad_lib_sp

方法/功能: generate_tf_record_from_json_file

hotexamples.com的示例: 2

Python generate_tf_record_from_json_file - 已找到2个示例。这些是从开源项目中提取的最受好评的official.nlp.data.squad_lib_sp.generate_tf_record_from_json_file现实Python示例。您可以评价示例，以帮助我们提高示例质量。

示例#1

显示文件

文件： create_finetuning_data.py 项目： ykate1998/models

def generate_squad_dataset():
    """Generates squad training dataset and returns input meta data."""
    assert FLAGS.squad_data_file
    if FLAGS.tokenization == "WordPiece":
        return squad_lib_wp.generate_tf_record_from_json_file(
            input_file_path=FLAGS.squad_data_file,
            vocab_file_path=FLAGS.vocab_file,
            output_path=FLAGS.train_data_output_path,
            translated_input_folder=FLAGS.translated_squad_data_folder,
            max_seq_length=FLAGS.max_seq_length,
            do_lower_case=FLAGS.do_lower_case,
            max_query_length=FLAGS.max_query_length,
            doc_stride=FLAGS.doc_stride,
            version_2_with_negative=FLAGS.version_2_with_negative,
            xlnet_format=FLAGS.xlnet_format)
    else:
        assert FLAGS.tokenization == "SentencePiece"
        return squad_lib_sp.generate_tf_record_from_json_file(
            input_file_path=FLAGS.squad_data_file,
            sp_model_file=FLAGS.sp_model_file,
            output_path=FLAGS.train_data_output_path,
            translated_input_folder=FLAGS.translated_squad_data_folder,
            max_seq_length=FLAGS.max_seq_length,
            do_lower_case=FLAGS.do_lower_case,
            max_query_length=FLAGS.max_query_length,
            doc_stride=FLAGS.doc_stride,
            xlnet_format=FLAGS.xlnet_format,
            version_2_with_negative=FLAGS.version_2_with_negative)

示例#2

显示文件

def generate_squad_dataset():
  """Generates squad training dataset and returns input meta data."""
  assert FLAGS.squad_data_file
  if FLAGS.tokenizer_impl == "word_piece":
    return squad_lib_wp.generate_tf_record_from_json_file(
        FLAGS.squad_data_file, FLAGS.vocab_file, FLAGS.train_data_output_path,
        FLAGS.max_seq_length, FLAGS.do_lower_case, FLAGS.max_query_length,
        FLAGS.doc_stride, FLAGS.version_2_with_negative)
  else:
    assert FLAGS.tokenizer_impl == "sentence_piece"
    return squad_lib_sp.generate_tf_record_from_json_file(
        FLAGS.squad_data_file, FLAGS.sp_model_file,
        FLAGS.train_data_output_path, FLAGS.max_seq_length, FLAGS.do_lower_case,
        FLAGS.max_query_length, FLAGS.doc_stride, FLAGS.version_2_with_negative)