Python native_to_unicode示例

编程语言: Python

命名空间/包名称: official.nlp.transformer.utils.tokenizer

方法/功能: native_to_unicode

hotexamples.com的示例: 2

Python native_to_unicode - 已找到2个示例。这些是从开源项目中提取的最受好评的official.nlp.transformer.utils.tokenizer.native_to_unicode现实Python示例。您可以评价示例，以帮助我们提高示例质量。

示例#1

显示文件

def bleu_wrapper(ref_filename, hyp_filename, case_sensitive=False):
  """Compute BLEU for two files (reference and hypothesis translation)."""
  ref_lines = tokenizer.native_to_unicode(
      tf.io.gfile.GFile(ref_filename).read()).strip().splitlines()
  hyp_lines = tokenizer.native_to_unicode(
      tf.io.gfile.GFile(hyp_filename).read()).strip().splitlines()
  return bleu_on_list(ref_lines, hyp_lines, case_sensitive)

示例#2

显示文件

文件： compute_bleu.py 项目： coding-geek1711/Object_Detection_Microcontrollers

def bleu_wrapper(ref_filename, hyp_filename, case_sensitive=False):
    """Compute BLEU for two files (reference and hypothesis translation)."""
    ref_lines = tokenizer.native_to_unicode(
        tf.io.gfile.GFile(ref_filename).read()).strip().splitlines()
    hyp_lines = tokenizer.native_to_unicode(
        tf.io.gfile.GFile(hyp_filename).read()).strip().splitlines()

    if len(ref_lines) != len(hyp_lines):
        raise ValueError(
            "Reference and translation files have different number of "
            "lines. If training only a few steps (100-200), the "
            "translation may be empty.")
    if not case_sensitive:
        ref_lines = [x.lower() for x in ref_lines]
        hyp_lines = [x.lower() for x in hyp_lines]
    ref_tokens = [bleu_tokenize(x) for x in ref_lines]
    hyp_tokens = [bleu_tokenize(x) for x in hyp_lines]
    return metrics.compute_bleu(ref_tokens, hyp_tokens) * 100