Example #1
def prepare_multiple_choice_answer(answers_json):
    """ Normalize answers from a given answer json in the usual VQA format. """
    multiple_choice_answers = [
        ans_dict['multiple_choice_answer'] for ans_dict in answers_json
    ]
    for answer in multiple_choice_answers:
        yield [process_punctuation(answer)]
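Every preparer in these examples calls a process_punctuation helper that is defined elsewhere in the source file. A minimal sketch of such a normalizer, modeled loosely on the VQA evaluation script's punctuation rules, follows; the regexes and exact behavior here are an assumption, not the original implementation.

import re

# Assumed sketch of the missing helper: strip most punctuation, but keep
# periods that act as decimal points and apostrophes inside words.
_PUNCT = re.compile(r'[;/\[\]"{}()=+\\_\-<>@`,?!]')
_PERIOD = re.compile(r'(?<!\d)\.(?!\d)')  # periods that are not decimal points

def process_punctuation(text):
    text = _PUNCT.sub('', text)
    text = _PERIOD.sub('', text)
    return re.sub(r'\s+', ' ', text).strip()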
Example #2
def prepare_v7w_answers(answers_json, decoys_json):
    """ Normalize Visual7W answers and their decoys into one choice list per question. """
    answers = []
    for ans, decoy in zip(answers_json, decoys_json):
        # The answer and decoy files must stay aligned question-for-question.
        assert ans['qa_id'] == decoy['qa_id'], \
            'inconsistent qa_id: {}, decoy_id: {}'.format(ans['qa_id'], decoy['qa_id'])
        answers.append(decoy['IoU_decoys'] + decoy['QoU_decoys'] + [ans['answer']])

    answers = [[_a.lower().strip('.') for _a in a] for a in answers]
    for answer in answers:
        yield [process_punctuation(a) for a in answer]
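A quick usage sketch for prepare_v7w_answers, with hand-written records standing in for the real Visual7W answer and decoy files (the field names follow the code above; the values are invented):

answers_json = [{'qa_id': 1, 'answer': 'A red car.'}]
decoys_json = [{'qa_id': 1,
                'IoU_decoys': ['A blue car.', 'A truck.'],
                'QoU_decoys': ['A red bus.']}]

for choices in prepare_v7w_answers(answers_json, decoys_json):
    print(choices)  # ['a blue car', 'a truck', 'a red bus', 'a red car']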
Example #3
def prepare_answers(answers_json):
    """ Normalize the per-question 'multiple_choices' lists from a given answer json. """
    answers = [[_a.lower().strip('.') for _a in a['multiple_choices']]
               for a in answers_json]
    for answer in answers:
        yield [process_punctuation(a) for a in answer]
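This variant reads a pre-built 'multiple_choices' list from each record rather than assembling it from decoys. For example (record invented):

answers_json = [{'multiple_choices': ['A cat.', 'A dog.']}]
print(next(prepare_answers(answers_json)))  # ['a cat', 'a dog']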
Example #4
import nltk

def prepare_questions(questions_json):
    """ Tokenize and normalize questions from a given question json in the usual VQA format. """
    questions = [q['question'] for q in questions_json]
    for question in questions:
        # Lower-case and drop the trailing question mark before tokenizing.
        question = question.lower()[:-1]
        yield nltk.word_tokenize(process_punctuation(question))
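nltk.word_tokenize requires the Punkt tokenizer models; run nltk.download('punkt') once if they are missing. A small usage sketch with an invented question record:

questions_json = [{'question': 'What color is the car?'}]
for tokens in prepare_questions(questions_json):
    print(tokens)  # ['what', 'color', 'is', 'the', 'car']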
Example #5
def prepare_choices(answers_json):
    """ Normalize decoys plus the ground-truth answer into one choice list per question. """
    answers = [
        a['IoU_decoys'] + a['QoU_decoys'] + [a['answer']] for a in answers_json
    ]
    for answer in answers:
        yield [process_punctuation(a.lower().strip('.')) for a in answer]
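This produces the same choice lists as Example #2, but assumes the decoys and the ground-truth answer already live in one record, so no qa_id consistency check is needed. For example (record invented):

answers_json = [{'IoU_decoys': ['A bird.'], 'QoU_decoys': ['A plane.'],
                 'answer': 'A kite.'}]
print(next(prepare_choices(answers_json)))  # ['a bird', 'a plane', 'a kite']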
Example #6
def prepare_answers(answers_json):
    """ Normalize the single ground-truth answer for each question. """
    answers = [a['answer'] for a in answers_json]
    for answer in answers:
        yield [process_punctuation(answer.lower().strip('.'))]
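Here each question contributes one ground-truth answer, still wrapped in a one-element list so the output shape matches the multi-choice preparers. For example (record invented):

answers_json = [{'answer': 'Two dogs.'}]
print(list(prepare_answers(answers_json)))  # [['two dogs']]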