Python Document.construct 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: common.document

클래스/타입: Document

메소드/함수: construct

hotexamples.com에서의 예제들: 2

Python Document.construct - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 common.document.Document.construct에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

Document(7)

parse_from_tokens(3)

construct(2)

num_words(2)

decrease_topic(1)

get_words(1)

increase_topic(1)

parse_from_string(1)

set_emails(1)

set_hash_tags(1)

set_links(1)

set_sentences(1)

set_tokens(1)

예제 #1

파일 보기

def read_corenlp_doc(filename, verbose=True):
    if verbose:
        log.info('Reading CoreNLP document from {}'.format(filename))

    input_xml = smart_file_handler(filename)

    xml_parser = etree.XMLParser(target=CoreNLPTarget())
    sents, corefs = etree.parse(input_xml, xml_parser)
    doc_name = splitext(basename(filename))[0]
    doc = Document.construct(doc_name, sents, corefs)

    input_xml.close()

    return doc

예제 #2

파일 보기

def read_doc_from_ontonotes(coref_doc, name_doc, verbose=True):
    doc_id = coref_doc.document_id.split('@')[0]
    assert doc_id == name_doc.document_id.split('@')[0], \
        '{} and {} do not have the same document_id'.format(coref_doc, name_doc)

    if verbose:
        log.info('Reading ontonotes document {}'.format(doc_id))

    conll_file_path = join(ontonotes_annotations_source, doc_id + '.depparse')

    all_sents = read_conll_depparse(conll_file_path)

    all_corefs = read_coref_doc(coref_doc)

    doc_name = doc_id.split('/')[-1]
    doc = Document.construct(doc_name, all_sents, all_corefs)

    for name_entity in read_name_doc(name_doc):
        add_name_entity_to_doc(doc, name_entity)

    return doc