Python url_to_DOM 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: util

메소드/함수: url_to_DOM

hotexamples.com에서의 예제들: 2

Python url_to_DOM - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 util.url_to_DOM에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: example.py 프로젝트: kingb/oracle-parse

def examples_to_nodes(field_list, page_url, text_filter_func=text_filter_strip_newline, user_agent=DEFAULT_USER_AGENT):
    """
    For each field, set the selected node.
    """
    root = url_to_DOM(page_url, user_agent)
    d = {}
    for field in field_list:
        target_nodes = [ node for node in root.iterdescendants() \
                        if text_filter_func(node.text_content()) == text_filter_func(field.example) ]

        d[field.name] = []
        for node in target_nodes:
            xpath = node_to_absolute_XPATH(node)
            record_count = len(root.findall(xpath))
            d[field.name].append((node, xpath, record_count))

    return d

예제 #2

파일 보기

파일: example.py 프로젝트: kingb/oracle-parse

def example_to_node(field, page_url, filter=False, disambiguation_method=take_last,
                    text_filter_func=text_filter_strip_newline):
    """
    This method takes in an ExampleField and finds the target node that contains that example.
    field: An ExampleField.
    page_url: The URL of the selected page.
    filter: If text_filter_func shold be applied to text before comparison.
    disambiguation_method: This should be a method that takes in a list and somehow decides which node to return.
    text_filter_func: Allows the user to supply a function for filtering text.
        default: util.text_filter_strip_newline (removes surrounding whitespace and replaces newlines with ''
    """
    root = url_to_DOM(page_url)
    if filter:
        target_nodes = [ node for node in root.iterdescendants() if text_filter_func(node.text_content()) == text_filter_func(field.example) ]
    else:
        target_nodes = [ node for node in root.iterdescendants() if node.text_content() == field.example ]
    if len(target_nodes) == 0:
        raise ValueError('Node containing the given example not found. Field(%s)' % (field.name))
    else:
        return disambiguation_method(target_nodes)