Python PageElement.find examples

Programming language: Python

Namespace/package name: bs4

Class/type: PageElement

Method/function: find

Examples at hotexamples.com: 4

Python PageElement.find - 4 examples found. These are top-rated real-world Python examples of bs4.PageElement.find, extracted from open source projects.
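Before the project excerpts, a minimal sketch of what find() does may help. The markup below is hypothetical and exists only for illustration. Note that the excerpts annotate their arguments as PageElement; in practice the objects passed in are Tag (or BeautifulSoup) instances, which is where this search method is available.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, used only for illustration.
html = """
<div class="hd"><ul><li><a href="/a">First</a></li></ul></div>
<div class="bd"><p>Body text</p></div>
"""

soup = BeautifulSoup(html, "html.parser")

# find() returns the first matching element in document order.
first_div = soup.find("div")

# Keyword filters narrow the match; class_ avoids the reserved word "class".
body_div = soup.find("div", class_="bd")

# When nothing matches, find() returns None rather than raising.
missing = soup.find("table")

first_link = soup.find("a")
print(first_div["class"], body_div.p.text, missing, first_link["href"])
```

Because find() can return None, chained access such as soup.find("table").find("tr") is worth guarding when the markup is not under your control.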

Frequently used methods

new_tag(8)

find_all(8)

find_next(5)

find(4)

select(4)

replaceWith(3)

findAllNext(2)

get(2)

new_string(1)

strip(1)

Example #1

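from bs4 import PageElement


def post2list(ele: PageElement):
    """Collect section headers and their child posts into a list of dicts."""
    post_list = []

    # Calling a tag, as in tag('li'), is shorthand for tag.find_all('li').
    headers = ele.find('div', class_='hd')('li')
    for header in headers:
        post_list.append({
            'name': header.a.text,
            'link': header.a['href'],
            'children': []
        })

    # The i-th <ul> in the body is assumed to belong to the i-th header.
    uls = ele.find('div', class_='bd')('ul')
    for i in range(len(post_list)):
        for li in uls[i]('li'):
            last_link = li('a')[-1]
            post_list[i]['children'].append({
                # Remove all whitespace from the link text.
                'name': ''.join(last_link.text.split()),
                'link': last_link['href'],
                # The presence of an <img> is used as the 'new' flag.
                'new': bool(li.img),
                'date': li.span.text if li.span else ''
            })

    return post_list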
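The excerpt above relies on calling a tag like a function. A short sketch, on hypothetical markup, shows that this shorthand returns the same result as an explicit find_all() call:

```python
from bs4 import BeautifulSoup

# Hypothetical markup shaped like the 'hd' block in the excerpt.
html = (
    '<div class="hd"><ul>'
    '<li><a href="/x">X</a></li>'
    '<li><a href="/y">Y</a></li>'
    '</ul></div>'
)
soup = BeautifulSoup(html, "html.parser")
hd = soup.find("div", class_="hd")

# Calling the tag is equivalent to calling find_all() on it.
shortcut = hd("li")
explicit = hd.find_all("li")

names = [li.a.text for li in explicit]
links = [li.a["href"] for li in explicit]
print(names, links)
```

The explicit find_all() form is usually easier to read for people who do not know the shorthand, at the cost of a few extra characters.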

Example #2

from typing import Dict
from urllib.parse import urlparse, urlunparse

from bs4 import PageElement


def parse_video_block(video_block: PageElement) -> Dict:
    video_object = {}
    video_title_el = video_block.find("h3")
    video_object["video_title"] = str(video_title_el.string) if video_title_el else None
    # A multi-word class_ string is matched against the exact value of the class attribute.
    video_link_el = video_block.find(class_="btn-link video-sources video-download-button")
    video_object["video_link"] = video_link_el["href"] if video_link_el else None
    transcript_link_el = video_block.select(".wrapper-download-transcripts a")
    # Collect into a set first so duplicate transcript URLs are dropped.
    transcript_urls = set()
    for srt_link in transcript_link_el:
        u = urlparse(srt_link["href"])
        # Fill in a missing scheme or host so the URL becomes absolute.
        if not u.scheme:
            u = u._replace(scheme='https')
        if not u.netloc:
            u = u._replace(netloc='courses.edx.org')
        transcript_urls.add(urlunparse(u))
    video_object["transcript_link"] = list(transcript_urls)
    return video_object
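The URL handling in this excerpt can be tried on its own. The sketch below pulls it into a helper named absolutize, which is a hypothetical name and not part of the original project; the default host is the one hard-coded in the excerpt.

```python
from urllib.parse import urlparse, urlunparse


def absolutize(url, default_scheme="https", default_netloc="courses.edx.org"):
    """Fill in a missing scheme and/or host (hypothetical helper)."""
    parts = urlparse(url)
    if not parts.scheme:
        parts = parts._replace(scheme=default_scheme)
    if not parts.netloc:
        parts = parts._replace(netloc=default_netloc)
    return urlunparse(parts)


# Host-relative path: both scheme and host are filled in.
relative = absolutize("/static/a.srt")
# Protocol-relative URL: only the scheme is filled in.
protocol_relative = absolutize("//cdn.example.com/a.srt")
# Already absolute: returned unchanged.
absolute = absolutize("https://example.org/a.srt")
print(relative, protocol_relative, absolute)
```

For links that should resolve against the page they were found on, urllib.parse.urljoin with the page URL as the base is a common alternative to hard-coding a host.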

Example #3

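from typing import List, Tuple

import bs4


def get_html_table_header_and_rows(
        table: bs4.PageElement) -> Tuple[List, List]:
    """
    Return the header texts and the row cells of an HTML table as two lists.
    """
    header = []
    rows = []
    table_header = table.find("tr")
    table_rows = table.find_all("tr")[1:]
    # Iterate over the cells only; iterating the <tr> directly would also
    # yield the whitespace text nodes between cells.
    for cell in table_header.find_all(['th', 'td']):
        header.append(cell.get_text())

    for table_row in table_rows:
        # Each row is kept as a list of cell elements, not cell text.
        rows.append(list(table_row.find_all(['th', 'td'])))

    return header, rows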
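When reading a header row, it matters whether the row is iterated directly or searched for cells. A small sketch on a hypothetical, pretty-printed table shows the difference: direct iteration yields every child node, including the whitespace between cells, while find_all() yields only the cells.

```python
from bs4 import BeautifulSoup

# Hypothetical table with indentation, as produced by most HTML formatters.
html = """
<table>
  <tr>
    <th>Name</th>
    <th>Count</th>
  </tr>
  <tr>
    <td>a</td>
    <td>1</td>
  </tr>
</table>
"""
soup = BeautifulSoup(html, "html.parser")
table = soup.find("table")
header_row = table.find("tr")

# Direct iteration includes the whitespace-only text nodes between cells.
children = list(header_row)

# find_all() with a list of names returns only the <th>/<td> elements.
cells = header_row.find_all(["th", "td"])
header = [cell.get_text(strip=True) for cell in cells]

rows = [
    [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
    for tr in table.find_all("tr")[1:]
]
print(header, rows)
```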

Example #4

File: scrapper.py Project: ChuckBorris33/koronaslovakia

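from bs4 import PageElement


def get_element_with_comment(container: PageElement,
                             comment: str) -> PageElement:
    # _find_comment is a helper defined elsewhere in the project (not shown here).
    # string= is the current name of the older text= argument.
    return container.find(
        string=lambda t: _find_comment(t, comment)).find_parent()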
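The excerpt does not include the project's _find_comment helper, so the sketch below substitutes a hypothetical predicate, is_matching_comment, to show how an HTML comment can be located and its enclosing element returned. The markup and the comment text are also assumptions made for illustration.

```python
from bs4 import BeautifulSoup, Comment


def is_matching_comment(node, comment):
    """Hypothetical stand-in for the project's _find_comment helper."""
    # The callable may also be handed None or plain text, so check the type.
    return isinstance(node, Comment) and comment in node


# Hypothetical markup: a comment marks the section of interest.
html = "<div><section><!-- stats-block --><p>42</p></section></div>"
soup = BeautifulSoup(html, "html.parser")

# A callable passed as string= is evaluated against string nodes.
marker = soup.find(string=lambda s: is_matching_comment(s, "stats-block"))

# find_parent() with no arguments returns the immediately enclosing tag.
parent = marker.find_parent()
print(parent.name, parent.p.text)
```

If no comment matches, find() returns None and the chained find_parent() call raises AttributeError, so a None check may be worth adding in production code.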