Python Document.parse Examples

Programming language: Python

Namespace/package name: readability.readability

Class/type: Document

Method/function: parse

Examples at hotexamples.com: 2

Python Document.parse - 2 examples found. These are real-world Python examples of readability.readability.Document.parse extracted from open source projects.

Example #1

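import urllib.request

from readability.readability import Document


def get_summary(url):
    # Download the raw HTML for the page.
    html = urllib.request.urlopen(url).read()
    doc = Document(html)
    # Call parse() with the names of the fields that are read below.
    doc.parse(["summary", "short_title"])
    readable_article = doc.summary()  # cleaned article HTML
    readable_title = doc.short_title()  # shortened page title
    return readable_article, readable_title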
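
The example above reads the response without a timeout and never closes the connection explicitly. A minimal sketch of a more defensive download step follows. The helper name `fetch_html` and its `opener` parameter are assumptions made for illustration and do not come from readability or from the original project.

```python
import urllib.request


def fetch_html(url, timeout=10, opener=urllib.request.urlopen):
    """Return the raw response body for url as bytes.

    fetch_html and its opener parameter are hypothetical names used for
    illustration; they are not part of readability or the original example.
    """
    # The context manager closes the connection even if read() raises.
    with opener(url, timeout=timeout) as response:
        return response.read()
```

The returned bytes could then be passed to `Document(html)` in the same way as above. Making the opener a parameter is only a convenience so the function can be exercised without network access.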

Example #2

File: scraper.py Project: za419/reddit-news

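import urllib.request

from bs4 import BeautifulSoup
from readability.readability import Document


def scrape(URL):
    """
    Return the text of the article found at URL.
    Some whitespace changes will usually occur.
    """
    html = urllib.request.urlopen(URL).read()
    doc = Document(html)
    doc.parse(["summary", "short_title"])
    # summary() yields the cleaned article HTML; strip the tags to get plain text.
    readable_article = doc.summary()
    soup = BeautifulSoup(readable_article, 'html.parser')
    text = soup.get_text()
    return text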
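
The docstring above notes that whitespace changes usually occur. If a caller wants a predictable form, one option is to collapse whitespace after the tags are stripped. The sketch below uses only the standard library so that it runs without third-party packages. `html_to_text` and `_TextCollector` are hypothetical names, and the parser here is a stand-in for `BeautifulSoup.get_text()`, not a description of how the original project behaves.

```python
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collect text nodes and ignore tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def html_to_text(html):
    """Return the text of an HTML fragment with whitespace runs collapsed."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    # split() with no arguments drops leading/trailing whitespace and
    # treats any run of spaces, tabs, or newlines as a single separator.
    return " ".join("".join(collector.chunks).split())


print(html_to_text("<p>Hello,\n   world</p>"))  # prints: Hello, world
```

This sketch keeps the contents of `script` and `style` elements, so it suits already cleaned article HTML better than raw pages.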