# Download the Afghanistan country page and print the absolute URL of
# its national-flag image (the src attribute inside the flag table row).
from lxml.html import fromstring

from advanced_link_crawler import download

page_html = download(
    'http://example.webscraping.com/places/default/view/Afghanistan-1')
doc = fromstring(page_html)
# First @src under the flag row's value cell is the flag image path.
flag_src = doc.xpath(
    '//tr[@id="places_national_flag__row"]/td[@class="w2p_fw"]//@src')[0]
print('http://example.webscraping.com' + flag_src)
# Benchmark four scraper implementations (regex, BeautifulSoup, lxml,
# lxml+XPath) against the cached Singapore page and report wall-clock time.
import time
import re
from all_scrapers import re_scraper, bs_scraper, \
    lxml_scraper, lxml_xpath_scraper
from advanced_link_crawler import download

NUM_ITERATIONS = 1000  # number of times to test each scraper

html = download('http://example.webscraping.com/places/default/view/Singapore-203')

SCRAPERS = (
    ('Regular expressions', re_scraper),
    ('BeautifulSoup', bs_scraper),
    ('Lxml', lxml_scraper),
    ('Xpath', lxml_xpath_scraper),
)

for name, scrape in SCRAPERS:
    # record start time of scrape
    start = time.time()
    for _ in range(NUM_ITERATIONS):
        if scrape is re_scraper:
            # flush re's internal pattern cache so the regex scraper
            # pays its compilation cost on every iteration
            re.purge()
        result = scrape(html)
        # check scraped result is as expected
        assert result['area'] == '692 square kilometres'
    # record end time of scrape and output the total
    elapsed = time.time() - start
    print('%s: %.2f seconds' % (name, elapsed))
# Time each scraper implementation over many repetitions of the
# Afghanistan page and print a per-scraper total in seconds.
import time
import re
from all_scrapers import re_scraper, bs_scraper, \
    lxml_scraper, lxml_xpath_scraper
from advanced_link_crawler import download

NUM_ITERATIONS = 1000  # number of times to test each scraper

html = download(
    'http://example.webscraping.com/places/default/view/Afghanistan-1')

scraper_table = [
    ('Regular expressions', re_scraper),
    ('BeautifulSoup', bs_scraper),
    ('Lxml', lxml_scraper),
    ('Xpath', lxml_xpath_scraper),
]

for label, scrape_fn in scraper_table:
    # record start time of scrape
    started_at = time.time()
    for _ in range(NUM_ITERATIONS):
        if scrape_fn is re_scraper:
            # drop re's cached compiled patterns so the regex scraper
            # is timed without cross-iteration caching benefits
            re.purge()
        record = scrape_fn(html)
        # check scraped result is as expected
        assert record['area'] == '647,500 square kilometres'
    # record end time of scrape and output the total
    finished_at = time.time()
    print('%s: %.2f seconds' % (label, finished_at - started_at))
def fetch_youtube_url(watch_id):
    """Download the YouTube watch page for the given video id.

    Falls back to a default sample video when *watch_id* is empty.
    The original check was ``watch_id == ""``, which let ``None`` (or
    any other falsy non-string) through and produced a broken URL;
    ``not watch_id`` handles both cases while keeping the same
    behavior for the empty-string input.

    Args:
        watch_id: YouTube video id, e.g. ``'0uUoqD8a0V4'``; empty/None
            selects the default video.

    Returns:
        Whatever ``download`` returns for the watch-page URL
        (presumably the page HTML — confirm against the crawler module).
    """
    if not watch_id:
        watch_id = '0uUoqD8a0V4'  # default demo video
    url = "https://www.youtube.com/watch?v=" + watch_id
    return download(url)