def get_soup(url):
    """Fetch an Amazon page and return it as a parsed BeautifulSoup tree.

    The URL is made absolute against ``AMAZON_BASE_URL`` when needed, a
    short sleep throttles the request rate, and a random User-Agent drawn
    from ``HEADERS_LIST`` is sent to reduce the chance of bot detection.

    Args:
        url: Absolute Amazon URL, or a path to append to AMAZON_BASE_URL.

    Returns:
        BeautifulSoup tree of the response body (parsed with lxml).

    Raises:
        AssertionError: if the HTTP status code is not 200.
        BannedException: if the response contains a captcha page,
            i.e. the bot was detected.
    """
    if AMAZON_BASE_URL not in url:
        url = AMAZON_BASE_URL + url
    nap_time_sec = 1
    logging.debug(
        'Script is going to sleep for {} (Amazon throttling). ZZZzzzZZZzz.'.
        format(nap_time_sec))
    sleep(nap_time_sec)
    header = {'User-Agent': random.choice(HEADERS_LIST)}
    logging.debug('-> to Amazon : {}'.format(url))
    out = requests.get(url, headers=header)
    # Explicit check instead of a bare `assert`: asserts are stripped under
    # `python -O`, which would silently let non-200 responses through.
    # AssertionError is raised to stay backward-compatible with callers.
    if out.status_code != 200:
        raise AssertionError(
            'Unexpected HTTP status {} for {}'.format(out.status_code, url))
    soup = BeautifulSoup(out.content, 'lxml')
    if 'captcha' in str(soup):
        raise BannedException(
            'Your bot has been detected. Please wait a while.')
    return soup
def get_soup(url):
    """Fetch an Amazon page and return it as a parsed BeautifulSoup tree.

    The URL is made absolute against ``AMAZON_BASE_URL`` when needed, a
    short sleep throttles the request rate, and a fixed desktop-Chrome
    User-Agent is sent to reduce the chance of bot detection.

    Args:
        url: Absolute Amazon URL, or a path to append to AMAZON_BASE_URL.

    Returns:
        BeautifulSoup tree of the response body (parsed with html.parser).

    Raises:
        AssertionError: if the HTTP status code is not 200.
        BannedException: if the response contains a captcha page,
            i.e. the bot was detected.
    """
    if AMAZON_BASE_URL not in url:
        url = AMAZON_BASE_URL + url
    nap_time_sec = 1
    logging.debug(
        'Script is going to sleep for {} (Amazon throttling). ZZZzzzZZZzz.'.
        format(nap_time_sec))
    sleep(nap_time_sec)
    header = {
        'User-Agent':
        'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/43.0.2357.134 Safari/537.36'
    }
    logging.debug('-> to Amazon : {}'.format(url))
    out = requests.get(url, headers=header)
    # Explicit check instead of a bare `assert`: asserts are stripped under
    # `python -O`, which would silently let non-200 responses through.
    # AssertionError is raised to stay backward-compatible with callers.
    if out.status_code != 200:
        raise AssertionError(
            'Unexpected HTTP status {} for {}'.format(out.status_code, url))
    soup = BeautifulSoup(out.content, 'html.parser')
    if 'captcha' in str(soup):
        raise BannedException(
            'Your bot has been detected. Please wait a while.')
    return soup
def get_soup(url):
    """Fetch an Amazon page and return it as a parsed BeautifulSoup tree.

    The URL is made absolute against ``https://www.amazon.com`` when
    needed, a short sleep throttles the request rate, and a random
    User-Agent from ``fake_useragent.UserAgent`` is sent to reduce the
    chance of bot detection.

    Args:
        url: Absolute Amazon URL, or a path to append to the base URL.

    Returns:
        BeautifulSoup tree of the response body (parsed with html.parser).

    Raises:
        AssertionError: if the HTTP status code is not 200.
        BannedException: if the response contains a captcha page,
            i.e. the bot was detected.
    """
    if 'amazon.com' not in url:
        url = 'https://www.amazon.com' + url
    nap_time_sec = 1
    logging.debug(
        'Script is going to sleep for {} (Amazon throttling). ZZZzzzZZZzz.'.
        format(nap_time_sec))
    sleep(nap_time_sec)
    # NOTE(review): the original also built a proxied URL
    # (`proxy + urllib.parse.quote(url)`) but never used it — the request
    # below goes straight to `url`. The dead computation was removed; if
    # proxying was intended, `url_proxy` must actually be passed to
    # requests.get.
    ua = UserAgent()
    logging.debug('-> to Amazon : {}'.format(url))
    out = requests.get(url, headers={'user-agent': str(ua.random)})
    # Explicit check instead of a bare `assert`: asserts are stripped under
    # `python -O`, which would silently let non-200 responses through.
    # AssertionError is raised to stay backward-compatible with callers.
    if out.status_code != 200:
        raise AssertionError(
            'Unexpected HTTP status {} for {}'.format(out.status_code, url))
    soup = BeautifulSoup(out.content, 'html.parser')
    if 'captcha' in str(soup):
        raise BannedException(
            'Your bot has been detected. Please wait a while.')
    return soup