Python setup_configの例

プログラミング言語: Python

名前空間/パッケージ名: pywebcopy.config

メソッド/関数: setup_config

hotexamples.comのコード掲載数: 6

Python setup_config - 6件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのpywebcopy.config.setup_configの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

ファイル: hypertag.py プロジェクト: mszarski/HyperTag

 def scrape(self, url, folder, timeout=1):
     config.setup_config(url, folder)
     wp = WebPage()
     wp.get(url)
     # start the saving process
     wp.save_complete()
     # join the sub threads
     for t in wp._threads:
         if t.is_alive():
             t.join(timeout)
     # location of the html file written
     return wp.file_path

コード例 #2

ファイルを表示

ファイル: site_cloner.py プロジェクト: NourEddineX/k8-website-mass-file-download

    def scrape(url, folder, timeout=1):
        config.setup_config(url, folder)

        wp = WebPage()
        wp.get(url)

        wp.save_complete()
        for t in wp._threads:
            if t.is_alive():
                t.join(timeout)

        return wp.file_path

コード例 #3

ファイルを表示

ファイル: crawl.py プロジェクト: nazmussaif/web_crawler

    def crawl(self, project_name):
        kwargs = {
            'project_url': 'https://www.thedailystar.net/',
            'project_folder': 'thedailystar',
            'project_name': project_name,
            'bypass_robots': False,
            'load_css': False,
            'load_images': False,
            'load_javascript': False,
            'over_write': True
        }
        config.setup_config(**kwargs)

        wp = Crawler()
        wp.crawl()

コード例 #4

ファイルを表示

def crawl(url, folder, timeout=1):

    config.setup_config(url, folder)

    cr = Crawler()
    cr.get(url)

    # start the saving process
    cr.crawl()

    # join the sub threads
    for t in cr._threads:
        if t.is_alive():
            t.join(timeout)

    # location of the html file written
    return cr.file_path

コード例 #5

ファイルを表示

#Does Not Work with Wlvpn.com does not download images and .js and .css files
from pywebcopy import WebPage, config
config.setup_config('https://wlvpn.com/', "e:\\Upwork", "Upp")
wp = WebPage()
wp.get('https://wlvpn.com/')
wp.save_complete()

コード例 #6

ファイルを表示

from pywebcopy import Crawler, config

kwargs = {
    'zip_project_folder':
    False,
    'allowed_file_ext': [
        '.html', '.php', '.asp', '.aspx', '.htm', '.xhtml', '.css', '.json',
        '.js', '.xml', '.svg', '.gif', '.ico', '.jpeg', '.pdf', '.jpg', '.png',
        '.ttf', '.eot', '.otf', '.woff', '.woff2', '.pwcf'
    ]
}

config.setup_config(project_url='https://rednoise.org/teaching/wdm/',
                    project_folder='./downloads2',
                    project_name='wdm')

crawler = Crawler()
crawler.crawl()

#allowed_file_ext=['.html', '.php', '.asp', '.aspx', '.htm', '.xhtml', '.css', '.json', '.js', '.xml', '.svg', '.gif', '.ico', '.jpeg', '.pdf', '.jpg', '.png', '.ttf', '.eot', '.otf', '.woff', '.woff2', '.pwcf'],
#'over_write':True,  <- does not worked properly