def run():
    """Run the three Lianjia spiders (listings, opening info, comments) in one process.

    Settings are passed to the ``CrawlerProcess`` constructor: assigning
    ``process.settings`` after construction is too late for process-level
    configuration (logging, shutdown handling), which is applied in
    ``__init__``.
    """
    process = CrawlerProcess(get_project_settings())
    # process.crawl schedules a spider to run when the process starts.
    process.crawl(LianjiaLoupanSpider)    # Lianjia property listings
    process.crawl(LianjiaInfoSpider)      # Lianjia property opening info
    process.crawl(LianjiaCommentSpider)   # Lianjia property comments
    # Start all spiders scheduled above; blocks until they all finish.
    process.start()
def get_spiders():
    """Return a dict mapping short spider names to spider instances.

    Only spiders whose module path matches
    ``openrecipes.spiders.<name>_spider`` are included, keyed by ``<name>``;
    modules containing ``_feedspider`` are skipped.

    NOTE(review): ``crawler.spiders`` / ``crawler.configure()`` are legacy
    Scrapy APIs (removed in modern Scrapy) — confirm the pinned version.
    """
    settings = get_project_settings()
    # Settings are applied at construction; re-assigning crawler.settings
    # afterwards (as the old code did) was redundant.
    crawler = CrawlerProcess(settings)
    crawler.configure()
    # Hoist the pattern out of the loop so it is compiled once.
    name_pattern = re.compile(r"openrecipes\.spiders\.([a-zA-Z0-9]+)_spider")
    spiders = {}
    for spname in crawler.spiders.list():
        spider = crawler.spiders.create(spname)
        module_name = spider.__module__
        if "_feedspider" in module_name:  # skip feed-based spiders
            continue
        match_obj = name_pattern.match(module_name)
        if match_obj:
            spiders[match_obj.group(1)] = spider
    return spiders
def get_spiders():
    """Return a dict of spiders keyed by short name.

    A spider is included when its module path matches
    ``openrecipes.spiders.<name>_spider``; feed spiders (modules containing
    ``_feedspider``) are excluded. The key is the captured ``<name>``.

    NOTE(review): ``crawler.spiders`` / ``crawler.configure()`` are legacy
    Scrapy APIs (removed in modern Scrapy) — confirm the pinned version.
    """
    settings = get_project_settings()
    # Settings are applied in the constructor; the old post-construction
    # ``crawler.settings = settings`` assignment was redundant.
    crawler = CrawlerProcess(settings)
    crawler.configure()
    # Compile the module-name pattern once instead of on every iteration.
    module_re = re.compile(r'openrecipes\.spiders\.([a-zA-Z0-9]+)_spider')
    spiders = {}
    for spname in crawler.spiders.list():
        spider = crawler.spiders.create(spname)
        module_name = spider.__module__
        if '_feedspider' in module_name:  # exclude feed-based spiders
            continue
        match_obj = module_re.match(module_name)
        if match_obj:
            spiders[match_obj.group(1)] = spider
    return spiders
def run():
    """Run the Fallenark spider in a single crawler process.

    Settings are passed to the ``CrawlerProcess`` constructor: assigning
    ``process.settings`` after construction misses process-level
    configuration (logging, shutdown handling) applied in ``__init__``.
    """
    process = CrawlerProcess(get_project_settings())
    # process.crawl(ForumSpider)  # forum spider (currently disabled)
    process.crawl(FallenarkSpider)  # fallenark spider
    # Blocks until crawling finishes.
    process.start()