Python link_filterの例

プログラミング言語: Python

名前空間/パッケージ名: src.utils.util

メソッド/関数: link_filter

hotexamples.comのコード掲載数: 6

Python link_filter - 6件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのsrc.utils.util.link_filterの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

ファイル: yippy.py プロジェクト: semihyumusak/SpEnD-v3

    def parse(self, response):
        links = response.css("a.title::attr(href)").getall()

        Sparql.is_endpoint(util.link_filter(links),
                           first_crawl=Yippy.is_first_crawl)

        next_page = response.css("a.listnext::attr(href)").get()

        if next_page is not None:
            yield Request(response.urljoin(next_page), callback=self.parse)

コード例 #2

ファイルを表示

    def parse(self, response):

        links = response.css("a.ob::attr(href)").getall()

        Sparql.is_endpoint(util.link_filter(links),
                           first_crawl=Mojeek.is_first_crawl)

        next_page = response.css("div.pagination a::attr(href)").getall()[-1]

        if next_page is not None:
            yield Request(response.urljoin(next_page), callback=self.parse)

コード例 #3

ファイルを表示

ファイル: aol.py プロジェクト: semihyumusak/SpEnD-v3

    def parse(self, response):

        links = response.css(
            "a.ac-algo.fz-l.ac-21th.lh-24::attr(href)").getall()

        Sparql.is_endpoint(util.link_filter(links),
                           first_crawl=Aol.is_first_crawl)

        next_page = response.css("a.next::attr(href)").get()

        if next_page is not None:
            yield Request(response.urljoin(next_page), callback=self.parse)

コード例 #4

ファイルを表示

    def parse(self, response):

        links = response.css("div.b_title a.sh_favicon::attr(href)").getall()

        Sparql.is_endpoint(util.link_filter(links),
                           first_crawl=Bing.is_first_crawl)

        next_page = response.css(
            "a.sb_pagN.sb_pagN_bp.b_widePag.sb_bp::attr(href)").get()

        if next_page is not None:
            yield Request(response.urljoin(next_page), callback=self.parse)

コード例 #5

ファイルを表示

    def parse(self, response):

        links = response.css(
            "a.PartialSearchResults-item-title-link.result-link::attr(href)"
        ).getall()

        Sparql.is_endpoint(util.link_filter(links),
                           first_crawl=Ask.is_first_crawl)

        next_page = response.css(
            "li.PartialWebPagination-next a::attr(href)").get()

        if next_page is not None:
            yield Request(response.urljoin(next_page), callback=self.parse)

コード例 #6

ファイルを表示

ファイル: google.py プロジェクト: semihyumusak/SpEnD-v3

    def parse(self, response):

        links = response.css("div.kCrYT a::attr(href)").getall()

        Sparql.is_endpoint(util.link_filter(
            util.link_regulator_for_google(links)),
                           first_crawl=Google.is_first_crawl)

        next_page = response.css("a.nBDE1b.G5eFlf::attr(href)").get()

        if "start=10&" in response.url:
            next_page = response.css("a.nBDE1b.G5eFlf::attr(href)").getall()[1]

        if next_page is not None:
            # yield response.follow(next_page, callback=self.parse)
            yield Request(response.urljoin(next_page), callback=self.parse)