def parse(self, response):
    """Parse a listing page: yield a detail-page Request per article,
    a summary dict of all items, and a Request for the next page.

    Fixes vs. original: skips articles with no href (Request(url=None)
    raises ValueError and aborted the whole callback), drops the debug
    print, and removes the redundant str() on next_page.
    """
    items = []
    for news in response.css('article.entry-item'):
        item = GeneralItem()
        item['news_headline'] = news.css(
            'h6.entry-title a ::text').extract_first()
        item['datetime'] = "not in use"
        news_url = news.css(
            'h6.entry-title a ::attr(href)').extract_first()
        item['link'] = news_url
        # Guard: only follow entries that actually have a link.
        if news_url is not None:
            r = Request(url=news_url, callback=self.parse_1)
            r.meta['item'] = item
            yield r
        items.append(item)
    yield {"newsInDetails": items}
    next_page = response.css(
        'div.pagination.clearfix ul.page-numbers.clearfix li a.last.page-numbers ::attr(href)'
    ).extract_first()
    if next_page is not None:
        yield scrapy.Request(next_page, callback=self.parse)
def parse(self, response):
    """Parse a GameSpot listing page: yield a detail-page Request per
    article, a summary dict of all items, and a next-page Request.

    BUG FIX: the original concatenated the base URL with
    extract_first() BEFORE checking for None, so a missing next-page
    link raised TypeError and the `is not None` check was dead code.
    The same guard is applied to per-article hrefs.
    """
    items = []
    for news in response.css('article.media.media-game.media-game'):
        item = GeneralItem()
        item['news_headline'] = news.css(
            'h3.media-title ::text').extract_first()
        item['datetime'] = news.css(
            'time.media-date ::attr(datetime)').extract_first()
        href = news.css('a.js-event-tracking ::attr(href)').extract_first()
        # Guard: concatenating None with str raises TypeError; skip link-less entries.
        if href is not None:
            news_url = "https://www.gamespot.com" + href
            item['link'] = news_url
            r = Request(url=news_url, callback=self.parse_1)
            r.meta['item'] = item
            yield r
        items.append(item)
    yield {"newsInDetails": items}
    # Check for None BEFORE building the absolute URL (see docstring).
    next_href = response.css(
        'ul.paginate li.paginate__item.skip.next a.btn ::attr(href)'
    ).extract_first()
    if next_href is not None:
        yield scrapy.Request("https://www.gamespot.com" + next_href,
                             callback=self.parse)
def parse(self, response):
    """Parse a Yamu recipe listing page: yield a detail-page Request per
    recipe card, a summary dict of all items, and Requests for the
    hard-coded pagination range (pages 1-7).

    Fixes vs. original: .strip() was called directly on
    extract_first(), which raises AttributeError when the heading is
    missing, and Request(url=None) aborted the callback when a card had
    no href.
    """
    items = []
    for news in response.css('a.front-group-item.item'):
        item = GeneralItem()
        headline = news.css('h3.front-h3 ::text').extract_first()
        # Guard: extract_first() returns None when the selector misses.
        item['news_headline'] = headline.strip() if headline is not None else None
        item['datetime'] = "not in use"
        news_url = news.css('::attr(href)').extract_first()
        item['link'] = news_url
        if news_url is not None:
            r = Request(url=news_url, callback=self.parse_1)
            r.meta['item'] = item
            yield r
        items.append(item)
    yield {"newsInDetails": items}
    # Fixed pagination range, as in the original; Scrapy's dupefilter
    # drops the repeats yielded from every page.
    for i in range(1, 8):
        next_page = "https://www.yamu.lk/recipe?page=" + str(i)
        yield scrapy.Request(next_page, callback=self.parse)
def parse(self, response):
    """Walk each post card in the three-column grid, emitting one
    detail-page Request per post (with the partially-filled item in
    request.meta) and finally a dict wrapping every collected item."""
    collected = []
    for card in response.css('div.small-12.medium-4.large-4.columns'):
        entry = GeneralItem()
        entry['news_headline'] = card.css(
            'header.post-title.entry-header h5 ::text').extract_first()
        entry['datetime'] = card.css(
            'aside.post-author.cf time ::text').extract_first()
        detail_url = card.css(
            'header.post-title.entry-header h5 a ::attr(href)').extract_first()
        entry['link'] = detail_url
        detail_request = Request(url=detail_url, callback=self.parse_1)
        detail_request.meta['item'] = entry
        yield detail_request
        collected.append(entry)
    yield {"data": collected}