Python NewsArticleItem Examples

Programming language: Python

Namespace/package name: ArticleScrapyv2.items

Class/type: NewsArticleItem

Examples at hotexamples.com: 2

Python NewsArticleItem - 2 examples found. These are the top-rated real-world Python examples of ArticleScrapyv2.items.NewsArticleItem extracted from open source projects.

Example #1

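    def parse_item(self, response):
        # Assumes NewsArticleItem is imported from ArticleScrapyv2.items and
        # self.xpath_dict maps "articles", "title", and "url" to XPath strings.
        self.log("Scraping: " + response.url)
        # Select every article node on the page.
        articles = response.xpath(self.xpath_dict["articles"])

        for article in articles:
            item = NewsArticleItem()
            # extract_first() returns None when the XPath matches nothing.
            item["title"] = article.xpath(
                self.xpath_dict["title"]).extract_first()
            item["url"] = article.xpath(self.xpath_dict["url"]).extract_first()

            yield item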
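
The loop above can be tried without a running spider. The following is only a rough sketch: it swaps Scrapy selectors for the standard library's xml.etree.ElementTree (which supports a limited XPath subset), and the XPATH_DICT values, the SAMPLE markup, and the parse_articles name are all assumptions made for illustration, since the source does not show the real xpath_dict or target pages.

```python
import xml.etree.ElementTree as ET

# Hypothetical XPath mapping; the real xpath_dict is not shown in the source.
XPATH_DICT = {
    "articles": ".//article",
    "title": "./h2",
    "url": "./a",
}

# Made-up, well-formed markup standing in for a scraped page.
SAMPLE = """
<html><body>
  <article><h2>First story</h2><a href="https://example.com/first">read</a></article>
  <article><h2>Second story</h2><a href="https://example.com/second">read</a></article>
</body></html>
""".strip()


def parse_articles(markup, xpath_dict):
    """Yield one dict per article node, mirroring the loop in parse_item."""
    root = ET.fromstring(markup)
    for article in root.findall(xpath_dict["articles"]):
        title_node = article.find(xpath_dict["title"])
        url_node = article.find(xpath_dict["url"])
        # Missing nodes become None, like extract_first() with no match.
        yield {
            "title": title_node.text if title_node is not None else None,
            "url": url_node.get("href") if url_node is not None else None,
        }


items = list(parse_articles(SAMPLE, XPATH_DICT))
print(items)
```

Plain dicts stand in for NewsArticleItem here; in the spider itself the same keys are assigned on the item object.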

Example #2

    def parse_item(self, response):
        # Assumes `import logging` and NewsArticleItem at module level.
        self.log("Scraping: " + response.url)

        articles = response.xpath(self.xpath_dict["articles"])
        logging.info("Articles:\n%s", articles)

        for article in articles:
            item = NewsArticleItem()
            try:
                item["url"] = article.xpath(
                    self.xpath_dict["url"]).extract_first()
                logging.info("Link scraped:\n%s", item["url"])
                # Derive the title from the URL: keep what follows the last
                # ".com" and drop the final 10 characters.
                title = item["url"].split(".com")[-1]
                item["title"] = title[:len(title) - 10]
            except AttributeError:
                # extract_first() returned None, so there is no URL to split.
                logging.warning("Root Spider failed to scrape item")
            yield item
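
The title derivation in this example can be isolated and checked on its own. The helper below is a minimal sketch, not part of the original project: the title_from_url name, the suffix_len parameter, and the sample URL are assumptions for illustration. It handles a missing URL explicitly rather than relying on AttributeError.

```python
def title_from_url(url, suffix_len=10):
    """Derive a rough title from whatever follows the last '.com' in url.

    Hypothetical helper: it reproduces the split-and-slice logic of the
    example, returning None when no URL was scraped.
    """
    if url is None:
        return None
    tail = url.split(".com")[-1]
    # Same slice as title[:len(title) - 10] in the example.
    return tail[:len(tail) - suffix_len]


# Made-up URL whose last path segment, "12345.html", is 10 characters long.
print(title_from_url("https://news.example.com/markets-rally/12345.html"))
print(title_from_url(None))
```

Two caveats follow from the slicing itself. A URL that does not contain ".com" is not split at all, so the whole URL minus its last ten characters becomes the title. A tail shorter than ten characters makes the stop index negative, so fewer characters are trimmed than intended.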