Python reshape_title Examples

Programming language: Python

Namespace/package name: HedeSpider.tools

Method/function: reshape_title

Examples at hotexamples.com: 2

Python reshape_title - 2 examples found. These are the top-rated real-world Python examples of HedeSpider.tools.reshape_title, extracted from open source projects.

Example #1

File: canyin168.py Project: 943426866/HedeSpider

    def parse_content(self, response):
        title = response.css('.biaoti h1 span font::text').get()
        if title is None:
            title = response.url
        # Keep illegal characters out of the article title
        title = tools.reshape_title(title)

        content = response.css('.zuo_nr').get()
        soup = bs(content, 'lxml')
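        # Drop the title block from the body; find() returns None when it is absent
        heading = soup.find(class_='biaoti')
        if heading is not None:
            heading.extract()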
        content = soup.prettify()
        # Strip font formatting and images
        content = tools.reshape_content(content)

        path = tools.reshape_path(self.name)

        item = items.HedespiderItem()
        item['title'] = title
        item['content'] = content
        item['path'] = path
        item['userid'] = self.userid
        if len(self.keywords) == 0:
            yield item
        for keyword in self.keywords:
            if keyword in str(item):
                yield item
                break
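
The examples call tools.reshape_title, but its source is not shown on this page; the inline comment says only that it keeps illegal characters out of the article title. The following is a minimal sketch of what such a helper might look like, assuming the title is later used as a file name and that the characters forbidden in Windows file names are the ones to remove. The name reshape_title_sketch, the character set, the max_length parameter, and the 'untitled' fallback are all assumptions for illustration, not the project's actual code.

```python
import re

# Assumption: characters rejected in Windows file names, plus ASCII control characters.
_ILLEGAL_CHARS = re.compile(r'[\\/:*?"<>|\x00-\x1f]')


def reshape_title_sketch(title, max_length=100):
    """Return a title with illegal characters removed and whitespace collapsed."""
    # Turn newlines, tabs and runs of whitespace into single spaces first,
    # so that words separated by a line break are not glued together.
    cleaned = re.sub(r'\s+', ' ', title)
    # Remove the characters assumed to be illegal.
    cleaned = _ILLEGAL_CHARS.sub('', cleaned)
    # Removing characters can leave double spaces behind; collapse them again.
    cleaned = re.sub(r' +', ' ', cleaned).strip()
    # Hypothetical fallback so the caller never receives an empty title.
    return cleaned[:max_length].strip() or 'untitled'
```

Because both examples fall back to response.url when no title is found, a helper like this would also have to cope with URLs, whose slashes, colons and question marks would all be stripped under these assumptions.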

Example #2

    def parse_content(self, response):
        title = response.css('.tit::text').get()
        if title is None:
            title = response.url
        # Keep illegal characters out of the article title
        title = tools.reshape_title(title)

        content = response.css('.content').get()
        # Strip font formatting and images
        content = tools.reshape_content(content)

        path = tools.reshape_path(self.name)

        item = items.HedespiderItem()
        item['title'] = title
        item['content'] = content
        item['path'] = path
        item['userid'] = self.userid
        if len(self.keywords) == 0:
            yield item
        for keyword in self.keywords:
            if keyword in str(item):
                yield item
                break
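
Both examples end with the same filter: the item is yielded unconditionally when self.keywords is empty, and otherwise at most once, if any keyword occurs in str(item). That check can be sketched as a standalone predicate. The name matches_keywords is hypothetical, and a plain dict stands in for HedespiderItem here.

```python
def matches_keywords(item, keywords):
    """Return True if the item should be kept.

    An empty keyword list keeps everything; otherwise the item is kept
    when at least one keyword occurs in its string representation.
    """
    if not keywords:
        return True
    text = str(item)
    return any(keyword in text for keyword in keywords)


if __name__ == '__main__':
    sample = {'title': 'hot pot recipes', 'content': '<p>...</p>'}
    print(matches_keywords(sample, []))
    print(matches_keywords(sample, ['pot', 'noodles']))
    print(matches_keywords(sample, ['noodles']))
```

Inside parse_content this would reduce the tail of the method to a single `if matches_keywords(item, self.keywords): yield item`. Note that, as in the original loop, the match runs against str(item), so field names such as 'title' are searched along with the field values.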