Python NewsContentItem 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: newsSpider.items

클래스/타입: NewsContentItem

hotexamples.com에서의 예제들: 2

Python NewsContentItem - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 newsSpider.items.NewsContentItem에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

NewsContentItem(2)

자주 사용되는 메소드들

NewsContentItem (2)

예제 #1

파일 보기

파일: beiqingwang_spider.py 프로젝트: fevin/NewsSpiders

 def parseContent(self, response):
     # 获取文章各部分信息
     article = response.selector.xpath('//div[@id="articleContent"]')
     item = NewsContentItem()
     item['title'] = article.xpath('//div[@class="articleTitle"]/h2/text()').extract()[0]
     contents = article.xpath('//div[@class="articleBox mb20 cfix"]/p/text()').extract()
     item['content'] = ''
     for cont in contents:
         item['content'] += cont.strip() + '<br />'
     item['url']   = response.url
     item['time']  = article.xpath('//span[@class="yearMsg"]/text()').extract()[0]
     item['site']  = '北青网'
     yield item

예제 #2

파일 보기

 def parseContent(self, response):
     # 获取文章各部分信息
     article = response.selector.xpath('//div[@class="article"]')
     item = NewsContentItem()
     item['title'] = article.xpath('//h1/text()').extract()[0]
     contents = article.xpath('//div[@class="text"]/p/text()').extract()
     item['content'] = ''
     for cont in contents:
         item['content'] += cont.strip() + '<br />'
     item['url']   = response.url
     item['time']  = time.strftime("%Y-%m-%d %H:%M:%S",time.localtime(time.time()))
     item['site']  = '京郊日报'
     yield item