Python XPathItemLoader.name_in 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: scrapy.contrib.loader

클래스/타입: XPathItemLoader

메소드/함수: name_in

hotexamples.com에서의 예제들: 1

Python XPathItemLoader.name_in - 1개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 scrapy.contrib.loader.XPathItemLoader.name_in에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

XPathItemLoader(30)

add_value(30)

add_xpath(30)

load_item(30)

default_input_processor(14)

default_output_processor(14)

get_output_value(9)

replace_value(1)

name_in(1)

load_items(1)

get_xpath(1)

deffault_input_processor(1)

get_collected_values(1)

__init__(1)

defalut_output_processor(1)

county_in(1)

add_css(1)

state_in(1)

예제 #1

파일 보기

파일: NrcMaterialsScraper.py 프로젝트: netconstructor/scraper-2

    def parse_materials(self, response):
        reportnum = response.request.meta['reportnum']
        text = unicode (response.body, response.encoding)
        hxs = HtmlXPathSelector(text=text)
        materials = hxs.select ('//table[@class="t16Standard"]/tr')
        if (len(materials) == 0):
            self.log('Materials data not present in response from {0}'.format(response.url), log.INFO)
        else:
            # Skip the first report record because this is the header row
            materials.pop (0)
            if (len(materials) == 0):
                self.log('No materials reports found in response {0}'
                         .format(reportnum), log.INFO)
            else:
                self.log('Retrieved {0} materials records in report {1}'
                         .format(len(materials),reportnum), log.INFO)

        for material in materials:
            l = XPathItemLoader(NrcScrapedMaterial(), material)
            l.name_in = lambda slist: [s[:32] for s in slist]
            l.add_value('reportnum', reportnum)
            for name, params in NrcScrapedMaterial.fields.items():
                if 'xpath' in params:
                    l.add_xpath(name, params['xpath'])
            item = l.load_item()
            yield item
     
        self.db.setBotTaskStatus(reportnum, self.name, 'DONE')