Python ItemLoader.default_onput_processorの例

プログラミング言語: Python

名前空間/パッケージ名: scrapy.loader

クラス/型: ItemLoader

メソッド/関数: default_onput_processor

hotexamples.comのコード掲載数: 1

Python ItemLoader.default_onput_processor - 1件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのscrapy.loader.ItemLoader.default_onput_processorの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

ItemLoader(30)

add_xpath(30)

load_item(30)

get_xpath(30)

default_output_processor(30)

default_input_processor(30)

get_collected_values(30)

add_css(30)

add_value(30)

replace_value(28)

get_output_value(28)

nested_css(14)

nested_xpath(11)

_add_value(8)

get_css(6)

selector(6)

__init__(6)

get_value(4)

items(2)

values(2)

price_in(2)

number_of_reviews_in(1)

strip(1)

add_xpath_string(1)

address_out(1)

replace_css(1)

replace(1)

originCity_in(1)

features_in(1)

TakeFirst(1)

ad_value(1)

load_items(1)

default_onput_processor(1)

default_ouput_processor(1)

_local_item(1)

defualt_output_processor(1)

destinationCity_in(1)

deafult_input_processor(1)

コード例 #1

ファイルを表示

def load_author(response,author):
    auths = response.xpath(author['auth'])
    for auth in auths:
        l = ItemLoader(item = AuthorItem(), response = response)
        l.default_onput_processor = TakeFirst()

        # author's first name and last name
        fn = auth.xpath(author['fn']).extract()[0]
        ln = auth.xpath(author['ln']).extract()[0]
        l.add_value('fname', fn)
        l.add_value('lname', ln)

        # author's email
        try:
            email = auth.xpath(author['email']).extract()[0][7:]
            l.add_value('email', email)
        except:
            pass

        # author's address and institution
        try:
            fid = auth.xpath(author['fid']).extract()[0][1:]
            address = l.get_xpath(author['address'] %fid)

            for i in address[0].split(', '):
                if 'niversity' in i:
                    institution = i
                    break
            l.add_value('address', address)
            l.add_value('institution', institution)
        except:
            pass

        # author's vitae
        try:
            href = auth.xpath(author['href']).extract()[0][1:]
            vitae = response.xpath(author['vitae'] %href).extract()[0]
            l.add_value('vitae', fn+' '+ln+vitae)
        except:
            pass

        # author's avatar
        try:
            href = auth.xpath(author['href']).extract()[0][1:]
            avatar = response.xpath(author['avatar'] %href).extract()[0]
            l.add_value('avatar', avatar)
        except:
            pass

        yield l