Python ItemLoader.get_valueの例

プログラミング言語: Python

名前空間/パッケージ名: scrapy.loader

クラス/型: ItemLoader

メソッド/関数: get_value

hotexamples.comのコード掲載数: 4

Python ItemLoader.get_value - 4件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのscrapy.loader.ItemLoader.get_valueの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

ItemLoader(30)

add_xpath(30)

load_item(30)

get_xpath(30)

default_output_processor(30)

default_input_processor(30)

get_collected_values(30)

add_css(30)

add_value(30)

replace_value(28)

get_output_value(28)

nested_css(14)

nested_xpath(11)

_add_value(8)

get_css(6)

selector(6)

__init__(6)

get_value(4)

items(2)

values(2)

price_in(2)

number_of_reviews_in(1)

strip(1)

add_xpath_string(1)

address_out(1)

replace_css(1)

replace(1)

originCity_in(1)

features_in(1)

TakeFirst(1)

ad_value(1)

load_items(1)

default_onput_processor(1)

default_ouput_processor(1)

_local_item(1)

defualt_output_processor(1)

destinationCity_in(1)

deafult_input_processor(1)

コード例 #1

ファイルを表示

ファイル: rmpspider.py プロジェクト: harrywang/rmp

    def parse_rating(self, response):
        body_dic = json.loads(response.body.decode('utf-8'))
        for rating in body_dic['ratings']:
            l = ItemLoader(item=RatingItem(), response=response)
            l.default_output_processor = TakeFirst()
            # error, change tid to id
            l.add_value('rid', rating['id'])
            l.add_value('tid', response.meta['tid'])
            l.add_value('sid', response.meta['sid'])

            l.add_value('attendance', rating['attendance'])
            l.add_value('helpCount', rating['helpCount'])
            l.add_value('notHelpCount', rating['notHelpCount'])
            l.add_value('rClarity', rating['rClarity'])
            l.add_value('rClass', rating['rClass'])
            l.add_value('rComments', rating['rComments'])
            l.add_value('rDate', rating['rDate'])
            l.add_value('rEasy', rating['rEasy'])
            l.add_value('rErrorMsg', rating['rErrorMsg'])
            l.add_value('rHelpful', rating['rHelpful'])
            l.add_value('rInterest', rating['rInterest'])
            l.add_value('rOverall', rating['rOverall'])
            l.add_value('rStatus', rating['rStatus'])
            l.add_value('rTextBookUse', rating['rTextBookUse'])
            l.add_value('rWouldTakeAgain', rating['rWouldTakeAgain'])

            l.add_value('takenForCredit', rating['takenForCredit'])
            l.add_value('teacher', rating['teacher'])
            l.add_value('teacherGrade', rating['teacherGrade'])

            keys = l.get_value(rating['teacherRatingTags'])
            values = [1] * len(keys)
            l.add_value('tags', json.dumps(dict(zip(keys, values))))
            yield l.load_item()

コード例 #2

ファイルを表示

ファイル: meizitu.py プロジェクト: IceDerce/spider_fight

    def parse_page2(self, response):
        relativepath = "/home/zeaslity/Pictures/zhihu/"
        l = ItemLoader(item=PicCrawlItem())
        l.add_css("image_urls", "div.content a img::attr(src)")
        l.add_css("image_title", "div.page em")
        path = relativepath + l.get_value("fileName")
        l.add_value("path", path)

        for href in response.css("div.page a::attr(href)"):
            yield response.follow(href, self.parse_page2)

コード例 #3

ファイルを表示

from CommonProduct import Product
from scrapy.loader import ItemLoader
from scrapy.loader.processors import TakeFirst

if __name__ == '__main__':
    il = ItemLoader(item=Product())
    il.add_value('name', [u'Welcome to my', u'<strong>website</strong>'])
    il.add_value('price', [u'&euro;', u'<span>1000</span>'])
    item = il.load_item()
    print(item)

    #########ItemLoader objects
    loader = ItemLoader(product=Product())

    loader.context['unit'] = 'cm'
    val = loader.get_value(u'name: foo', TakeFirst(), str.upper, re='name: (.+)')
    #print(val)
    loader.add_value('name', u'Color TV')
    loader.add_value('colours', [u'white', u'blue'])
    loader.add_value('length', u'100')
    loader.add_value('name', u'name: foo', TakeFirst(), re='name: (.+)')
    loader.add_value(None, {'name': u'foo', 'sex': u'male'})
    print(loader.load_item())

コード例 #4

ファイルを表示

________________________________________________________________
________________________________________________________________
 - Trả về 1 new Item Loader để nạp Item. Nếu ko có Item cho trước, nó tự động khởi tạo sử dụng class trong 'default_item_class'.
 - Khi được khởi tạo với parameter là 1 selector hoặc 1 response, ItemLoader cung cấp cơ chế thuận tiện để trích xuất dữ liệu từ các website mà chúng sử dụng selectors.
'Parameters':	
	|item (Item object) – The item instance to populate using subsequent calls to add_xpath(), add_css(), or add_value().
	|selector (Selector object) – The selector to extract data from, when using the add_xpath() (resp. add_css()) or replace_xpath() (resp. replace_css()) method.
	|response (Response object) – The response used to construct the selector using the default_selector_class, unless the selector argument is given, in which case this argument is ignored.
 - Item, selector, response và các argument chính còn lại được gán cho Loader context (có thể truy nhập qua thuộc tính 'context')
 - Instance của Itemloader có các phương thức sau :
__________________________________________________________
'get_value(value, *processors, **kwargs)'
 - Xử lý 'value' cho trước bằng 'processors' và các argument chính
 - Vd : 
>>> from scrapy.loader.processors import TakeFirst
>>> loader.get_value(u'name: foo', TakeFirst(), unicode.upper, re='name: (.+)')
>>> Ra :'FOO`
__________________________________________________________
'add_value(field_name, value, *processors, **kwargs)'
 - Xử lý và sau đó thêm 'value' đã cho vào trường field_name.
 - Đầu tiên, value được truyền vào hàm 'get_value()' bởi processors và kwargs, sau đó truyền vào processor input của field_name, và kết quả được thêm vào dữ liệu đã thu thập trước đó cho field_name. Nếu field_name đã chứa dữ liệu đc thu thập, dữ liệu mới sẽ được thêm vào.
 - Trong trường hợp field_name = None, các giá trị cho các field có thể được đưa vào, và giá trị được xử lý phải là 1 dictinary mà các field đưa vào map tới các giá trị đó.
 - Vd :
loader.add_value(None, {'name': u'foo', 'sex': u'male'})
loader.add_value('name', u'Color TV')
loader.add_value('colours', [u'white', u'blue'])
loader.add_value('length', u'100')
loader.add_value('name', u'name: foo', TakeFirst(), re='name: (.+)')
__________________________________________________________
'replace_value(field_name, value, *processors, **kwargs)'
 - Tương tự add_value, nhưng nó thay thế giá trị cũ với giá trị mới.
__________________________________________________________