Python HtmlResponse._get_urlの例

プログラミング言語: Python

名前空間/パッケージ名: scrapy.http

クラス/型: HtmlResponse

メソッド/関数: _get_url

hotexamples.comのコード掲載数: 1

Python HtmlResponse._get_url - 1件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのscrapy.http.HtmlResponse._get_urlの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

HtmlResponse(30)

css(30)

xpath(30)

follow(23)

urljoin(22)

json(16)

request(13)

body_as_unicode(9)

follow_all(6)

meta2(3)

_status(2)

_set_body(2)

copy(2)

flags(1)

_get_url(1)

encoding(1)

driver(1)

read(1)

replace(1)

status(1)

status_code(1)

url_list(1)

browser(1)

headers(1)

コード例 #1

ファイルを表示

    def vacacy_parse(self, response: HtmlResponse):

        self.name = response.xpath('//h1/text()').extract_first()
        salary_job = response.css('p.vacancy-salary span::text').extract()
        salary_job = correct_list(salary_job)
        self.salary_parse(salary_job=salary_job)
        self.link = response._get_url()
        self.company_name = response.xpath(
            "//div[contains(@class, 'vacancy-company-name-wrapper')]//text()"
        ).extract()

        if len(self.company_name) > 1:
            self.company_name = ' '.join(
                map(str, correct_list(self.company_name)))
        else:
            self.company_name = self.company_name[0]

        self.address = response.xpath(
            "//p[contains(@data-qa, 'vacancy-view-location')]//text()"
        ).extract()

        if len(self.address) > 1:
            self.address = ', '.join(map(str, correct_list(self.address)))
        else:
            self.address = self.address[0]

        vacancy_description = response.xpath(
            "//div[@class='vacancy-description']")
        self.experience = vacancy_description.xpath(
            "//span[@data-qa='vacancy-experience']/text()").extract_first()
        self.mode = vacancy_description.xpath(
            "//p[@data-qa='vacancy-view-employment-mode']//text()").extract()
        self.mode = correct_list(self.mode)

        vacancy_desc_sections = vacancy_description.xpath(
            "//div[@class='vacancy-section']")
        self.description = vacancy_desc_sections[0].xpath(
            "//div[@data-qa='vacancy-description']//text()").extract()
        self.description = correct_list(self.description)
        self.description = '\n'.join(map(str, self.description))

        self.accept_handicapped = vacancy_desc_sections[1].xpath(
            "//span[@xpath='1']//text()").extract()

        if not self.accept_handicapped:
            self.accept_handicapped = None

        self.key_skills = vacancy_desc_sections[2].xpath(
            "//span[contains(@class, 'bloko-tag__section_text')]/text()"
        ).extract()
        self.key_skills = correct_list(self.key_skills)

        yield JobparserItem(name=self.name,
                            company_name=self.company_name,
                            address=self.address,
                            salary_min=self.salary_min,
                            salary_max=self.salary_max,
                            currency=self.currency,
                            payment_type=self.payment_type,
                            experience=self.experience,
                            mode=self.mode,
                            description=self.description,
                            accept_handicapped=self.accept_handicapped,
                            key_skills=self.key_skills,
                            link=self.link,
                            site=self.site)