예제 #1
0
        elif response.status_code == 403:
            log.error('request is forbidden by the server...')
            return 0
        else:
            log.error(response.status_code)
            return 0
    except requests.exceptions.RequestException as e:
        log.error(response.status_code + "超时3次")
    return 0

    


# Crawl job information and save the contents under the data folder of the
# current directory.
if __name__ == '__main__':
    # Each entry of the crawl list is one job keyword read from the XML config.
    craw_job_list = parse_job_xml('../config/job.xml')
    for _ in craw_job_list:
        # Create the joblist object (one row per crawled job posting).
        joblist = crawl_jobs(_)
        # Chinese column headers for the output spreadsheet:
        # company ID, work experience, education, job type, job title, job ID,
        # publish time, city, company logo, industry, job highlights, ...
        # NOTE(review): this literal continues beyond the visible excerpt.
        col = [
            u'公司ID',
            u'工作经验',
            u'教育程度',
            u'工作性质',
            u'岗位名称',
            u'岗位ID',
            u'发布时间',
            u'城市',
            u'公司LOGO',
            u'工业领域',
            u'岗位优势',
예제 #2
0
    # Success: derive the number of result pages (15 items per page) from the
    # JSON payload of the listing endpoint.
    if response.status_code == 200:
        # NOTE(review): int(total / 15 + 1) over-counts by one page when
        # totalCount is an exact multiple of 15 (e.g. 30 -> 3 instead of 2);
        # a true ceiling is (total + 14) // 15. It also maps 0 -> 1, which may
        # be a deliberate "at least one page" choice — confirm before changing.
        max_page_no = int(int(response.json()['content']['data']['page']['totalCount']) / 15 + 1)

        return max_page_no
    elif response.status_code == 403:
        # Server explicitly refused the request (likely anti-crawler block).
        log.error('request is forbidden by the server...')

        return 0
    else:
        # Any other status code: log it and report zero pages.
        log.error(response.status_code)

        return 0


if __name__ == '__main__':
    # Crawl every job keyword listed in the XML config and export each result
    # set to its own Excel workbook under ./data/.
    craw_job_list = parse_job_xml('../config/job.xml')
    # Renamed loop variable from `_` (conventionally a throwaway name) to
    # `job_name`, since it is actually used below.
    for job_name in craw_job_list:
        # One row per crawled job posting, matching the column order below.
        joblist = crawl_jobs(job_name)
        # Chinese column headers: job code, job title, city, publish date,
        # salary, company code, company name, company full name.
        col = [
            u'职位编码',
            u'职位名称',
            u'所在城市',
            u'发布日期',
            u'薪资待遇',
            u'公司编码',
            u'公司名称',
            u'公司全称']
        df = pd.DataFrame(joblist, columns=col)
        # Renamed from `dir` to avoid shadowing the builtin of the same name.
        data_dir = "./data/"
        mkdirs_if_not_exists(data_dir)
        df.to_excel(os.path.join(data_dir, job_name + ".xlsx"),
                    sheet_name=job_name, index=False)