Python urljoin 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: python.url_enhance

메소드/함수: urljoin

hotexamples.com에서의 예제들: 3

Python urljoin - 3개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 python.url_enhance.urljoin에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: bootcss.py 프로젝트: xunull/Example-Scrapy

    def parse(self,response):
        # 这个跟node比 有个缺点是 node 就是异步的，而这里好像不是异步的，
        # 因此第一个主页面  将会在最后才能被保存
        self.savePath(response.url,response.body)
        # 这个路径是执行命令的位置的相对路径
        urls = response.css('a').extract()
        imgs = response.css('img').extract()
        for a in response.css('a'):

            if a.css('::attr(href)').extract() != None:
                item = BootcssItem()
                link=a.css('::attr(href)').extract()[0]
                item['title']=a.css('::attr(title)').extract()
                item['link']=link
                # print(item)
                if link not in self.crawled_urls:
                    self.crawled_urls.append(link)
                    if link.startswith('http'):
                        # 大部分是外部链接
                        pass
                    elif link.startswith('#'):
                        # 页面锚点
                        pass
                    else :
                        if link.endswith('.js'):
                            # js文件
                            pass
                        elif link.endswith('.css'):
                            # css文件
                            pass
                        else:
                            # 正常的页面链接
                            targetUrl=urljoin(response.url,link)
                            yield scrapy.Request(targetUrl, self.parse)

예제 #2

파일 보기

파일: tupian.py 프로젝트: xunull/Example-Scrapy

 def saveImg(self,path,content):
     tempPath=path[len(self.start_urls[0]):]
     if os.path.splitext(tempPath)[1] == '':
         # 链接没有路径，restful 风格
         # 处理方式都转换成一个index.html
         path=urljoin(path,'index.html')
     else:
         pass
     path=path[7:]
     saveWebFile(path,content)

예제 #3

파일 보기

파일: tupian.py 프로젝트: xunull/Example-Scrapy

 def handleA(self,url):
     if url.startswith('http'):
         # 完整路径
         pass
     elif url.startswith('#'):
         # 锚点
         return None
     else:
         # 绝对路径或者是相对路径
         url=urljoin(self.start_urls[0],url)
     return url