def discover(self, start_url: str, limit: int) -> List[str]:
    """
    Fetch the URL provided and retrieve links, subsequently fetching
    the pages at those links until reaching limit (or running out of links).

    :param start_url: URL to start from
    :param limit: maximum number of URLs to return in the list
    :return: list of URLs discovered
    """
    urls = [start_url]
    seen = {start_url: True}
    count = 1
    while len(urls) > 0 and count < limit:
        url = urls.pop()
        contents = self.content_fetcher.retrieve_page(url)
        # Keep only links we have not already queued or visited
        new_urls = filter(lambda x: x not in seen, extract_urls(url, contents))
        for new_url in new_urls:
            if count == limit:
                break
            urls.append(new_url)
            seen[new_url] = True
            count += 1
    return list(seen.keys())
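# discover depends on an extract_urls helper whose expected behaviour the
# tests below pin down: it finds links only inside <a> tags and resolves
# relative hrefs against the page's URL. A minimal sketch of such a helper,
# assuming BeautifulSoup with the "html.parser" backend (the project's
# actual implementation is not shown in this section):
from typing import List
from urllib.parse import urljoin

from bs4 import BeautifulSoup


def extract_urls(origin_url: str, html: str) -> List[str]:
    """Return an absolute URL for every <a href=...> in the given HTML."""
    soup = BeautifulSoup(html, "html.parser")
    # urljoin leaves absolute hrefs untouched and resolves relative ones
    # (e.g. "/hello.txt") against origin_url
    return [urljoin(origin_url, a["href"]) for a in soup.find_all("a", href=True)]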
def test_extract_links_finds_all_links_in_atags(self):
    self.assertEqual(
        extract_urls(self.origin_url, self.html_both_links_in_tags),
        ["https://crawler-test.com", "https://crawler-test.com/hello.txt"])

def test_extract_links_finds_absolute_link_in_atag(self):
    self.assertEqual(
        extract_urls(self.origin_url, self.html_absolute_link_in_tag),
        ["https://crawler-test.com"])

def test_extract_links_resolves_relative_link_in_atag(self):
    self.assertEqual(
        extract_urls(self.origin_url, self.html_relative_link_in_tag),
        ["https://crawler-test.com/hello.txt"])

def test_extract_links_does_not_find_link_outside_atags(self):
    self.assertEqual(extract_urls(self.origin_url, self.html_link_in_text), [])

def test_extract_links_finds_no_links_when_not_present(self):
    self.assertEqual(extract_urls(self.origin_url, self.html_no_link), [])
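# The fixtures referenced above (origin_url and the html_* strings) are
# defined elsewhere in the test class. A plausible setUp, with values
# inferred from the assertions; the exact markup used by the project is
# an assumption:
import unittest


class ExtractUrlsTest(unittest.TestCase):
    def setUp(self):
        self.origin_url = "https://crawler-test.com"
        self.html_absolute_link_in_tag = '<a href="https://crawler-test.com">home</a>'
        self.html_relative_link_in_tag = '<a href="/hello.txt">hello</a>'
        self.html_both_links_in_tags = (
            self.html_absolute_link_in_tag + self.html_relative_link_in_tag
        )
        # A bare URL in text, not wrapped in an <a> tag
        self.html_link_in_text = "<p>https://crawler-test.com</p>"
        self.html_no_link = "<p>nothing to see here</p>"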