import os
import re

# `url`, `get_pages`, `Version`, and `VersionList` are assumed to be
# helpers provided by the surrounding package.


def find_versions_of_archive(archive_url, **kwargs):
    list_url = kwargs.get('list_url', None)
    list_depth = kwargs.get('list_depth', 1)

    if not list_url:
        list_url = os.path.dirname(archive_url)

    # This creates a regex from the URL with a capture group for the
    # version part of the URL.  The capture group is converted to a
    # generic wildcard, so we can use this to extract things on a page
    # that look like archive URLs.
    url_regex = url.wildcard_version(archive_url)

    # We'll be a bit more liberal and just look for the archive part,
    # not the full path.
    archive_regex = os.path.basename(url_regex)

    # Grab some web pages to scrape.
    page_map = get_pages(list_url, depth=list_depth)

    # Build a version list from all the matches we find.
    versions = VersionList()
    for site, page in page_map.items():
        # Extract versions from matches.
        matches = re.finditer(archive_regex, page)
        version_strings = set(m.group(1) for m in matches)
        for v in version_strings:
            versions.add(Version(v))

    return versions
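# A minimal usage sketch for the function above, not taken from the
# original source: the URL and listing page are hypothetical, standing
# in for a project whose release tarballs sit under one directory
# listing.
#
#     found = find_versions_of_archive(
#         'http://example.com/downloads/foo-8.2.1.tar.gz',
#         list_url='http://example.com/downloads',
#         list_depth=2)
#
# Each foo-<version>.tar.gz link found on the listing pages would
# contribute one Version to the returned VersionList.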
def find_versions_of_archive(archive_url, **kwargs):
    list_url = kwargs.get('list_url', None)
    list_depth = kwargs.get('list_depth', 1)
    wildcard = kwargs.get('wildcard', None)

    if not list_url:
        list_url = os.path.dirname(archive_url)

    # By default, derive the version-matching regex from the version
    # parsed out of the archive URL itself.
    if not wildcard:
        wildcard = url.parse_version(archive_url).wildcard()

    versions = VersionList()

    # Match just the archive name, not the full path, and grab some
    # web pages to scrape.
    url_regex = os.path.basename(url.wildcard_version(archive_url))
    page_map = get_pages(list_url, depth=list_depth)

    for site, page in page_map.items():
        # Collect every substring on the page that matches the
        # archive pattern.
        strings = re.findall(url_regex, page)
        for s in strings:
            # Pull the version out of each candidate with the
            # wildcard regex.
            match = re.search(wildcard, s)
            if match:
                v = match.group(0)
                versions.add(Version(v))

    return versions
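# A hypothetical call to the revised function, showing the new
# `wildcard` override it adds over the earlier revision.  The regex
# below is illustrative only; it matches dotted version numbers such
# as 8.2.1 inside an archive name.
#
#     found = find_versions_of_archive(
#         'http://example.com/downloads/foo-8.2.1.tar.gz',
#         wildcard=r'\d+\.\d+(\.\d+)*')
#
# When no wildcard is supplied, the regex is derived from the version
# parsed out of archive_url, so existing callers behave as before.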