Python downloadResource Examples

Programming language: Python

Namespace/package name: scraptools

Method/function: downloadResource

Examples at hotexamples.com: 3

Python downloadResource - 3 examples found. These are the top-rated real-world Python examples of scraptools.downloadResource, extracted from open source projects.

Example #1

File: Scrap_Imgur.py Project: bunstable/WebScraping

def downloadImgur(href, path=''):
    '''Detects the type of url and does the appropriate download'''
    if 'gallery/' in href:
        downloadImgurPage(href, path)
    elif '/r/' in href:
        downloadImgurGallery(href, path)
    elif href[-4:-3] == '.':  # possibly a direct image link, e.g. .jpg, .png (slice avoids IndexError on short URLs)
        downloadResource(href, destPath=path)
    else:
        imgBox = getElementsFromUrl(href, 'div.image.textbox > a')
        for e in imgBox:
            src = e.get('href')
            downloadResource(src, destPath=path)
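
The extension check in downloadImgur above only matches three-letter extensions and is confused by query strings. As a minimal sketch (Python 3, not part of the original project), the same idea can be written with the standard library's URL parser. The helper name `looks_like_image` and the extension list are assumptions made for illustration.

```python
import posixpath
from urllib.parse import urlparse

# Assumed set of extensions; the original project does not list any.
IMAGE_EXTENSIONS = {'.jpg', '.jpeg', '.png', '.gif'}


def looks_like_image(url):
    """Return True if the URL path ends in a known image extension."""
    # urlparse separates the path from any query string or fragment,
    # so 'pic.jpg?1' is still recognised as 'pic.jpg'.
    path = urlparse(url).path
    ext = posixpath.splitext(path)[1].lower()
    return ext in IMAGE_EXTENSIONS
```

Under these assumptions, `elif looks_like_image(href):` could stand in for the character-position test, at the cost of only accepting the listed extensions.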

Example #2

File: Scrap_Tumblr.py Project: bunstable/WebScraping

    # Excerpt: the enclosing function definition is not shown in the source.
    for i, postUrl in enumerate(postUrls[:limit], 1):
        print(i, '/', limit, postUrl)

        # Find pictures directly on the post
        newSrcs = cleanImgSrcs(getImgSrcs(postUrl))
        print('\tFound :', len(newSrcs))
        imageSrcs += newSrcs

        # Find pictures in the post's photoset iframes
        elems = getElementsFromUrl(postUrl, 'iframe.photoset')
        iframeUrls = [e.get('src') for e in elems]
        for iframeUrl in iframeUrls:
            print('\tiframe:', iframeUrl)
            iframeImageSrcs = cleanImgSrcs(getImgSrcs(iframeUrl))
            print('\tFound :', len(iframeImageSrcs))
            imageSrcs += iframeImageSrcs

    return imageSrcs

if __name__ == '__main__':
    print('Getting image srcs...')
    srcs = getSearchImgs('cat')

    print('Result:')
    print('\n'.join(srcs))

    print('Downloading images...')
    for i, src in enumerate(srcs, 1):
        print(i, '/', len(srcs))
        # Save every image into the 'tumblr' directory
        downloadResource(src, destPath='tumblr')

Example #3

File: Scrap_Imgur.py Project: bunstable/WebScraping

def downloadImgurPage(href, path=''):
    '''Downloads all the images from an imgur page or album'''
    imgSrcs = getImgurImageSrcs(href)
    for src in imgSrcs:
        downloadResource(src, destPath=path)
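
None of the examples show the body of downloadResource itself. The following is a rough sketch (Python 3, standard library only) of what a function with that call shape might do: derive a file name from the URL and stream the response into a destination directory. It is not the scraptools implementation; the names `download_resource`, `resource_filename`, and the `default` fallback name are hypothetical.

```python
import os
import shutil
import urllib.request
from urllib.parse import unquote, urlparse


def resource_filename(url, default='download'):
    """Derive a local file name from the last path segment of the URL."""
    # The query string is ignored; 'default' is an assumed fallback
    # for URLs whose path ends in '/'.
    name = os.path.basename(unquote(urlparse(url).path))
    return name or default


def download_resource(url, dest_path=''):
    """Save the resource at `url` into `dest_path`; return the local path."""
    if dest_path:
        os.makedirs(dest_path, exist_ok=True)
    target = os.path.join(dest_path, resource_filename(url))
    # Stream the body to disk instead of loading it all into memory.
    with urllib.request.urlopen(url) as response, open(target, 'wb') as out:
        shutil.copyfileobj(response, out)
    return target
```

With these hypothetical names, `download_resource(src, dest_path='tumblr')` mirrors the `downloadResource(src, destPath='tumblr')` call used in the examples. A real scraper would probably also want a timeout and error handling around `urlopen`.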