def save_img_in_url(url, fn='tempimg', maxdepth=1, currentdepth=1):
    """Fetch *url* and save the image(s) it leads to into file *fn*.

    If the resource is an image, it is written directly to *fn*.  If it
    is an HTML page, every <img> tag on it is followed recursively, up
    to *maxdepth* levels of recursion (*currentdepth* tracks the level).

    On any failure the offending URL is logged to '<fn>.saveerror' and
    GrabImgError is raised to the caller.
    """
    try:
        if currentdepth > maxdepth:
            raise TooDeepError(maxdepth)
        # get resource (fetched once and reused below)
        goturl = urlp.fetch_url(url)
        # check type and save
        if 'image' == goturl.info().getmaintype():
            # FIX: reuse the response already fetched above instead of
            # downloading the same URL a second time.
            with open(fn, 'wb') as f:
                f.write(goturl.read())
            dmsg('saved img:' + url)
        elif 'text/html' == goturl.info().gettype():
            imglist = bs(goturl)('img')
            if imglist:
                for img in imglist:
                    save_img_in_url(img['src'], fn, maxdepth,
                                    currentdepth + 1)
            else:
                dmsg("no img found")
                raise ImgNotFoundError(url, "no img found in the given url")
        else:
            raise OpenUrlError(url, 'unknown type' + goturl.info().gettype())
    except Exception:
        # Narrowed from a bare `except:` so KeyboardInterrupt/SystemExit
        # still propagate.  Record the failing URL, then convert any
        # lower-level error into the single exception callers expect.
        logfn = fn + '.saveerror'
        with open(logfn, 'w') as l:
            l.write(url)
        raise GrabImgError
def main(url): step = 1 while url: print print "step", step print "page", url soup = bs(urlp.fetch_url(url)) try: prob_save_post(soup) except NoTimestampError: print "missing timestamp", url step = step + 1 if step > num_limit: break url = newer_page(soup) if url is None: break sleep(sleep_between_post)