Python srvs_connect Examples

Programming language: Python

Namespace/package name: lib.discovery

Method/function: srvs_connect

Examples at hotexamples.com: 6

Python srvs_connect - 6 examples found. These are the top-rated real-world Python examples of lib.discovery.srvs_connect, extracted from open source projects.
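
All of the listings on this page call `srvs_connect` inside a `with` statement, so it behaves as a context manager that hands back a client object. The implementation in `lib.discovery` is not shown on this page; the following is only a minimal Python 3 sketch of that general pattern. `connect` and `FakeRequester` are hypothetical names invented for illustration and are not part of the project.

```python
from contextlib import contextmanager


class FakeRequester:
    """Hypothetical stand-in for a service client (not the project's Requester)."""

    def __init__(self):
        self.closed = False

    def urlopen(self, url):
        # A real client would perform a remote call here.
        return "response for %s" % url

    def close(self):
        self.closed = True


@contextmanager
def connect(client_cls):
    """Assumed shape only: yield a client and always close it afterwards."""
    client = client_cls()
    try:
        yield client
    finally:
        # Runs even if the body of the with block raises.
        client.close()


with connect(FakeRequester) as c:
    r = c.urlopen("http://example.com/")
```

Because the cleanup sits in a `finally` clause, the client is closed whether or not the request raises, which is why the listings can use the response after the `with` block has ended.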

Example #1

File: scraper.py Project: rranshous/scraper

    def site_spider(self, root_url):
        """
        spider every page of the site we can find, report back
        with links found and their details
        """

        response = o.SpiderResponse(url=root_url)
        response.pages = []

        # starting @ the root spider all the sites we can find w/in
        # the domain
        links = self.link_spider(root_url, 1000, True)

        # all that data is nice and cached so we can reprocess it
        for link in links + [root_url]:
            page = o.Page(url=link)
            page.links = self.get_links(link)
            page.images = self.get_images(link)
            try:
                with srvs_connect(Requester) as c:
                    r = c.urlopen(ro.Request(link))
                page.response = r
            except o.Exception as ex:
                # problem w/ response = no response
                print("o.request exception: %s %s" % (link, ex.msg))
            except Exception as ex:
                print("request exception: %s %s" % (link, ex))

Example #2

File: scraper.py Project: rranshous/scraper

    def site_spider(self, root_url):
        """
        spider every page of the site we can find, report back
        with links found and their details
        """

        response = o.SpiderResponse(url=root_url)
        response.pages = []

        # starting @ the root spider all the sites we can find w/in
        # the domain
        links = self.link_spider(root_url, 1000, True)

        # all that data is nice and cached so we can reprocess it
        for link in links + [root_url]:
            page = o.Page(url=link)
            page.links = self.get_links(link)
            page.images = self.get_images(link)
            try:
                with srvs_connect(Requester) as c:
                    r = c.urlopen(ro.Request(link))
                page.response = r
            except o.Exception as ex:
                # problem w/ response = no response
                print('o.request exception: %s %s' % (link, ex.msg))
            except Exception as ex:
                print('request exception: %s %s' % (link, ex))

Example #3

File: scraper.py Project: rranshous/scraper

    def get_links(self, url):
        """ returns back the href for all links on page """

        url = url.strip()
        print("get_links: %s" % url)

        # if it's an image forget it
        if url.lower().endswith(self.not_html_ext):
            return []

        # request the url
        try:
            with srvs_connect(Requester) as c:
                r = c.urlopen(ro.Request(url))
            if not r:
                return []
        except o.Exception as ex:
            raise o.Exception("o.Could not make request: %s %s" % (url, ex))

Example #4

File: scraper.py Project: rranshous/scraper

    def get_links(self, url):
        """ returns back the href for all links on page """

        url = url.strip()
        print('get_links: %s' % url)

        # if it's an image forget it
        if url.lower().endswith(self.not_html_ext):
            return []

        # request the url
        try:
            with srvs_connect(Requester) as c:
                r = c.urlopen(ro.Request(url))
            if not r:
                return []
        except o.Exception as ex:
            raise o.Exception('o.Could not make request: %s %s' % (url, ex))

Example #5

File: scraper.py Project: rranshous/scraper

    def get_images(self, url):
        """ returns back the src for all images on page """

        url = url.strip()
        print("get_images: %s" % url)

        # only care to parse html pages
        if url.lower().endswith(self.not_html_ext):
            return []

        # request the url
        try:
            print("get image making request: %s" % url)
            with srvs_connect(Requester) as c:
                r = c.urlopen(ro.Request(url))
            if not r:
                print("get image no response: %s" % url)
                return []
        except o.Exception as ex:
            print("ex")
            raise o.Exception("o.Could not make request: %s %s" % (url, ex))

Example #6

File: scraper.py Project: rranshous/scraper

    def get_images(self, url):
        """ returns back the src for all images on page """

        url = url.strip()
        print('get_images: %s' % url)

        # only care to parse html pages
        if url.lower().endswith(self.not_html_ext):
            return []

        # request the url
        try:
            print('get image making request: %s' % url)
            with srvs_connect(Requester) as c:
                r = c.urlopen(ro.Request(url))
            if not r:
                print('get image no response: %s' % url)
                return []
        except o.Exception as ex:
            print('ex')
            raise o.Exception('o.Could not make request: %s %s' % (url, ex))