Example #1
# Constructor of the CrawlerThread class shown in full in Example #2.
def __init__(self, binarySemaphore, url):
    # Initialize the parent Thread before setting per-thread state.
    threading.Thread.__init__(self)
    self.binarySemaphore = binarySemaphore
    self.url = url
    self.threadId = hash(self)         # id used to tag this thread's log output
    self.mongo = MongoFns()            # project-local MongoDB helper
    self.http = urllib3.PoolManager()  # urllib3 connection pool for requests
Example #2
import threading

import urllib3
from bs4 import BeautifulSoup

# MongoFns is a project-local MongoDB helper; its module is not shown here.


class CrawlerThread(threading.Thread):
    def __init__(self, binarySemaphore, url):
        threading.Thread.__init__(self)
        self.binarySemaphore = binarySemaphore
        self.url = url
        self.threadId = hash(self)
        self.mongo = MongoFns()
        self.http = urllib3.PoolManager()

    def run(self):
        try:
            print('Getting %s' % self.url)
            response = self.http.request('GET', self.url)
        except Exception as e:
            # A failed request leaves no response object to mark with a
            # status code, so log the error and let this thread finish.
            print('Failed to get %s: %s' % (self.url, e))
            return
        if response.status == 200:
            soup = BeautifulSoup(response.data.decode('utf-8'), "html.parser")
            # Pick up an RSS feed advertised in the page head, if any.
            feedTag = soup.find('link', rel="alternate", type="application/rss+xml")
            feedUrl = feedTag['href'] if feedTag is not None and feedTag.has_attr('href') else ''
            # Collect the target of every anchor on the page.
            links = []
            for tag in soup.find_all('a'):
                if tag.has_attr('href'):
                    links.append(tag['href'])
            print('%s : %s' % (self.threadId, len(links)))
            # Hold the semaphore while writing so database writes from
            # concurrent crawler threads do not interleave.
            with self.binarySemaphore:
                print('%s acquired lock' % self.threadId)
                self.mongo.saveCrawl(self.url, feedUrl, response.data, links)
                print('%s written to db' % self.threadId)
            print('%s lock released' % self.threadId)

            # Spawn a new crawler thread for every link found on this page.
            # Note: this recursion is unbounded and revisits pages it has
            # already seen; a real crawler would track visited URLs.
            for link in links:
                CrawlerThread(self.binarySemaphore, link).start()
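
To actually start the crawl, an entry point along these lines would work. This is a minimal sketch: the seed URL and the binary semaphore setup below are illustrative assumptions, not part of the original examples.

import threading

# Hypothetical entry point: seed the crawl with one URL and share a binary
# semaphore (initial count 1) among all crawler threads for database writes.
if __name__ == '__main__':
    binarySemaphore = threading.Semaphore(1)
    seedUrl = 'https://example.com/'  # illustrative seed URL (assumption)
    CrawlerThread(binarySemaphore, seedUrl).start()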