예제 #1
0
파일: Hkn24.py 프로젝트: zsef123/News-Cral
 def getPageCount(self):
     """Return the number of search-result pages for ``self.query`` on hkn24.com.

     Fetches page 1 of the article list, reads the total article count
     rendered in the ``<font color="#333333">`` element, and divides by 20
     (articles shown per page, floor division).

     Returns:
         int: number of result pages (floor of total articles / 20).
     """
     searchUrl = "http://www.hkn24.com/news/articleList.html?page=1&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_area=A&sc_level=&sc_article_type=&sc_view_level=&sc_sdate=&sc_edate=&sc_serial_number=&sc_word=" + self.query + "&sc_view_code=&view_type="
     soup = Soup.phantomjs(searchUrl)
     # r"\D" (raw string) strips every non-digit character; the original
     # plain "\D" literal is an invalid escape sequence that raises a
     # SyntaxWarning on Python 3.12+.
     pageCount = int(
         re.sub(r"\D", "",
                soup.find('font', color='#333333').get_text()))
     # Integer floor division replaces the redundant math.floor(x / 20).
     return pageCount // 20
예제 #2
0
 def getPageCount(self):
     """Return the number of search-result pages for ``self.query`` on khanews.com.

     Fetches the article list, locates the white table cell that carries the
     total article count in a ``<font color="#333333">`` element, and divides
     by 20 (articles per page, floor division).

     Returns:
         int: number of result pages (floor of total articles / 20).
     """
     searchUrl = "http://www.khanews.com/news/articleList.html?page=&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_area=A&sc_level=&sc_article_type=&sc_view_level=&sc_sdate=2000.01.01&sc_edate=&sc_serial_number=&sc_word=" + self.query + "&view_type="
     soup = Soup.phantomjs(searchUrl)
     tbodys = soup.find("td", bgcolor="#FFFFFF")
     # r"\D" (raw string) fixes the invalid escape sequence "\D", which
     # triggers a SyntaxWarning on Python 3.12+.
     pageCount = int(
         re.sub(r"\D", "",
                tbodys.find("font", color="#333333").get_text()))
     # Integer floor division replaces the redundant math.floor(x / 20).
     return pageCount // 20
예제 #3
0
파일: Hkn24.py 프로젝트: zsef123/News-Cral
 def getPageHrefs(self, count):
     """Return the article URLs listed on result page *count* of the
     hkn24.com search for ``self.query``.
     """
     prefix = "http://www.hkn24.com/news/"
     searchUrl = (prefix + "articleList.html?page=" + str(count)
                  + "&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_area=A&sc_level=&sc_article_type=&sc_view_level=&sc_sdate=&sc_edate=&sc_serial_number=&sc_word="
                  + self.query + "&sc_view_code=&view_type=")
     soup = Soup.phantomjs(searchUrl)
     # Each result row holds its link inside a <td class="ArtList_Title">.
     hrefs = []
     for cell in soup.find_all('td', class_='ArtList_Title'):
         hrefs.append(prefix + cell.a.get('href'))
     return hrefs
예제 #4
0
 def getPageHrefs(self, count):
     """Return the article URLs listed on result page *count* of the
     doctorsnews.co.kr search for ``self.query`` between ``self.startDate``
     and ``self.endDate``.
     """
     prefix = "http://www.doctorsnews.co.kr/news/"
     searchUrl = (prefix + "articleList.html?page=" + str(count)
                  + "&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_add_section_code=&sc_add_sub_section_code=&sc_add_serial_code=&sc_area=A&sc_level=&sc_m_level=&sc_article_type=&sc_view_level=&sc_sdate="
                  + self.startDate + "&sc_edate=" + self.endDate
                  + "&sc_serial_number=&sc_word=" + self.query
                  + "&sc_word2=&sc_andor=OR&sc_order_by=I&view_type=")
     soupArticle = Soup.phantomjs(searchUrl)
     # Result links carry the class "news_list_title" directly on the <a> tag.
     anchors = soupArticle.find_all("a", class_="news_list_title")
     hrefs = []
     for anchor in anchors:
         hrefs.append(prefix + anchor.get('href'))
     return hrefs
예제 #5
0
 def getPageHrefs(self, count):
     """Return the article URLs listed on result page *count* of the
     khanews.com search for ``self.query``.
     """
     prefix = "http://www.khanews.com/news/"
     searchUrl = (prefix + "articleList.html?page=" + str(count)
                  + "&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_area=A&sc_level=&sc_article_type=&sc_view_level=&sc_sdate=2000.01.01&sc_edate=&sc_serial_number=&sc_word="
                  + self.query + "&view_type=")
     soup = Soup.phantomjs(searchUrl)
     # Title text sits in <font color="#001DD0"> elements inside the white
     # result cell; the enclosing parent of each font tag is the <a> link.
     resultCell = soup.find("td", bgcolor="#FFFFFF")
     fonts = resultCell.find_all("font", color="#001DD0")
     hrefs = []
     for font in fonts:
         hrefs.append(prefix + font.parent.get('href'))
     return hrefs
예제 #6
0
 def getPageCount(self):
     """Return the number of search-result pages for ``self.query`` on
     doctorsnews.co.kr, restricted to ``self.startDate``..``self.endDate``.

     Reads the total article count from the ``<tr height="35">`` header row
     and divides by 25 (articles shown per page, floor division).

     Returns:
         int: number of result pages (floor of total articles / 25).
     """
     searchUrl = "http://www.doctorsnews.co.kr/news/articleList.html?page=1&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_add_section_code=&sc_add_sub_section_code=&sc_add_serial_code=&sc_area=A&sc_level=&sc_m_level=&sc_article_type=&sc_view_level=&sc_sdate=" + self.startDate + "&sc_edate=" + self.endDate + "&sc_serial_number=&sc_word=" + self.query + "&sc_word2=&sc_andor=OR&sc_order_by=I&view_type="
     soup = Soup.phantomjs(searchUrl)
     articleCnt = soup.find("tr", height="35").td.get_text()
     # r"\D" (raw string) fixes the invalid escape sequence "\D", which
     # triggers a SyntaxWarning on Python 3.12+.
     maxArticle = int(re.sub(r"\D", "", articleCnt))
     # Floor division replaces the doubly-redundant math.floor(int(x / 25)):
     # int() already truncated, and math.floor of an int is a no-op.
     return maxArticle // 25
예제 #7
0
 def getPageHrefs(self, count):
     """Return the article URLs listed on result page *count* of the
     doctorstimes.com search for ``self.query``.
     """
     prefix = "http://www.doctorstimes.com/news/"
     searchUrl = (prefix + "articleList.html?page=" + str(count)
                  + "&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_area=A&sc_level=&sc_article_type=&sc_view_level=&sc_sdate=&sc_edate=&sc_serial_number=&sc_word="
                  + self.query + "&sc_word2=&sc_andor=&sc_order_by=E&view_type=")
     soup = Soup.phantomjs(searchUrl, 'html.parser')
     # Each result row holds its link inside <td class="list-titles list-pad-5">.
     hrefs = []
     for cell in soup.find_all('td', class_='list-titles list-pad-5'):
         hrefs.append(prefix + cell.a.get('href'))
     return hrefs
예제 #8
0
 def getPageCount(self):
     """Return the number of search-result pages for ``self.query`` on
     doctorstimes.com.

     Locates the total article count via a CSS selector into the result
     table and divides by 20 (articles shown per page, floor division).

     Returns:
         int: number of result pages (floor of total articles / 20).
     """
     searchUrl = "http://www.doctorstimes.com/news/articleList.html?page=1&sc_section_code=&sc_sub_section_code=&sc_serial_code=&sc_area=A&sc_level=&sc_article_type=&sc_view_level=&sc_sdate=&sc_edate=&sc_serial_number=&sc_word=" + self.query + "&sc_word2=&sc_andor=&sc_order_by=E&view_type="
     soup = Soup.phantomjs(searchUrl)
     # r"\D" (raw string) fixes the invalid escape sequence "\D", which
     # triggers a SyntaxWarning on Python 3.12+.
     pageCount = int(re.sub(r"\D", "", soup.select("#article-list > tbody > tr > td > table > tbody > tr:nth-of-type(1) > td > table > tbody > tr > td:nth-of-type(1)")[0].get_text()))
     # Integer floor division replaces the redundant math.floor(x / 20).
     return pageCount // 20