Python cleanUnicode Examples

Programming language: Python

Namespace/package name: StringCleaner

Method/function: cleanUnicode

Examples at hotexamples.com: 3

Python cleanUnicode - 3 examples found. These are the top-rated real-world Python examples of StringCleaner.cleanUnicode, extracted from open source projects.
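
None of the examples on this page show the body of cleanUnicode itself; they only call it on scraped text before converting it to a number or returning it. Purely as an illustration of what such a helper might do, here is a minimal sketch. The name clean_unicode_sketch and its behavior (normalize, drop non-ASCII characters, trim whitespace) are assumptions, not the StringCleaner implementation, and the sketch targets Python 3 even though the examples are Python 2.

```python
import unicodedata


def clean_unicode_sketch(text):
    """Hypothetical stand-in, NOT the project's cleanUnicode."""
    # Decompose accented and compatibility characters (e.g. "é" -> "e" + accent).
    normalized = unicodedata.normalize("NFKD", text)
    # Drop everything that has no ASCII form, then trim surrounding whitespace.
    return normalized.encode("ascii", "ignore").decode("ascii").strip()


# Usage: clean scraped text before further processing.
print(clean_unicode_sketch("caf\u00e9 "))
print(int(clean_unicode_sketch("\u00a042 ")))
```

A helper like this would make calls such as int(cleanUnicode(dataValue)) tolerant of stray non-breaking spaces in scraped values, which may be why the examples apply it before parsing.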

Example #1

File: IMVDBDataExtractor.py Project: exsonic/BillboardPredictor

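 def extractStatDataFromScript(self, script):
     # Scan the embedded chart script for its 'categories' (dates) and 'data' (values) arrays.
     lines = script.split('\n')
     dateList = []
     dataList = []
     for line in lines:
         if 'categories: [' in line:
             # Take the text between '[' and ']'. The "- 1" on the end index also
             # drops the character immediately before ']'.
             dateLine = line[line.find('[') + 1 : line.find(']') - 1]
             dateList = [IMVDBDateStringToDate(dateString) for dateString in dateLine.split(',')]
         elif 'data: [' in line:
             dataLine = line[line.find('[') + 1 : line.find(']') - 1]
             dataList = [int(cleanUnicode(dataValue)) for dataValue in dataLine.split(',')]
             # Stop scanning once the data array has been read.
             break
     # Pair each date with its value, then filter the pairs by week.
     rawDataList = zip(dateList, dataList)
     return self.filterDataByWeek(rawDataList)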
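
The bracket-slicing step in this example can be tried in isolation. The following is a minimal Python 3 sketch, not the project's code: parse_bracketed_values and the sample script text (including its date format) are made up for illustration, and unlike the original it slices up to the closing bracket rather than one character before it.

```python
def parse_bracketed_values(line):
    """Return the comma-separated items between the first '[' and the first ']'."""
    start = line.find('[')
    end = line.find(']')
    if start == -1 or end == -1 or end < start:
        return []
    inner = line[start + 1:end]
    # Trim whitespace and skip empty items such as a trailing comma would leave.
    return [item.strip() for item in inner.split(',') if item.strip()]


# Hypothetical input shaped like a chart configuration script.
sample_script = """
categories: ['2013-01-05', '2013-01-12', '2013-01-19'],
data: [10, 25, 40]
"""

dates, values = [], []
for line in sample_script.split('\n'):
    if 'categories: [' in line:
        dates = [item.strip("'") for item in parse_bracketed_values(line)]
    elif 'data: [' in line:
        values = [int(item) for item in parse_bracketed_values(line)]
        break  # same early exit as the example: data is read last

pairs = list(zip(dates, values))
print(pairs)
```

In Python 3, zip returns an iterator, so the sketch wraps it in list() before printing; the Python 2 example gets a list from zip directly.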

Example #2

File: IMVDBDataExtractor.py Project: exsonic/BillboardPredictor

 def extractDetailStatData(self, tables, URL):
     # Record the week (dateToSaturday of today's date) and the page URL,
     # then add the counts found in each table.
     detailStatDict = {'week' : dateToSaturday(datetime.today()), 'URL' : URL}
     for table in tables:
         tableText = cleanUnicode(table.text)
         if 'Views' in tableText:
             # Table containing the view and comment counts.
             detailStatDict['MVViewCount'] = self.getDetailStatTableData(tableText, 'Views')
             detailStatDict['MVCommentCount'] = self.getDetailStatTableData(tableText, 'Comments')
         else:
             # Any other table is read as the social-media counts table.
             detailStatDict['FBLikeCount'] = self.getDetailStatTableData(tableText, 'Facebook Like Count')
             detailStatDict['FBShareCount'] = self.getDetailStatTableData(tableText, 'Facebook Share Count')
             detailStatDict['FBCommentCount'] = self.getDetailStatTableData(tableText, 'Facebook Comment Count')
             detailStatDict['TwitterCount'] = self.getDetailStatTableData(tableText, 'Twitter')
             detailStatDict['GooglePlusCount'] = self.getDetailStatTableData(tableText, 'GooglePlusOne')
     return detailStatDict

Example #3

File: MusicReviewsExtractor.py Project: exsonic/BillboardPredictor

 def extractContent(self, textDict):
     try:
         page = urllib2.urlopen(textDict["URL"])
         soup = BeautifulSoup(page.read())
         # Articles and other entries use different container classes.
         if textDict["type"] == "article":
             body = soup.find(attrs={"class": "article-body"})
         else:
             body = soup.find(attrs={"class": "entry"})
         text = ""
         for content in body.contents:
             # Iterate over the body's direct children and keep only Tag nodes named "p".
             if "Tag" in type(content).__name__ and content.name == "p":
                 text += content.text
     except Exception as e:
         # The URL may be invalid, or the expected container may be missing.
         print(e)
         text = ""
     return cleanUnicode(text)
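
The filtering rule in this example (keep only the text of p elements that are direct children of the container) can be illustrated without network access or BeautifulSoup. The sketch below swaps in Python 3's standard html.parser instead of the library used above; the class name DirectParagraphText and the sample markup are made up for illustration, and it assumes well-formed HTML in which every non-void element is explicitly closed.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not affect depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class DirectParagraphText(HTMLParser):
    """Collect text of <p> elements that are direct children of a container."""

    def __init__(self, container_class):
        super().__init__()
        self.container_class = container_class
        self.depth = 0  # 0 = outside the container, 1 = directly inside it
        self.in_paragraph = False
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth == 0:
            # Look for the container by class name.
            classes = (dict(attrs).get("class") or "").split()
            if self.container_class in classes:
                self.depth = 1
            return
        if self.depth == 1 and tag == "p":
            self.in_paragraph = True
        self.depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self.depth == 0:
            return
        self.depth -= 1
        if self.depth == 1:
            # Back at the container's top level: any direct child has closed.
            self.in_paragraph = False

    def handle_data(self, data):
        if self.in_paragraph:
            self.parts.append(data)


# Hypothetical page fragment: only the two top-level paragraphs should be kept.
sample_html = ('<div class="entry"><p>First <b>bold</b>.</p>'
               '<div><p>nested</p></div><p>Second.</p></div><p>outside</p>')
parser = DirectParagraphText("entry")
parser.feed(sample_html)
text = "".join(parser.parts)
print(text)
```

As in the example, paragraph texts are concatenated without a separator, and paragraphs nested deeper inside the container or located outside it are skipped.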