Python Article Examples

Programming language: Python

Namespace/package name: news_app.models

Class/type: Article

Examples on hotexamples.com: 2

Python Article - 2 examples found. These are the top-rated real-world Python examples of news_app.models.Article, extracted from open source projects.

Frequently used methods

link(2)

source(2)

thumbnail(2)

title(2)

fetched_on(1)

save(1)

view_count(1)

Example #1

File: test-rfi.py Project: ptchankue/acnews


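# Imports the excerpt relies on but does not show (Python 2).
# bs4 is assumed, since the 'html.parser' argument is the BeautifulSoup 4 form.
import urllib2
from bs4 import BeautifulSoup
from news_app.models import Article

url = 'http://rfi.fr/afrique'
page = urllib2.urlopen(url)
soup = BeautifulSoup(page.read(), 'html.parser')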

sections = soup.findAll('section', {'id':'news'})

#anchors = [td.find('a') for td in soup.findAll('li', {'data-bo-type':'article'})]
articles = soup.findAll('li', {'data-bo-type':'article'})
print 'Number of articles:', len(articles)
for article in articles:
	if article:
		a = article.find('a')
		# find() returns None when the item has no <a>, so check before calling get()
		if a and a.get('title'):
			post = Article()
			print a.get('title').encode('utf-8')
			post.title =  a.get('title').encode('utf-8')

			print 'Link:', a['href']
			post.link = 'http://rfi.fr' + a['href']

			print a.get('data-height')
			if a.get('data-image'):
				print 'Image:', a.get('data-image')
				post.thumbnail =  a.get('data-image')

			posts = Article.objects.filter(link = post.link)
			if posts.count()==0:
				post.source = 'RFI Afrique'
				post.view_count = 0
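
The flow in this excerpt (pick the first anchor of each article item, build a record, skip links that are already stored) can be sketched without network access or a database. The sketch below is written for Python 3, whereas the excerpt is Python 2. It swaps in the standard library's `html.parser.HTMLParser` for BeautifulSoup and a plain `set` for the `Article.objects.filter(link=...)` lookup. `ArticleRecord`, `ArticleAnchorParser`, `extract_new_articles`, and the sample HTML are hypothetical names and data introduced only for illustration; they are not part of the acnews project.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import List, Optional

BASE_URL = 'http://rfi.fr'  # same prefix the excerpt prepends to each href


@dataclass
class ArticleRecord:
    """Hypothetical stand-in for news_app.models.Article (not the real model)."""
    title: str
    link: str
    thumbnail: Optional[str] = None
    source: str = 'RFI Afrique'
    view_count: int = 0


class ArticleAnchorParser(HTMLParser):
    """Collects the first <a> inside each <li data-bo-type="article">."""

    def __init__(self):
        super().__init__()
        self.records: List[ArticleRecord] = []
        self._li_depth = 0    # <li> nesting depth inside an article item; 0 = outside
        self._taken = False   # True once the current item's first <a> has been seen

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == 'li':
            if self._li_depth:
                self._li_depth += 1
            elif attrs.get('data-bo-type') == 'article':
                self._li_depth = 1
                self._taken = False
        elif tag == 'a' and self._li_depth and not self._taken:
            self._taken = True
            title, href = attrs.get('title'), attrs.get('href')
            # Like the excerpt, only anchors that carry a title produce a record.
            if title and href:
                self.records.append(ArticleRecord(
                    title=title,
                    link=BASE_URL + href,
                    thumbnail=attrs.get('data-image'),
                ))

    def handle_endtag(self, tag):
        if tag == 'li' and self._li_depth:
            self._li_depth -= 1


def extract_new_articles(html, known_links):
    """Return records whose link is not yet in known_links, and remember them."""
    parser = ArticleAnchorParser()
    parser.feed(html)
    parser.close()
    fresh = []
    for record in parser.records:
        if record.link not in known_links:
            known_links.add(record.link)
            fresh.append(record)
    return fresh


# Invented sample markup, shaped like the attributes the excerpt reads.
SAMPLE_HTML = '''
<ul>
  <li data-bo-type="article"><a href="/afrique/one" title="First" data-image="http://img.example/1.jpg">x</a></li>
  <li data-bo-type="article"><a href="/afrique/two">no title</a></li>
  <li data-bo-type="other"><a href="/afrique/three" title="Skipped">x</a></li>
  <li data-bo-type="article"><a href="/afrique/one" title="First again">x</a></li>
</ul>
'''

new_articles = extract_new_articles(SAMPLE_HTML, known_links=set())
```

In a Django project the `set` would presumably be replaced by a query on the model, for example `Article.objects.filter(link=...).exists()`, before the record is saved.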

Example #2

File: test-cameroon-info.py Project: ptchankue/acnews

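# Imports the excerpt relies on but does not show (Python 2; bs4 is assumed)
import sys
import urllib2

from bs4 import BeautifulSoup

# Python 2 only: make UTF-8 the default string encoding
reload(sys)
sys.setdefaultencoding('utf8')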

from news_app.models import Article


url = 'http://cameroon-info.net'
page = urllib2.urlopen(url)
soup = BeautifulSoup(page.read(), 'html.parser')
#soup = BeautifulSoup(page.read(), convertEntities=BeautifulSoup.HTML_ENTITIES)
#soup = soup.prettify(formatter="html")
articles = soup.find_all('td', {'width': '475'})
for a in articles[7:]:
	print '-'*60
	if a:
		post = Article()
		link = a.find('a', {'class': 'morehltitle2012'})
		if link :
			print 'Link ', link.get('href')
			post.link = url + link.get('href')

		desc = a.find('div', {'class': 'morehldesc'})
		if desc :
			print 'Title:\n', desc.get_text().encode('utf-8')
			post.title = desc.get_text().encode('utf-8')
			if desc.img:
				print '\n\nImage', url + desc.img.get('src')
				post.thumbnail = url + desc.img.get('src')
		source = a.find('div', {'class': 'morehlsource'})
		if source:
			print source.encode('utf-8')
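
This excerpt builds absolute links by concatenating `url` and the `href` value, which only works when every `href` starts with a slash, because `url` has no trailing one. A possible alternative, shown here as a Python 3 sketch, is `urllib.parse.urljoin` (in Python 2 the same function lives in the `urlparse` module). The helper name `absolute_link` and the sample `href` values are hypothetical and not taken from the site.

```python
from urllib.parse import urljoin

BASE_URL = 'http://cameroon-info.net'  # same base URL as the excerpt


def absolute_link(href, base=BASE_URL):
    """Resolve an href found on the page against the site's base URL."""
    return urljoin(base, href)


relative = absolute_link('article/sample.html')      # hypothetical href without a leading slash
rooted = absolute_link('/article/sample.html')       # hypothetical href with a leading slash
external = absolute_link('http://example.org/page')  # already absolute, returned unchanged
```

With plain concatenation the first case would come out as `http://cameroon-info.netarticle/sample.html`, whereas `urljoin` inserts the missing separator and leaves absolute URLs alone.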