def article_info(self, url):
    """Extract article data from *url* using newspaper's ``Article``.

    Tries a normal download first (with ``self.HEADERS``' User-Agent); if
    that fails, retries once pretending to be Googlebot, since some sites
    block the default UA.

    Args:
        url: URL of the article to scrape.

    Returns:
        Dict with keys ``title``, ``keywords``, ``summary``, ``full_text``
        and ``meta_descr`` on success (or whatever ``self.yahoo_get_text``
        returns for Yahoo Finance URLs), or ``{'error': 'article skipped'}``
        when the article cannot be scraped.
    """
    article = None  # so the outer handler can log even if Article() itself failed
    try:
        try:
            article = Article(
                url, browser_user_agent=self.HEADERS['User-Agent'])
            # fixes issue with bloomberg: redirect to the Quint mirror
            if 'bloomberg.com' in article.url:
                article.url = article.url.replace(
                    'www.bloomberg.com', 'www.bloombergquint.com')
            article.download()
            article.parse()
        except Exception as e:
            logging.info(e)
            # sometimes we need to use googlebot if an error occurs
            article = Article(
                url,
                browser_user_agent=
                'Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)'
            )
            article.download()
            article.parse()
            logging.info("%s... article scraped with googlebot",
                         article.url[8:40])

        # BUGFIX: this was previously in a `finally:` block, so it also ran
        # when the googlebot retry failed, and article.nlp() on an
        # un-downloaded article raised a second exception masking the real
        # error. It now runs only after a successful download+parse.
        article.nlp()
        # prevents Yahoo finance articles from being scraped incorrectly
        if 'finance.yahoo.com' in article.url:
            return self.yahoo_get_text(article)
        keywords = ", ".join(article.keywords)
        return {
            'title': article.title,
            'keywords': keywords,
            'summary': article.summary,
            'full_text': article.text,
            'meta_descr': article.meta_description,
        }
    except Exception as e:
        # `article` may be None if Article() construction failed both times;
        # fall back to the raw url so logging never raises here.
        failed_url = article.url if article is not None else url
        logging.info("%s... article skipped due to error: %s", failed_url, e)
        return {'error': 'article skipped'}