Python isContainKeyword Examples

Programming language: Python

Namespace/Package: base_parser

Method/Function: isContainKeyword

Examples at hotexamples.com: 5

Python isContainKeyword: 5 examples found. These are the top-rated real-world examples of base_parser.isContainKeyword extracted from open source projects.

Example #1

File: myblogParser.py Project: yangshenhuai/myHackNews

from bs4 import BeautifulSoup

import base_parser  # project module; import path assumed, not shown in the excerpt


def parse(html, keywords, url_prefix):
    soup = BeautifulSoup(html, 'html.parser')
    header_list = soup.find_all('header', attrs={'class': 'entry-header'})
    results = {}
    for header in header_list:
        for child in header.descendants:
            # Only links whose text matches one of the keywords are of interest.
            if child.name == 'a' and base_parser.isContainKeyword(keywords, child.text):
                if not base_parser.isContainKeyword('tags', child['href']):
                    # titles
                    results[child.text] = url_prefix + child['href']
                else:
                    # tags: record the first link of the header instead
                    results[header.find('a').text] = url_prefix + header.find('a')['href']
    return results

Example #2

File: myblogParser.py Project: yangshenhuai/myHackNews

def parse(html, keywords, url_prefix):
    soup = BeautifulSoup(html, 'html.parser')
    header_list = soup.find_all('header', attrs={'class': 'entry-header'})
    results = {}
    for header in header_list:
        for child in header.descendants:
            if child.name == 'a' and base_parser.isContainKeyword(
                    keywords, child.text):
                if not base_parser.isContainKeyword('tags', child['href']):
                    #titles
                    results[child.text] = url_prefix + child['href']
                else:
                    #tags
                    results[header.find(
                        'a').text] = url_prefix + header.find('a')['href']
    return results

Example #3

File: hnParser.py Project: yangshenhuai/myHackNews

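from bs4 import BeautifulSoup

import base_parser  # project module; import path assumed, not shown in the excerpt


def parse(html, keywords, url_prefix):
    soup = BeautifulSoup(html, 'html.parser')
    results = {}
    title_list = soup.find_all('td', attrs={'class': 'title'})
    for title in title_list:
        a = title.find('a')
        # Keep only keyword-matching links that point to an absolute URL.
        if (a is not None
                and base_parser.isContainKeyword(keywords, a.text)
                and a['href'].startswith('http')):
            results[a.text] = a['href']
    return results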

Example #4

File: infoqNewsParser.py Project: yangshenhuai/myHackNews

def parse(html, keywords, url_prefix):
    soup = BeautifulSoup(html, "html.parser")
    news_blocks = soup.find_all("div", class_="news_type_block")
    results = {}
    for block in news_blocks:
        h2_block = block.contents[1]
        title_block = h2_block.contents[1]
        if base_parser.isContainKeyword(keywords, title_block.text):
            results[base_parser.simplify_text(title_block.text)] = base_parser.get_url(
                url_prefix, title_block["href"], "/news"
            )
    return results

Example #5

File: infoqNewsParser.py Project: yangshenhuai/myHackNews

def parse(html, keywords, url_prefix):
    soup = BeautifulSoup(html, 'html.parser')
    news_blocks = soup.find_all('div', class_='news_type_block')
    results = {}
    for block in news_blocks:
        h2_block = block.contents[1]
        title_block = h2_block.contents[1]
        if base_parser.isContainKeyword(keywords, title_block.text):
            results[base_parser.simplify_text(
                title_block.text)] = base_parser.get_url(
                    url_prefix, title_block['href'], '/news')
    return results
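
None of the examples above shows the body of base_parser.isContainKeyword itself; they only show how it is called: with a keywords argument and a text, or with a single string such as 'tags' and a URL. The following is a minimal sketch of what such a helper might look like, not the project's actual implementation. The name is_contain_keyword_sketch, the case-insensitive matching, and the acceptance of either one string or an iterable of strings are all assumptions made for illustration.

```python
def is_contain_keyword_sketch(keywords, text):
    """Hypothetical stand-in for base_parser.isContainKeyword.

    Returns True if any keyword occurs as a substring of text.
    Assumptions (not confirmed by the source): matching is
    case-insensitive, and `keywords` may be a single string or an
    iterable of strings.
    """
    if text is None:
        return False
    # Allow a bare string such as 'tags' as well as a list of keywords.
    if isinstance(keywords, str):
        keywords = [keywords]
    lowered = text.lower()
    # Skip empty keywords, which would otherwise match every text.
    return any(k.lower() in lowered for k in keywords if k)


if __name__ == "__main__":
    print(is_contain_keyword_sketch(["python", "rust"], "Learning Python fast"))
    print(is_contain_keyword_sketch("tags", "/2015/05/some-post"))
```

With a helper of this shape, both call styles seen in the examples work unchanged: a list of keywords checked against a link text, and a single marker string checked against an href.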