Python url_to_absolute Examples

Programming Language: Python

Namespace/Package Name: project.crawler.auxfunctions

Method/Function: url_to_absolute

Examples at hotexamples.com: 2

Python url_to_absolute - 2 examples found. These are the top-rated real-world Python examples of project.crawler.auxfunctions.url_to_absolute, extracted from open source projects.

Example #1

File: crawlerpravda.py

Project: radomirbosak/trendy

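 def get_links_from_soup(self, soup, baseurl):
     """Find article links in the parsed HTML. Return a list of absolute URL strings."""
     # A <base href="..."> element, if present, overrides the caller-supplied base URL.
     base = soup.head.find('base') if soup.head is not None else None
     if base is not None and base.get('href'):
         baseurl = base['href']

     # "linky" means "links": collect the article preview containers.
     linky = (soup.findAll('div', attrs={'class': 'article-preview-top'})
              + soup.findAll('div', attrs={'class': 'article-preview'}))

     # Take the first <a> of each container; skip containers without a usable link.
     linky = [x.find('a') for x in linky]
     return [url_to_absolute(x['href'], baseurl)
             for x in linky if x is not None and x.get('href')]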

Example #2

File: crawlersme.py

Project: radomirbosak/trendy

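 def get_links_from_soup(self, soup, baseurl):
     """Find headline links in the parsed HTML. Return a list of absolute URL strings."""
     div = soup.find('div', attrs={'id': 'contentw'})
     if div is None:
         # Content container not found; nothing to extract.
         return []
     pole = []  # "pole" means "array": the collected URLs
     for h3 in div.findAll('h3'):
         # Prefer the main headline link; fall back to the first link in the heading.
         a = h3.find('a', attrs={'class': 'mainHeadline'})
         if a is None:
             a = h3.find('a')
         if a is not None and a.get('href'):
             pole.append(url_to_absolute(a['href'], baseurl))
     return pole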
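
Both examples call url_to_absolute(href, baseurl), but the helper itself is not shown. The following is a minimal sketch of what such a helper might look like, assuming it simply resolves a possibly relative link against a base URL. The body, the use of urllib.parse.urljoin, and the example.com URLs are assumptions made for illustration, not the project's actual code.

```python
from urllib.parse import urljoin


def url_to_absolute(url, baseurl):
    """Hypothetical stand-in for project.crawler.auxfunctions.url_to_absolute.

    Assumption: the real helper resolves `url` against `baseurl`; its actual
    implementation is not shown in the examples above.
    """
    # urljoin takes the base first, then the (possibly relative) reference.
    return urljoin(baseurl, url)


if __name__ == "__main__":
    base = "https://example.com/spravy/index.html"
    print(url_to_absolute("/clanok/1", base))    # root-relative link
    print(url_to_absolute("detail.html", base))  # document-relative link
    print(url_to_absolute("https://other.example/x", base))  # already absolute
```

Under this assumption, a root-relative link replaces the whole path of the base, a document-relative link replaces only its last segment, and an already absolute link is returned unchanged.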