Python HtmlParser.feed Examples

Programming Language: Python

Namespace/Package Name: html_parser

Class/Type: HtmlParser

Method/Function: feed

Examples at hotexamples.com: 2

Python HtmlParser.feed - 2 examples found. These are the top-rated real-world Python examples of html_parser.HtmlParser.feed, extracted from open source projects.

Example #1

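 from urllib.request import urlopen
 from html_parser import HtmlParser  # project module named in the header above

 def parse_html(page_url):
     """Fetch page_url and return (links found on the page, decoded HTML)."""
     html_string = ''
     try:
         response = urlopen(page_url, timeout=5)
         # Default to '' so a missing Content-Type header does not raise TypeError.
         if 'text/html' in response.getheader('Content-Type', ''):
             html_bytes = response.read()
             html_string = html_bytes.decode("utf-8")
         # Spider is defined elsewhere in the original project.
         finder = HtmlParser(Spider.base_url, page_url)
         finder.feed(html_string)
     except Exception as e:
         # Any network, decoding, or parsing failure yields an empty link set.
         print(str(e))
         return set(), html_string
     return finder.page_links(), html_string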
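
Example #1 does not show the HtmlParser class itself. The following is a minimal sketch of what such a class might look like, assuming it subclasses the standard library's html.parser.HTMLParser and resolves each href against the base URL. The attribute name links and the example URLs are hypothetical; the real project's implementation may differ.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class HtmlParser(HTMLParser):
    """Hypothetical link collector; not the original project's class."""

    def __init__(self, base_url, page_url):
        super().__init__()
        self.base_url = base_url
        self.page_url = page_url
        self.links = set()

    def handle_starttag(self, tag, attrs):
        # feed() calls this for every opening tag; attrs is a list of
        # (name, value) pairs.
        if tag != 'a':
            return
        for name, value in attrs:
            if name == 'href' and value:
                # Assumption: relative links are resolved against base_url.
                self.links.add(urljoin(self.base_url, value))

    def page_links(self):
        return self.links


finder = HtmlParser('https://example.com/', 'https://example.com/index.html')
finder.feed('<a href="/about">About</a> <a href="https://example.org/x">X</a>')
links = finder.page_links()
```

In this sketch feed() drives the handle_starttag callbacks, so page_links() only has results after feed() has been called.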

Example #2

    def parse_html(self):
        """
        Parse the HTML content held in self.page.

        Returns True on success, False if a UnicodeDecodeError is raised.
        """

        try:
            parser = HtmlParser(self.url)

            parser.set_pattern(self.pattern)
            parser.set_urls(self.spider_config)
            parser.set_next_depth(self.depth)
            parser.feed(self.page)
            parser.close()
        except UnicodeDecodeError as e:
            logging.error('Thread:{} parse {} failed, msg:{}'.format(
                self.thread_id, self.url, e))
            return False

        return True
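
Example #2 calls feed() and then close(). If its HtmlParser is built on the standard library's html.parser.HTMLParser (an assumption; the source does not show the class), feed() can be called several times with successive chunks, and close() marks the end of input so that any buffered data is processed. A minimal sketch, with the hypothetical helper class TextCollector:

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Hypothetical helper that records text nodes as the parser reports them."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Called for text between tags; one text node may arrive in pieces.
        self.chunks.append(data)


parser = TextCollector()
parser.feed('<p>Hello, ')   # first chunk of the document
parser.feed('world')        # second chunk, e.g. from a later network read
parser.close()              # end of input: process anything still buffered
text = ''.join(parser.chunks)
```

Joining the pieces at the end avoids depending on how the parser happens to split text across handle_data calls.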