Python html_extractor Examples

Programming language: Python

Namespace/package name: html_utilities

Method/function: html_extractor

Examples at hotexamples.com: 7

Python html_extractor - 7 examples found. These are the top-rated real-world Python examples of html_utilities.html_extractor, extracted from open source projects. Rating the examples helps surface higher-quality ones.

Example #1

File: test_html_utilities.py Project: watinha/javascriptproxy

 def test_html_extractor(self):
   """Test that html_extractor returns the HTML of the page at the given URL."""
   test_url = "http://www.google.com"
   # An empty URL should yield an empty string.
   self.assertEqual(html_utilities.html_extractor(""), "")
   response = html_utilities.html_extractor(test_url)
   self.assertTrue(len(response) > 3)
   # The page should start with its doctype declaration.
   self.assertEqual(response[0:4], "<!do")
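
The test above pins down only two behaviours: an empty URL yields an empty string, and a real URL yields text that starts with the page's doctype. The following is a minimal sketch of a function that satisfies that contract. It is not the project's implementation. The `opener` and `timeout` parameters, the UTF-8 decoding, and the choice to return `""` on network errors are all assumptions made for illustration, and `fake_opener` is a hypothetical stub that keeps the example off the network.

```python
import io
from urllib.request import urlopen


def html_extractor(url, opener=urlopen, timeout=10):
    """Hypothetical stand-in: return the page at `url` as text, or "" on failure."""
    if not url:
        # Matches the tested behaviour: empty input gives an empty string.
        return ""
    try:
        with opener(url, timeout=timeout) as response:
            raw = response.read()
    except (ValueError, OSError):
        # ValueError covers malformed URLs; URLError and HTTPError are OSError
        # subclasses. Returning "" here is an assumption, not tested behaviour.
        return ""
    # Assumption: decode as UTF-8 and replace undecodable bytes.
    return raw.decode("utf-8", errors="replace")


def fake_opener(url, timeout=None):
    """Hypothetical stub that returns canned bytes instead of opening a socket."""
    return io.BytesIO(b"<!doctype html><title>stub</title>")


page = html_extractor("http://example.invalid/", opener=fake_opener)
print(page[0:4])  # prints "<!do"
```

Passing the opener as a parameter lets a test exercise the function without depending on a live site such as the one used in the original test.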

Example #2

File: cross_proxy.py Project: watinha/javascriptproxy

  def get(self):
    """
    Receives the GET request with a URI parameter
    """
    parameter = self.request.get('url')
    domain = parse_url(parameter)[0]

    # Fetch the HTML of the requested page
    response = html_extractor(parameter)

    # Chain the JavaScript and CSS replace decorators for this domain
    text_decorator = JsReplaceDecorator(domain, CSSReplaceDecorator(domain))

    #script_text = "<script type='text/javascript' src='/javascripts/replacing_urls.js'></script>"
    #response = response[0:response.find("</head>")] + script_text + response[response.find("</head>"):]

    self.response.headers['Content-Type'] = 'text/html; charset=UTF-8'
    self.response.out.write(text_decorator.decorate_text(response))
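
The handler above wraps one decorator in another and then calls `decorate_text` once on the outer object. The sketch below shows one way such a chain could work. The class names `TextDecorator`, `SrcPrefixDecorator` and `CssUrlPrefixDecorator` are hypothetical and are not the project's `JsReplaceDecorator` or `CSSReplaceDecorator`. The string replacements and the outer-first order of application are likewise assumptions.

```python
class TextDecorator:
    """Hypothetical base: apply this decorator's transform, then the wrapped one."""

    def __init__(self, domain, inner=None):
        self.domain = domain
        self.inner = inner

    def transform(self, text):
        return text

    def decorate_text(self, text):
        text = self.transform(text)
        # Delegate to the wrapped decorator, if there is one.
        return self.inner.decorate_text(text) if self.inner else text


class SrcPrefixDecorator(TextDecorator):
    """Prefix root-relative src attributes with the domain (naive string match)."""

    def transform(self, text):
        return text.replace('src="/', 'src="' + self.domain + '/')


class CssUrlPrefixDecorator(TextDecorator):
    """Prefix root-relative CSS url(...) references with the domain."""

    def transform(self, text):
        return text.replace('url(/', 'url(' + self.domain + '/')


domain = "http://example.com"
chain = SrcPrefixDecorator(domain, CssUrlPrefixDecorator(domain))
html = '<img src="/a.png"><style>p{background:url(/b.png)}</style>'
print(chain.decorate_text(html))
```

Plain string replacement is used only to keep the sketch short. It would mishandle protocol-relative URLs such as `//cdn.example.com/x.js`, so a real rewriter would need a stricter match.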

Example #3

File: cross_proxy.py Project: watinha/javascriptproxy

    def get(self):
        """
        Receives the GET request with a URI parameter
        """
        parameter = self.request.get('url')
        domain = parse_url(parameter)[0]

        # Fetch the HTML of the requested page
        response = html_extractor(parameter)

        # Chain the JavaScript and CSS replace decorators for this domain
        text_decorator = JsReplaceDecorator(domain,
                                            CSSReplaceDecorator(domain))

        #script_text = "<script type='text/javascript' src='/javascripts/replacing_urls.js'></script>"
        #response = response[0:response.find("</head>")] + script_text + response[response.find("</head>"):]

        self.response.headers['Content-Type'] = 'text/html; charset=UTF-8'
        self.response.out.write(text_decorator.decorate_text(response))
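
The handler takes element `[0]` of `parse_url(parameter)` as the domain, but the listing does not show `parse_url` itself. The following is a sketch of a helper with that shape, built on the standard library's `urlsplit`. The `(domain, path)` return value and the inclusion of the scheme in the domain are assumptions. Only the use of index 0 as the domain comes from the code above.

```python
from urllib.parse import urlsplit


def parse_url(url):
    """Hypothetical helper: split a URL into (domain, path)."""
    parts = urlsplit(url)
    if parts.scheme and parts.netloc:
        # Assumption: the "domain" keeps its scheme so it can prefix relative URLs.
        domain = parts.scheme + "://" + parts.netloc
    else:
        domain = ""
    path = parts.path or "/"
    return domain, path


print(parse_url("http://www.example.com/a/b.html?x=1")[0])  # prints "http://www.example.com"
```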

Example #4

File: test_html_utilities.py Project: watinha/javascriptproxy

 def test_encoding_issues(self):
   test_url = "http://www.uol.com"
   response = html_utilities.html_extractor(test_url)
   self.assertTrue(len(response) > 15)
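
The test name suggests that some pages are served in an encoding other than the one the extractor expects. The listing does not show how the project handles this. The sketch below shows one common approach, in which the declared charset is tried first, then UTF-8, then Latin-1. The function name `decode_html`, its `declared_charset` parameter and the fallback order are all assumptions.

```python
def decode_html(raw, declared_charset=None):
    """Hypothetical helper: decode page bytes, tolerating a wrong or missing charset."""
    for encoding in (declared_charset, "utf-8"):
        if not encoding:
            continue
        try:
            return raw.decode(encoding)
        except (UnicodeDecodeError, LookupError):
            # UnicodeDecodeError: bytes are invalid for this encoding.
            # LookupError: the declared charset name is not a known codec.
            continue
    # Latin-1 maps every byte to a character, so this final step cannot fail.
    return raw.decode("latin-1")


latin1_bytes = "a\u00e7\u00e3o".encode("latin-1")
print(decode_html(latin1_bytes) == "a\u00e7\u00e3o")  # prints True
```

The Latin-1 fallback never raises, but it can produce wrong characters for pages in other legacy encodings, so it trades correctness for robustness.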

Example #5

File: test_html_utilities.py Project: watinha/javascriptproxy

 def test_attribute_exception(self):
   test_url = "http://www.watinha.com"
   self.assertEqual(html_utilities.html_extractor(""), "")
   response = html_utilities.html_extractor(test_url)
   self.assertTrue(len(response) > 3)

Example #6

 def get_css(self, url):
     return html_extractor(url)

Example #7

File: css_replace_decorator.py Project: watinha/javascriptproxy

 def get_css(self, url):
   return html_extractor(url)