Python PositionContentExtractor.PositionContentExtractor Examples

Programming language: Python

Namespace/package name: position_content_extractor

Method/function: PositionContentExtractor

Code examples on hotexamples.com: 2

Python PositionContentExtractor.PositionContentExtractor: 2 code examples found. These are real-world Python examples of position_content_extractor.PositionContentExtractor.PositionContentExtractor, all extracted from open source projects and selected from the top-rated ones. Rating the examples helps surface higher-quality code examples.

Frequently used methods

PositionContentExtractor(2)

_get_content(1)

Code example #1

File: test_content_extractor.py Project: EuanCockburn/ifind

# Fragment of a unittest.TestCase subclass; it needs `import logging`
# and PositionContentExtractor to be in scope.
def setUp(self):
    self.logger = logging.getLogger("TestStructuredExtractor")
    # Small fixture page with header, content, footer and nested post divs.
    html = ' <html> <div id="header"><h1>hello world</h1>' \
           '</div><div id="content"><p>this is important</p>' \
           '<p> study computing it is fun</p></div>' \
           '<div id="footer"> <h2>byes</h2></div> ' \
           '<div id="post"> stay <div id="sub-post">should be gone</div>' \
           '</div><footer class="myfoot">at the bottom</footer></html> '
    # Start with no div ids configured on the extractor.
    div_ids = []
    self.extractor = PositionContentExtractor(div_ids=div_ids)
    self.extractor.process_html_page(html)

Code example #2

File: test_content_extractor.py Project: EuanCockburn/ifind

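# Fragment of a test method; self.div_ids and self.html are not defined
# in this snippet and are presumably set elsewhere in the test class.
def test_extract_from_bad_page(self):
    self.extractor = PositionContentExtractor(div_ids=self.div_ids)
    self.extractor.process_html_page(self.html)
    # TODO: pass if no errors?
    # Reconfigure the extractor with a different set of div ids.
    div_ids = ['related', 'skiplink-container']
    self.extractor.set_div_ids(div_ids)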
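
The two snippets above only show how the extractor is called, not how it works. As a rough illustration of the calling pattern, here is a minimal sketch of a stand-in class that exposes the same three entry points seen in the examples (the `div_ids` constructor argument, `set_div_ids`, and `process_html_page`). The class name `DivStrippingExtractor`, its return value, and its behaviour (dropping every div whose id is listed and returning the remaining text) are assumptions made for this sketch; it is not the ifind implementation and may differ from what PositionContentExtractor actually does.

```python
from html.parser import HTMLParser


class DivStrippingExtractor(HTMLParser):
    """Hypothetical stand-in, NOT ifind's PositionContentExtractor.

    Assumed behaviour: skip the text inside any <div> whose id is in
    div_ids (including nested divs) and return the remaining text.
    """

    def __init__(self, div_ids=None):
        super().__init__()
        self.div_ids = set(div_ids or [])
        self._skip_depth = 0  # > 0 while inside a div that is being dropped
        self._chunks = []     # text fragments that are kept

    def set_div_ids(self, div_ids):
        # Replace the set of div ids to drop on the next run.
        self.div_ids = set(div_ids)

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self._skip_depth:
            # Nested div inside a dropped div: track depth so the
            # matching end tag is counted correctly.
            self._skip_depth += 1
        elif dict(attrs).get("id") in self.div_ids:
            self._skip_depth = 1

    def handle_endtag(self, tag):
        if tag == "div" and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self._skip_depth:
            self._chunks.append(text)

    def process_html_page(self, html):
        # Reset parser state so the instance can be reused across pages.
        self.reset()
        self._skip_depth = 0
        self._chunks = []
        self.feed(html)
        self.close()
        return " ".join(self._chunks)


# Usage, reusing part of the fixture page from code example #1.
sample_html = ('<html><div id="post"> stay '
               '<div id="sub-post">should be gone</div></div>'
               '<footer class="myfoot">at the bottom</footer></html>')
extractor = DivStrippingExtractor(div_ids=["sub-post"])
kept_text = extractor.process_html_page(sample_html)
print(kept_text)
```

Calling `set_div_ids([])` on the same instance and processing the page again keeps the text of the nested div, which mirrors the reconfiguration step at the end of code example #2.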