Python RscHtmlReaderの例

プログラミング言語: Python

名前空間/パッケージ名: chemdataextractor.reader

クラス/型: RscHtmlReader

hotexamples.comのコード掲載数: 5

Python RscHtmlReader - 5件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのchemdataextractor.reader.RscHtmlReaderの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

RscHtmlReader(5)

readstring(3)

detect(1)

よく使われるメソッド

RscHtmlReader (5)

readstring (3)

detect (1)

コード例 #1

ファイルを表示

 def test_detect(self):
     """Test RscHtmlReader can detect an RSC document."""
     r = RscHtmlReader()
     fname = '10.1039_C6OB02074G.html'
     f = io.open(os.path.join(os.path.dirname(__file__), 'data', 'rsc', fname), 'rb')
     content = f.read()
     self.assertEqual(r.detect(content, fname=fname), True)

コード例 #2

ファイルを表示

 def test_direct_usage(self):
     """Test RscHtmlReader used directly to parse file."""
     r = RscHtmlReader()
     fname = '10.1039_C6OB02074G.html'
     f = io.open(os.path.join(os.path.dirname(__file__), 'data', 'rsc', fname), 'rb')
     content = f.read()
     d = r.readstring(content)
     self.assertEqual(len(d.elements), 61)

コード例 #3

ファイルを表示

ファイル: test_reader_rsc.py プロジェクト: edbeard/chemdataextractor

 def test_fig_id_detection(self):
     """ Tests RscHtmlReader can detect the right number of figures and fig captions"""
     r = RscHtmlReader()
     fname = '10.1039_C6OB02074G.html'
     f = io.open(
         os.path.join(os.path.dirname(__file__), 'data', 'rsc', fname),
         'rb')
     content = f.read()
     d = r.readstring(content)
     figs = d.figures
     ids = [fig.id for fig in figs]
     self.assertEqual(len(ids), 4)

コード例 #4

ファイルを表示

ファイル: test_reader_rsc_new.py プロジェクト: edbeard/chemdataextractor

 def test_fig_and_fig_cation_detection(self):
     """ Tests RscHtmlReader can detect the right number of figures and fig captions"""
     r = RscHtmlReader()
     fname = 'B9PP00180H.html'
     f = io.open(
         os.path.join(os.path.dirname(__file__), 'data', 'rsc', fname),
         'rb')
     content = f.read()
     d = r.readstring(content)
     figs = d.figures
     captions = [
         fig.caption for fig in figs if fig.caption.text != ('\n' or '')
     ]
     self.assertEqual(len(figs), 6)
     self.assertEqual(len(captions), 6)
     self.assertEqual(len(captions[1].sentences), 1)

コード例 #5

ファイルを表示

ファイル: test_reader_rsc.py プロジェクト: sunflower6069/ChemDataExtractor

 def test_document_usage(self):
     """Test RscHtmlReader used via Document.from_file."""
     fname = '10.1039_C6OB02074G.html'
     f = io.open(
         os.path.join(os.path.dirname(__file__), 'data', 'rsc', fname),
         'rb')
     d = Document.from_file(f, readers=[RscHtmlReader()])
     self.assertEqual(len(d.elements), 61)