Python tostringの例

プログラミング言語: Python

名前空間/パッケージ名: lxml_linux_64.html

メソッド/関数: tostring

hotexamples.comのコード掲載数: 3

Python tostring - 3件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのlxml_linux_64.html.tostringの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

def htm2txt(inf):
    """ extract the text context"""
    doc=html.document_fromstring(inf)
    content=doc.xpath('//*[@id="contents"]')
    htmls=html.tostring(content[0],False)
    htmls=htmls.replace('<br>','\n')
    htmls=htmls.replace('<p>','\n')
    htmls=unescape(htmls)
    p=re.compile('\n{2,}') #replace more than 2 newlines in a row into one newline
    htmls=p.sub('\n',htmls)
    newdoc=html.document_fromstring(htmls)
    return newdoc.text_content()

コード例 #2

ファイルを表示

ファイル: www.ranwen.net.py プロジェクト: anselmorenato/litebook-project

def htm2txt(inf):
    """ extract the text context"""
    doc=html.document_fromstring(inf)
    content=doc.xpath('//*[@id="bgdiv"]/table[2]/tbody/tr[1]/td/table/tbody/tr')
    htmls=html.tostring(content[0],False)
    htmls=htmls.replace('<br>','\n')
    htmls=htmls.replace('<p>','\n')
    htmls=htmls.replace('&#160;',' ')
    p=re.compile('\n{2,}') #replace more than 2 newlines in a row into one newline
    htmls=p.sub('\n',htmls)
    newdoc=html.document_fromstring(htmls)
    return newdoc.text_content()

コード例 #3

ファイルを表示

def htm2txt(inf):
    """ extract the text context"""
    doc = html.document_fromstring(inf)
    #content=doc.xpath('//*[@id="bgdiv"]/table[2]/tbody/tr[1]/td/table/tbody/tr')
    content = doc.xpath('//*[@id="content"]')
    htmls = html.tostring(content[0], False)
    htmls = htmls.replace('<br>', '\n')
    htmls = htmls.replace('<p>', '\n')
    htmls = htmls.replace('&#160;', ' ')
    p = re.compile(
        '\n{2,}')  #replace more than 2 newlines in a row into one newline
    htmls = p.sub('\n', htmls)
    newdoc = html.document_fromstring(htmls)
    return newdoc.text_content()