Python BeautifulSoup.recursiveChildGeneratorの例

プログラミング言語: Python

名前空間/パッケージ名: util.BeautifulSoup

クラス/型: BeautifulSoup

メソッド/関数: recursiveChildGenerator

hotexamples.comのコード掲載数: 4

Python BeautifulSoup.recursiveChildGenerator - 4件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのutil.BeautifulSoup.BeautifulSoup.recursiveChildGeneratorの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

BeautifulSoup(5)

find(2)

recursiveChildGenerator(2)

findAll(1)

renderContents(1)

コード例 #1

ファイルを表示

ファイル: strings.py プロジェクト: AlexUlrich/digsby

def strip_html_and_tags(s, invalid_tags):
    '''
    content between "invalid_tags" is removed
    '''
    if not s: return s

    from util.BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(s.replace('<br>','\n').replace('<br/>','\n').replace('<br />', '\n'))
    for tag in invalid_tags:
        for result in soup.findAll(name=tag):
            result.replaceWith("")

    return ''.join(e for e in soup.recursiveChildGenerator()
                   if isinstance(e,unicode))

コード例 #2

ファイルを表示

def strip_html_and_tags(s, invalid_tags):
    '''
    content between "invalid_tags" is removed
    '''
    if not s: return s

    from util.BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(
        s.replace('<br>', '\n').replace('<br/>', '\n').replace('<br />', '\n'))
    for tag in invalid_tags:
        for result in soup.findAll(name=tag):
            result.replaceWith("")

    return ''.join(e for e in soup.recursiveChildGenerator()
                   if isinstance(e, unicode))

コード例 #3

ファイルを表示

def strip_html2(s):
    '''
    Strips out HTML with the BeautifulSoup library.

    >>> strip_html2('<html><body><b>Some <i>ugly</i></b> html.</body></html>')
    u'Some ugly html.'
    '''
    if not s: return s

    from util.BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(s)

    text_pieces = []
    for pc in soup.recursiveChildGenerator():
        if isinstance(pc, unicode):
            text_pieces.append(pc)
        elif pc.name == 'br':
            text_pieces.append('\n')

    return ''.join(text_pieces)

コード例 #4

ファイルを表示

ファイル: strings.py プロジェクト: AlexUlrich/digsby

def strip_html2(s):
    '''
    Strips out HTML with the BeautifulSoup library.

    >>> strip_html2('<html><body><b>Some <i>ugly</i></b> html.</body></html>')
    u'Some ugly html.'
    '''
    if not s: return s

    from util.BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(s)

    text_pieces = []
    for pc in soup.recursiveChildGenerator():
        if isinstance(pc, unicode):
            text_pieces.append(pc)
        elif pc.name == 'br':
            text_pieces.append('\n')

    return ''.join(text_pieces)