Python PyQuery.itemsの例

プログラミング言語: Python

名前空間/パッケージ名: pyquery.pyquery

クラス/型: PyQuery

メソッド/関数: items

hotexamples.comのコード掲載数: 1

Python PyQuery.items - 1件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのpyquery.pyquery.PyQuery.itemsの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

PyQuery(30)

attr(21)

children(17)

text(12)

find(3)

next(2)

replace(2)

strip(2)

eq(1)

has_class(1)

is_(1)

items(1)

startswith(1)

val(1)

xhtml_to_html(1)

コード例 #1

ファイルを表示

def get_links(htmlpath, exclude=None):
    ''' Get links from an html file.

        Not well tested. See reinhardt.feeds for examples of more reliable parsing.

        Returns a list. Each item is a list of [PATH, URL, SUMMARY].

        'htmlpath' is path of html file.

        'exclude' is string in href to exclude, without top level domain.
        Example: To exclude links to google, use "exclude='google'".

        Very ad hoc.
    '''

    # fallable importdelayed until needed
    try:
        from pyquery.pyquery import PyQuery

    except ModuleNotFoundError:
        raise Exception('pyquery not installed')

    else:

        results = []

        with open(htmlpath) as infile:

            html = PyQuery(to_bytes(infile.read()))
            anchor_tags = html.items('a')
            # log.debug(f'{len(list(anchor_tags))} links: {htmlpath}') # DEBUG
            for item in anchor_tags:
                href = item.attr('href')
                if href and href.startswith('http'):
                    if exclude and (exclude not in href):
                        results.append([htmlpath, href, item.text().strip()])
                        # log.debug(f'\t{href}') # DEBUG

        return results