Python get_html示例

编程语言: Python

命名空间/包名称: html2md

方法/功能: get_html

hotexamples.com的示例: 3

Python get_html - 已找到3个示例。这些是从开源项目中提取的最受好评的html2md.get_html现实Python示例。您可以评价示例，以帮助我们提高示例质量。

示例#1

显示文件

文件： download_image.py 项目： bingjin/codingpy_archives

def get_urls(url, selector):
    resp = get_html(url, selector)
    html = fromstring(resp)

    if 'weixin.qq.com' in url:
        image_urls = html.xpath('//img/@data-src')
    else:
        image_urls = html.xpath('//img/@src')
    image_urls = [normalize_image_url(url, image_url) for image_url in image_urls]
    return image_urls

示例#2

显示文件

文件： download_image.py 项目： yangjiandong/articles

def get_urls(url, selector):
    resp = get_html(url, selector)
    html = fromstring(resp)

    if 'weixin.qq.com' in url:
        image_urls = html.xpath('//img/@data-src')
    else:
        image_urls = html.xpath('//img/@src')
    image_urls = [
        normalize_image_url(url, image_url) for image_url in image_urls
    ]
    return image_urls

示例#3

显示文件

from upload2cos import upload_image
from watermark import watermark_text, watermark_overlay

# add argument

parser = argparse.ArgumentParser()
parser.add_argument('url', help='target url page')
parser.add_argument('selector', help='target selector')

args = parser.parse_args()
url = args.url
selector = args.selector

# get html, convert it to md

html = get_html(url, selector)
md = html2md(html)
save_md(md)

# find images on the url page

image_urls = get_urls(url, selector)

# upload image to COS, replace with COS access url

for image_url in image_urls:
    print(image_url)
    image_path = download_image(image_url)
    print(image_path)
    new_url = upload_image(image_path)
    print(new_url)