Example #1
import Creep_Tools  # external helper module providing _Deduplication


def parser_for_one_url(soup):
    """Collect product detail-page URLs from a parsed JD list page."""
    url_list = []
    # Each JD list page keeps its products inside <ul class="gl-warp clearfix">.
    for item in soup.find_all("ul", {"class": "gl-warp clearfix"}):
        # The product title block <div class="p-name"> wraps the detail link.
        for name in item.find_all("div", {"class": "p-name"}):
            link = name.find("a")
            url = link.get("href") if link is not None else None
            if url is not None:
                url_list.append(url)
            else:
                print("soup is empty")  # no usable link in this name block
    return url_list

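For context, a minimal sketch of how this parser might be fed, assuming a plain requests + BeautifulSoup pipeline in place of the _Analyze_Soup helper referenced below (the fetch_soup name, the header value, and the timeout are illustrative assumptions, not part of the original):

import requests
from bs4 import BeautifulSoup


def fetch_soup(url):
    # A browser-like User-Agent helps get past basic bot filtering
    # (the header value here is an illustrative assumption).
    headers = {"User-Agent": "Mozilla/5.0"}
    resp = requests.get(url, headers=headers, timeout=10)
    resp.raise_for_status()
    return BeautifulSoup(resp.text, "html.parser")

# Usage, reusing the list-page URL from the test comment below:
# soup = fetch_soup("http://list.jd.hk/list.html?cat=1319,1525,7057&go=0&gjz=0")
# print(parser_for_one_url(soup))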

if __name__ == "__main__":
    # _catch_Index_Url is defined elsewhere in the original project; it is
    # presumably expected to write the collected list-page URLs into `file`.
    with open("JD_commodity_urls.txt", mode="w", encoding="utf-8") as file:
        _catch_Index_Url()
    Creep_Tools._Deduplication("JD_commodity_urls.txt")
    # Test URL:
    # soup = _Analyze_Soup("http://list.jd.hk/list.html?cat=1319,1525,7057&go=0&gjz=0")
    # parser_for_one_url(soup)
    print("Run finished")