Python table_to_list Examples

Programming language: Python

Namespace/package name: scrape_tools

Method/function: table_to_list

Examples at hotexamples.com: 3

Python table_to_list - 3 examples found. These are the top-rated real-world Python examples of scrape_tools.table_to_list, extracted from open source projects.
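
The page does not show the body of table_to_list itself. Judging only from how the examples use its result (a list of dict-like documents with keys such as '_id' and 'web_url'), a minimal sketch might look like the following. Both the function body and the FakeCollection stand-in are assumptions made for illustration, not the project's actual code; the only thing assumed of the table is a find() method that yields documents, as on a pymongo collection.

```python
class FakeCollection:
    """Hypothetical in-memory stand-in for a MongoDB collection."""

    def __init__(self, docs):
        self._docs = list(docs)

    def find(self):
        # Yield copies so callers cannot mutate the stored documents.
        return (dict(doc) for doc in self._docs)


def table_to_list(table):
    """Assumed behavior: materialize every document in the collection."""
    return list(table.find())


table = FakeCollection([
    {'_id': 1, 'web_url': 'https://example.com/a'},
    {'_id': 2, 'web_url': 'https://example.com/b'},
])
docs = table_to_list(table)
print(len(docs))
```

Loading the whole collection into a list is convenient for small tables like the ones in these examples, but it holds every document in memory at once.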

Example #1

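import threading
import time

import numpy as np


def content_scraper(table):
    # table_to_list and content_adder_thread are defined elsewhere in the source project.
    docs = table_to_list(table)
    for i, doc in enumerate(docs):
        # Skip documents that already have their content.
        if 'content' in doc:
            continue
        thread = threading.Thread(name=str(i), target=content_adder_thread, args=(table, doc, i))
        thread.start()
        # Pause roughly 0.3 to 0.63 s between thread starts.
        time.sleep(np.random.random() / 3 + 0.3)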
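
The throttled thread launch in Example #1 can be tried without a database. The sketch below is an illustration under stated assumptions: launch_throttled, fake_worker, and the sample documents are hypothetical names, the standard library random module replaces NumPy, and the delays are shortened so the demo finishes quickly. Unlike the original loop, it filters the documents before enumerating them, so the indices refer to the pending list.

```python
import random
import threading
import time


def launch_throttled(items, worker, min_delay=0.3, jitter=1 / 3):
    """Start one thread per item, pausing a random interval between starts."""
    threads = []
    for i, item in enumerate(items):
        thread = threading.Thread(name=str(i), target=worker, args=(item, i))
        thread.start()
        threads.append(thread)
        # Sleep between min_delay and min_delay + jitter seconds.
        time.sleep(min_delay + random.random() * jitter)
    return threads


results = {}
results_lock = threading.Lock()


def fake_worker(doc, i):
    # Hypothetical stand-in for content_adder_thread: record the handled link.
    with results_lock:
        results[i] = doc['link']


docs = [{'link': 'a'}, {'link': 'b'}, {'link': 'c', 'content': 'already scraped'}]
pending = [doc for doc in docs if 'content' not in doc]
threads = launch_throttled(pending, fake_worker, min_delay=0.01, jitter=0.01)
for thread in threads:
    thread.join()  # Wait for every worker before reading results.
```

Returning the thread objects lets the caller join them, which the original snippet does not do.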

Example #2

File: nyt_scraper.py Project: zachary-britt/text2slant

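    def __init__(self):
        # Assumes module-level `import os` and an `st` alias for scrape_tools.
        self.i = 0
        self.table = st.open_database_collection('nyt')

        # Collect the URLs already stored in the collection.
        docs = st.table_to_list(self.table)
        self.seen_urls = {doc['web_url'] for doc in docs}

        self.link = 'http://api.nytimes.com/svc/search/v2/articlesearch.json'
        # Raises KeyError if the NYT_API_KEY environment variable is unset.
        nyt_api_key = os.environ['NYT_API_KEY']
        self.payload = {'api-key': nyt_api_key}
        self._set_filters()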
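
The snippet builds seen_urls but does not show where it is used. One plausible use, sketched here purely as an assumption, is skipping articles whose URL is already stored. The filter_new helper and the sample batch are hypothetical and not part of the project.

```python
def filter_new(articles, seen_urls):
    """Return articles whose 'web_url' is not in seen_urls, updating the set."""
    fresh = []
    for article in articles:
        url = article['web_url']
        if url in seen_urls:
            continue
        seen_urls.add(url)
        fresh.append(article)
    return fresh


seen_urls = {'https://example.com/a'}
batch = [
    {'web_url': 'https://example.com/a'},
    {'web_url': 'https://example.com/b'},
    {'web_url': 'https://example.com/b'},
]
fresh = filter_new(batch, seen_urls)
print(fresh)
```

A set gives constant-time membership checks on average, which is why a set comprehension is a natural fit for seen_urls.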

Example #3

File: database_cleaning.py Project: zachary-britt/text2slant

def remove_dups(table):
    # Delete documents that share a link, keeping one document per link.
    docs = st.table_to_list(table)
    if not docs:
        # Nothing to deduplicate; also avoids an IndexError on docs[0].
        return

    # Documents that store the URL under 'web_url' get it mirrored into 'link'.
    if 'web_url' in docs[0]:
        for doc in docs:
            doc['link'] = doc['web_url']

    # Building a dict keeps only the last _id seen for each link.
    pair_dict = {doc['link']: doc['_id'] for doc in docs}
    id_keepers = set(pair_dict.values())
    id_all = {doc['_id'] for doc in docs}

    # Every _id that is not a keeper belongs to a duplicate.
    kill_ids = id_all.difference(id_keepers)

    for _id in kill_ids:
        table.delete_one(filter={'_id': _id})
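
To see which documents this logic would delete, the selection step can be run on plain dictionaries without a database. The sketch below is a minimal illustration; ids_to_delete and the sample documents are hypothetical names, not part of the project.

```python
def ids_to_delete(docs):
    """Return the _ids that remove_dups-style logic would delete."""
    # Later documents overwrite earlier ones with the same link.
    keep_by_link = {doc['link']: doc['_id'] for doc in docs}
    keepers = set(keep_by_link.values())
    return {doc['_id'] for doc in docs} - keepers


docs = [
    {'_id': 1, 'link': 'https://example.com/a'},
    {'_id': 2, 'link': 'https://example.com/b'},
    {'_id': 3, 'link': 'https://example.com/a'},
]
kill_ids = ids_to_delete(docs)
print(kill_ids)  # {1}
```

Documents 1 and 3 share a link, and because the dictionary keeps the last value written for a key, document 3 survives and document 1 is marked for deletion.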