def get_subreddits_links_to_build_task(self):
    """Build the list of subreddit URLs that still need to be downloaded."""
    base_ = Base()
    extract_ = Extract()
    list_subreddits_data = base_.get_data_list_subreddits()
    downloaded_subs = base_.check_resume_file(file_path=self.resume_file)
    urls = extract_.get_urls_for_all_subreddits(subreddits=list_subreddits_data,
                                                start_date=self.st_dt,
                                                end_date=self.end_dt)
    if len(downloaded_subs) > 0:
        # Drop URLs the resume file records as already fetched.
        urls = list(set(urls) - set(downloaded_subs))
        total = len(downloaded_subs) + len(urls)
        print("Already downloaded {} sub-reddits, yet to download {} sub-reddits"
              .format(len(downloaded_subs), len(urls)))
        # Progress is downloaded / total, not downloaded / remaining.
        print("Completed {:.2f}%".format(100 * len(downloaded_subs) / total))
    return urls
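# For reference, a minimal sketch of the resume bookkeeping this method relies on.
# It assumes the resume file is a plain newline-delimited list of completed URLs;
# Base.check_resume_file is called above but its body is not shown in this section,
# so the implementation below is an assumption, not the project's actual code.
#
#     import os
#
#     def check_resume_file(self, file_path):
#         """Return the set of URLs recorded as already downloaded (empty if none)."""
#         if not os.path.exists(file_path):
#             return set()
#         with open(file_path, "r") as fh:
#             return {line.strip() for line in fh if line.strip()}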
def run_extraction(self):
    """Download data for every subreddit the resume file has not yet recorded."""
    extract_ = Extract()
    start_time = time.time()
    # Reuse the resume-aware URL builder above instead of duplicating its logic;
    # it already filters out downloaded subreddits and prints progress.
    urls = self.get_subreddits_links_to_build_task()
    extract_.url_based_extraction(links=urls, base_path=self.sav_path)
    print("Extraction took {:.2f}s".format(time.time() - start_time))
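# A hypothetical driver showing how these methods might be invoked. The owning
# class name (RedditPipeline) and its constructor arguments are assumptions,
# since this section shows only the two methods, not the class definition.
#
#     if __name__ == "__main__":
#         pipeline = RedditPipeline(st_dt="2021-01-01", end_dt="2021-12-31",
#                                   resume_file="resume.txt", sav_path="./data")
#         pipeline.run_extraction()  # resumes automatically from resume.txt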