def main():
    #load config values (paths and database credentials); the site
    #to scrape is hard-coded below instead of being fetched from the
    #Google spreadsheet of scrape commands
    home_dir, data_dir, database, db_user, db_pw, commands_url = make_config('_nh')
    site = {
        'URL': 'http://p2c.nhcgov.com/p2c/Summary.aspx',
        'Agency': "New Hanover County Sheriff's Office",
        'County': 'New Hanover',
        'How far back': '7'
    }
    #variables we'll use in our scraping and data format
    county = site['County']
    url = site['URL']
    agency = site['Agency']
    #this is how many days back we want to scrape
    #e.g. 1 would scrape a total of 2 days:
    # today plus 1 day back (yesterday)
    howfar = int(site['How far back'])
    #try for daily bulletin
    bulletin_url = try_bulletin(url)
    start_scrape(agency, county, bulletin_url, howfar)
    #output data as tab-delimited text files named for the
    #record type (arrest.txt, incident.txt, citation.txt, accident.txt)
    print_files(scraper_commands.all_data, data_dir)
    for data_type in scraper_commands.all_data:
        data_file = data_dir + '/' + data_type + '.txt'
        table = data_type.lower() + 's'
        load(database, data_file, table, db_user, db_pw)
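#The print_files helper isn't shown on this page; the sketch below is one
#plausible shape for it, assuming all_data maps each record type ('Arrest',
#'Incident', 'Citation', 'Accident') to a list of row tuples. The field order
#and the optional site prefix are assumptions, not the project's actual code.
import os

def print_files(all_data, data_dir, site_prefix=''):
    """Write each record type's rows to a tab-delimited text file."""
    for data_type, rows in all_data.items():
        out_path = os.path.join(data_dir, site_prefix + data_type + '.txt')
        with open(out_path, 'w') as out_file:
            for row in rows:
                out_file.write('\t'.join(str(field) for field in row) + '\n')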
Example #3
import sys
import scrape_bulletin

def main():
    home_dir, data_dir, database, db_user, db_pw, commands_url = make_config()
    sites_to_scrape = fetch_commands(commands_url)
    #pick the site we want out of the list of sites, using the
    #index passed as a command-line argument to this script
    site = sites_to_scrape[int(sys.argv[1])]
    #variables we'll use in our scraping and data format
    county = site['County']
    url = site['URL']
    agency = site['Agency']
    #this is how many days back we want to scrape
    #e.g. 1 would scrape a total of 2 days:
    # today plus 1 day back (yesterday)
    howfar = int(site['How far back'])
    #try for daily bulletin
    #if not, then go for search
    bulletin_url = scrape_bulletin.try_bulletin(url)
    if bulletin_url:
        if bulletin_url == 'unreachable':
            print "\t".join([url,bulletin_url])
        else:
            data = scrape_bulletin.start_scrape(agency, county, bulletin_url, howfar)
    else:
        #we'll need to import the functionality to scrape a search site
        import scrape_search
        data = scrape_search.start_scrape(agency, url, howfar, county)
        if not data:
            print "\t".join([url,"failed"])
    #output data as tab-delimited text files named for the
    #record type (arrest.txt, incident.txt, citation.txt, accident.txt)
    print_files(all_data, data_dir, site['Site'])
    #stop here for now; the database load loop below never runs
    exit()
    for data_type in all_data:
        data_file = data_dir + '/' + site['Site'] + data_type + '.txt'
        table = data_type.lower() + 's'
        db_load(database, data_file, table, db_user, db_pw)
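#db_load isn't defined on this page either; below is a minimal sketch of how
#the tab-delimited files could be bulk-loaded, assuming a MySQL database on
#localhost, the MySQLdb driver, and tables whose columns match the field
#order in the files. All of those are assumptions about the project's setup.
import MySQLdb

def db_load(database, data_file, table, db_user, db_pw):
    """Bulk-load a tab-delimited file into the given MySQL table."""
    conn = MySQLdb.connect(host='localhost', user=db_user, passwd=db_pw,
                           db=database, local_infile=1)
    cursor = conn.cursor()
    #table and file names come from our own config, so plain string
    #formatting is acceptable here; don't do this with untrusted input
    cursor.execute(
        "LOAD DATA LOCAL INFILE '%s' INTO TABLE %s "
        "FIELDS TERMINATED BY '\\t' LINES TERMINATED BY '\\n'"
        % (data_file, table))
    conn.commit()
    cursor.close()
    conn.close()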