Skip to content

Simple spider to crawl HXL datasets on the Humanitarian Data Exchange and report stats.

Notifications You must be signed in to change notification settings

HXLStandard/hxl-spider

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Script to crawl HXL datasets on HDX and collect statistics

Prerequisites

  • Python3
  • the ckanapi module
  • the libhxl module
  • an account on a CKAN instance

Instructions

  1. Copy the file config.py.TEMPLATE to config.py and fill in the fields
  2. Execute the command python3 crawl-hxl.py

About

Simple spider to crawl HXL datasets on the Humanitarian Data Exchange and report stats.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages