Takes data from various sources and puts it into a "Data Catalogue".
See the instructions in the https://github.com/nhsengland/iit-infrastructure/tree/master/ansible README.rst file.
1. Set up a virtualenv using your favourite tool for doing so, and activate it.
2. git clone https://github.com/nhsengland/publish-o-matic.git
3. python setup.py install (or python setup.py develop if you insist on the code being changeable)
4. See below for setting up cronjobs.
-
To manually run a scraper, do:

    run_scraper <NAME>

where <NAME> is the name of a module in the datasets package.
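An entry point like run_scraper is typically a thin dynamic-import dispatcher. A minimal sketch, assuming each dataset module exposes a main() function (that layout and function name are assumptions for illustration, not the project's actual code):

```python
import importlib
import sys


def run_scraper(name):
    """Run the dataset module datasets.<name>.

    Hypothetical sketch: assumes each module under the datasets
    package exposes a main() entry point, which may not match
    publish-o-matic's real layout.
    """
    try:
        module = importlib.import_module("datasets." + name)
    except ImportError:
        sys.exit("No such dataset module: {}".format(name))
    module.main()
```

The dynamic import keeps the dispatcher ignorant of individual scrapers, so adding a dataset means adding a module, not editing the runner.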
TODO: Merge steps 2 and 3.
crontool isn't fully trusted yet, so review its output before installing it:
$ crontool > mycrontab
$ less mycrontab
** does it look sane? **
$ crontab mycrontab
Or, if you're feeling brave (how wrong can it be?):
$ crontool | crontab
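A crontool-style generator just emits one crontab line per scraper. A sketch of the idea, assuming a nightly schedule and the run_scraper command above (the schedule string and output format are illustrative assumptions, not what crontool actually produces):

```python
def make_crontab(scrapers, schedule="0 2 * * *"):
    """Build crontab text running each scraper on a fixed schedule.

    Hypothetical sketch of a crontool-style generator; the real
    tool's schedule and command format may differ. The default
    schedule "0 2 * * *" means 02:00 every day.
    """
    lines = ["{} run_scraper {}".format(schedule, name) for name in scrapers]
    return "\n".join(lines) + "\n"
```

Generating the file and piping it through less-then-crontab, as above, keeps a human in the loop until the output is trusted.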
Contains individual STL (Scrape, Transform, Load) procedures for curated datasets.
Each dataset directory is expected to contain a data dir (for cached/retrieved data files) and three files:
- scrape.py - scrape the data files and metadata
- transform.py - make adjustments / additions to scraped metadata as required
- load.py - load the datasets into a CKAN instance.
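The three-step contract above can be sketched as a small driver that runs a dataset's stages in order. The stage signatures and the data-dir convention here are assumptions for illustration, not the project's actual interfaces:

```python
import os


def run_dataset(dataset_dir, scrape, transform, load):
    """Run an STL pipeline: scrape into data/, transform, then load.

    Hypothetical sketch: assumes scrape() returns metadata for the
    files it cached under the dataset's data dir, transform() returns
    adjusted metadata, and load() pushes it to a CKAN instance.
    """
    data_dir = os.path.join(dataset_dir, "data")
    os.makedirs(data_dir, exist_ok=True)  # cache dir for retrieved files
    metadata = scrape(data_dir)        # scrape.py: fetch data files + metadata
    metadata = transform(metadata)     # transform.py: adjust/augment metadata
    load(metadata)                     # load.py: load datasets into CKAN
```

Keeping the three stages as separate files means a failed load can be retried from cached data without re-scraping the source.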