Broken-Promises

In 2003, an article in the New Scientist stated that bananas could be extinct by 2013. Guess what: this was yet another broken promise.

It can be hard to keep track of all promises and predictions. This tool makes it easier for journalists and citizens to keep an eye on what was promised for a certain date (a day, a month or a year). Articles from the Guardian and the New York Times are searched for dates, and the results are then filtered to remove irrelevant matches (movie releases, for instance).
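The search-then-filter idea described above can be sketched roughly as follows. This is only an illustration and not the project's actual implementation: the names `DATE_RE`, `NOISE_WORDS`, `find_dates` and `filter_dates`, the regular expression and the list of noise words are all assumptions made for the example.

```python
import re

# Hypothetical pattern (an assumption, not taken from the project):
# an optional month name followed by a four-digit year in 1900-2099.
MONTHS = ("January|February|March|April|May|June|July|August|"
          "September|October|November|December")
DATE_RE = re.compile(r"\b(?:(%s)\s+)?((?:19|20)\d{2})\b" % MONTHS)

# Hypothetical noise words suggesting a release listing rather than
# a promise or prediction.
NOISE_WORDS = ("release", "premiere", "in cinemas")


def find_dates(text):
    """Return (month, year, snippet) tuples for each date found in text."""
    results = []
    for match in DATE_RE.finditer(text):
        # Keep up to 40 characters of context on each side of the date.
        start = max(0, match.start() - 40)
        end = min(len(text), match.end() + 40)
        results.append((match.group(1), int(match.group(2)), text[start:end]))
    return results


def filter_dates(dates):
    """Drop dates whose surrounding snippet contains a noise word."""
    return [d for d in dates
            if not any(word in d[2].lower() for word in NOISE_WORDS)]


if __name__ == "__main__":
    sample = "Bananas could be extinct by 2013, scientists warn."
    for month, year, snippet in filter_dates(find_dates(sample)):
        print(month, year, snippet)
```

A real filter would likely need more than keyword matching, but the two-step shape (extract every candidate date, then discard the ones whose context looks irrelevant) is the point of the sketch.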

Note

This is only the backend system, which retrieves the articles. To display them, we have built broken-promises-client.

Installation

a. Requirements

sudo apt-get install build-essential git-core python python-pip python-dev redis-server
sudo pip install virtualenv

MongoDB is also required.

b. Download the project

git clone git@github.com:jplusplus/broken-promises.git
cd broken-promises

c. Install

This downloads and installs all the dependencies in a venv directory (virtualenv required).

make install

Run the Application (API + Control Tower)

Launch MongoDB, the workers and the web application (API + Control Tower):

make run

Then visit http://127.0.0.1:5000

Password: brokenpromises

CLI

The command-line scripts are in the Scripts/ directory.

Collect articles for a date

Usage: 
./collect_articles.py [options] year 
./collect_articles.py [options] year month
./collect_articles.py [options] year month day

Options:
  -h, --help            show this help message and exit
  -f CHANNELS_FILE, --channelslistfile=CHANNELS_FILE
                        Use this that as channels list to use
  -c CHANNELS_LIST, --channels=CHANNELS_LIST
                        channels list comma separated
  -s, --storage         Save the result with the default storage
  -d, --drop            drop the previous articles from database before
  --force               Force the scrap. If --storage is enable, the scrap
                        could be escape b/c of a previous similar scrap
  -o OUTPUT_FILE, --output=OUTPUT_FILE
                        Specify  a file to write the export to. If you do not
                        specify a file name, the program writes data to
                        standard output (e.g. stdout)
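The three call forms above correspond to the three granularities the tool supports: a year, a month or a single day. One way to picture the difference is as the span of calendar dates each form covers. The helper below is a hypothetical sketch written for this README; `date_range` is not a function from the project, and the script may represent dates differently internally.

```python
import calendar
import datetime


def date_range(year, month=None, day=None):
    """Return the (first, last) dates covered by a year, month or day.

    Hypothetical helper for illustration only.
    """
    if month is None:
        # Year only: the whole calendar year.
        return datetime.date(year, 1, 1), datetime.date(year, 12, 31)
    if day is None:
        # Year and month: monthrange gives the number of days in the month.
        last_day = calendar.monthrange(year, month)[1]
        return (datetime.date(year, month, 1),
                datetime.date(year, month, last_day))
    # Year, month and day: a single date.
    exact = datetime.date(year, month, day)
    return exact, exact


if __name__ == "__main__":
    print(date_range(2013))
    print(date_range(2012, 2))
    print(date_range(2013, 6, 15))
```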

Scrap a given page using the appropriate channel

usage: scrap_article.py [-h] [--with-filters] [--dates] url

positional arguments:
  url             url to scrap

optional arguments:
  -h, --help      show this help message and exit
  --with-filters  Apply filters to remove unwanted dates
  --dates         Return the date found in the article

Run tests

$ make test
