# Chicago Police Officer Scheduling System
The following data sets were used in this project:
- Chicago crimes: https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present/ijzp-q8t2
- Chicago traffic tracker: https://data.cityofchicago.org/Transportation/Chicago-Traffic-Tracker-Historical-Congestion-Esti/77hq-huss
- Chicago traffic crashes: https://data.cityofchicago.org/Transportation/Traffic-Crashes-Crashes/85ca-t3if
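Records exported from the portal above carry timestamps in a 12-hour "MM/DD/YYYY hh:mm:ss AM/PM" format (an assumption based on the portal's CSV export, not something pinned down in this repo). A minimal sketch of parsing it with the standard library:

```python
# Sketch: parsing the timestamp format used by the Chicago data portal's
# CSV exports. The "%m/%d/%Y %I:%M:%S %p" format string is an assumption.
from datetime import datetime

def parse_portal_date(raw):
    """Convert a portal-style date string into a datetime object."""
    return datetime.strptime(raw, "%m/%d/%Y %I:%M:%S %p")

print(parse_portal_date("09/05/2015 01:30:00 PM"))  # 2015-09-05 13:30:00
```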
The following must be installed on every VM in the cluster, including both the master node and the slave nodes. The instructions assume the VMs are running Ubuntu 16.04.6 LTS:
- Hadoop 3.1.3
- Spark 3.0.0
- Python 3.7.6
- Anaconda 4.5.11
- Install all the packages from `requirements.txt`
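After installing, a quick sanity check that the environment is in order can be scripted. A minimal sketch (the package names below are placeholders; the real list lives in `requirements.txt`):

```python
# Sketch of a post-install sanity check. The packages passed to
# check_environment are hypothetical examples, not the repo's actual list.
import importlib.util
import sys

def check_environment(required=("numpy", "pandas")):
    """Return a dict mapping each required package to True if importable."""
    return {pkg: importlib.util.find_spec(pkg) is not None for pkg in required}

# The cluster setup above assumes Python 3.7.x.
print(sys.version_info[:2])
print(check_environment())
```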
After running `jupyter notebook`, the notebooks should be straightforward to work through, as long as Hadoop and Spark are installed correctly.
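For reference, a notebook cell that talks to the cluster typically starts by creating a `SparkSession` and reading a data set from HDFS. A minimal sketch, assuming a hypothetical HDFS path and application name (neither is specified in this repo):

```python
# Sketch of the first cell of a notebook: connect to Spark and load a CSV.
# The app name and hdfs:// path below are assumptions for illustration.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("crime-scheduling").getOrCreate()

crimes = (
    spark.read
    .option("header", True)       # first row holds column names
    .option("inferSchema", True)  # let Spark guess column types
    .csv("hdfs:///data/chicago_crimes.csv")
)
crimes.printSchema()
```

If this cell fails, the usual culprits are `SPARK_HOME`/`HADOOP_HOME` not being set or the HDFS daemons not running, rather than the notebook itself.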
Each folder has a `README.md` file that further explains what that folder's contents do.