This is Proof-of-Concept project, created to show two things:
- How modern technologies can be used to process data
- Rube Goldberg style, unintentionally widely used in software development
Following technologies and frameworks are used: Docker, Java 8 and Java 11, Mongo, MySQL, Python, Spark (PySpark), Pandas and Scikit-learn.
Tested on Ubuntu 18.04 LTS, 20.04 LTS
run.sh should be executed with root privileges
YouTube video: https://youtu.be/aECsmTd0QjY