- Apache Spark
- Hadoop YARN (for high availability)
For example:
$ export PYTHONPATH=$PYTHONPATH:/home/kuanpern/Scflex/Scflex-master
- Ensure that "spark-submit" is on the PATH.
- Ensure that the Hadoop cluster is running properly.
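The PATH check above can be scripted. The following is a minimal sketch, not part of Scflex: the helper names `find_on_path` and `missing_prerequisites` are hypothetical, and the script only looks for executables on the PATH. It does not verify that the Hadoop cluster itself is running.

```python
import shutil


def find_on_path(executable):
    """Return the full path of an executable found on PATH, or None."""
    return shutil.which(executable)


def missing_prerequisites(executables=("spark-submit",)):
    """Return the names from `executables` that are not found on PATH.

    The default tuple is an assumption based on the checklist above;
    extend it with any other command-line tools your setup needs.
    """
    return [name for name in executables if find_on_path(name) is None]


if __name__ == "__main__":
    missing = missing_prerequisites()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All required executables were found on PATH.")
```

Run it once before submitting jobs. If it reports "spark-submit" as missing, add the Spark "bin" directory to the PATH and run it again.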
Optionally, install the task monitoring server.
- Refer to Scflex/controls/README.md
$ # start an interactive session with:
$ ./bin/interactive_add_service
- Socket-based logging system
- Dashboard to monitor the job and node status
- Run Python by module import / thread-based processing