Skip to content

birutbhai/stream-classification-spark-kafka-kibana

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Libraries:kafka, pyspark, time, elasticsearch, json, sys, requests, nltk, pandas, sklearn.
Python Version: Python 2.7.15+
Files: 
1> training.py: Please keep the training data in a file 'texts.txt' in the same directory. 
To Run: python training.py

2> stream_producer.py: Store the output of this file in 'texts.txt' file, which can be used for training.
To Run: python stream_producer.py <API_KEY> <From date> <To Date>
Date should be in YYYY-MM-DD format.

3> stream_consumer.py: Keep the model files generated by stream_producer.py in the same directory.
To Run: python stream_consumer.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages