GitHub

#Dataset files can ben found at http://www.sfu.ca/~alopes/732_project/dataset.zip #Report and results at http://www.sfu.ca/~vpalenge/index.html

#Instructions to run Daemon cd tweet_stream
nohup python3 tweet_stream2.py >out.txt &
The command is running and saving the twitters inside the folder 'airlines_tweets'!

#Copy the files to be processed hdfs dfs -copyFromLocal airlines_tweets
back to the main directory: cd ..
copying the training file: hdfs dfs -copyFromLocal airline.csv

#Executing spark-submit --master=yarn-client --executor-memory 6G --num-executors 15 --packages TargetHolding/pyspark-cassandra:0.3.5,com.databricks:spark-csv_2.11:1.5.0 --py-files svm.zip process_twitter.py airlines_tweets airline.csv vap

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
RUNNING.md		RUNNING.md
airline_train_small.txt		airline_train_small.txt
inspection.txt		inspection.txt
out.txt		out.txt
process_twitter.py		process_twitter.py
read.py		read.py
read_v2.py		read_v2.py
svm.zip		svm.zip
train_svm.py		train_svm.py
train_svm.pyc		train_svm.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

RUNNING.md

RUNNING.md

airline_train_small.txt

airline_train_small.txt

inspection.txt

inspection.txt

out.txt

out.txt

process_twitter.py

process_twitter.py

read.py

read.py

read_v2.py

read_v2.py

svm.zip

svm.zip

train_svm.py

train_svm.py

train_svm.pyc

train_svm.pyc

Repository files navigation

About

Releases

Packages

Languages

pfattahi/AIRLINE_RANKINGS

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages