Repository for Applied Statistics & Data Science Program in Yerevan State University (ASDS-YSU)
This repository contains multiple (stand-alone) projects.
Clone the Repo and and do the following steps:
python3 -m venv .<project_name>
source .<project_name>/bin/activate
pip install -U pip
pip install -r requirements/<project_name>.txt
In workspace you will see files like <project_name>
.py which is the main.py
for the corresponding project.
Status: In process
Status: In progress
Using PySpark engine, project solves the problem of real time
prediction for posts
from social sites (Facebook, Twitter), to determinie whether
posts are containing offensive language or not.
Project Structure
.
│───data
│ │-tweet_sentiments.csv
│───services
│ │-facebook.py //
│ │-twitter.py // not implemented
│───spark
│ │──core
│ │ │-session.py
│ │ │-context.py
│ │ │-streaming.py
│ │──ml
│ │ │-estimator.py
│ │ │-pipeline.py
│ │ │-transformer.py
└───utils
│-schema.py
make sure you have the spark
properly installed on your machine.
Build
python3 -m venv .social_sentiment
source .social_sentiment/bin/activate
pip install -U pip
pip install -r requirements/social_sentiment.txt
Run
python social_sentiment.py localhost 4455 // start the engine
nc -lk 4455 // write streams here
Click to see the attached video-demo.