Tweet-Diabetes-Classification

This project aims to identify patterns of diabetes distress in social media data using artificial intelligence methods. We are working with Twitter data.

A short overview of the directories:

Visualisation_US_map : D3 visualisation of where tweets occur across the USA, as assigned by our geolocation algorithm
Tweets_extraction_Twitter : Extraction of tweets via the Twitter API
WordEmbeddings : Calculation of word embeddings (Word2Vec or FastText) with the gensim package
data : Trained models (not up to date)
db : Algorithms to filter tweets, remove duplicates (from chatbots), clean the database, etc.
files : Only the list of keywords used to extract tweets
jupyter_notebooks : Notebooks for experiments
preprocess : Functions to preprocess tweets and textual data in general
readWrite : Reading and writing files (parquet, csv, text)
tests : Tests (not up to date)
topicModel : Topic extraction with the LDA method
training : Training of classifiers for filtering or prediction
utils : Utility functions
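
As a rough illustration of what the preprocess and db directories are described as doing (cleaning tweets and removing duplicates), here is a minimal sketch using only the standard library. The function names normalize_tweet and drop_duplicates, the regular expressions, and the sample tweets are assumptions made for this example and are not taken from the repository.

```python
import re

# Hypothetical helpers for illustration; the repository's own functions may differ.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
WHITESPACE_RE = re.compile(r"\s+")


def normalize_tweet(text):
    """Strip URLs and @mentions, collapse whitespace, and lowercase."""
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    return WHITESPACE_RE.sub(" ", text).strip().lower()


def drop_duplicates(tweets):
    """Keep the first tweet for each normalised text, preserving input order."""
    seen = set()
    unique = []
    for tweet in tweets:
        key = normalize_tweet(tweet)
        if key and key not in seen:
            seen.add(key)
            unique.append(tweet)
    return unique


# Made-up sample tweets; the first two differ only in case, mention, and URL.
tweets = [
    "Checking my blood sugar again @friend https://example.com/a",
    "checking my blood sugar again @other https://example.com/b",
    "Type 1 diabetes is exhausting some days",
]
unique_tweets = drop_duplicates(tweets)
```

Comparing normalised text catches reposts that differ only in links or mentions, which is one plausible way to spot bot-generated duplicates; the actual criteria used in db may be different.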

More detailed information about the programs and algorithms used can be found in the corresponding folders.

Check the development branch 'devAA' for the current programs.

Prerequisites

Python (version >= 3.5.5)
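
A quick way to confirm that the running interpreter meets this requirement is sketched below. The names MIN_PYTHON and python_is_supported are assumptions made for this example, not part of the repository, and the code avoids f-strings so that it also parses on Python 3.5.

```python
import sys

# Minimum version taken from the prerequisite above.
MIN_PYTHON = (3, 5, 5)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor, micro) version is at least `minimum`."""
    return tuple(version_info[:3]) >= minimum


if python_is_supported():
    print("Python {}.{}.{} is supported".format(*sys.version_info[:3]))
else:
    print("Python >= {}.{}.{} is required".format(*MIN_PYTHON))
```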
