Skip to content

mirandrom/CrisisTweetMap

Repository files navigation

CrisisTweetMap

EMNLP 2020 Covid-19 Workshop Paper: Real-time Classification, Geolocation and Interactive Visualization of COVID-19 Information Shared on Social Media to Better Understand Global Developments (OpenReview).

Winner of McHacks 7 Hackathon: Using Natural Language Processing to categorize and map tweets in real-time during the covid-19 crisis.

Crisis Tweet Map

Tweets are classified among the following categories:

affected_people
other_useful_information
disease_transmission
disease_signs_or_symptoms
prevention
treatment
not_related_or_irrelevant
deaths_reports

To run the app

  1. Download Trained AllenNLP Model to /tweet_classifier/experiments/l2_balanced/

  2. Setup up tweepy_auth.json with Twitter API keys

  3. Download and run ElasticSearch geonames gazetteer container

    docker pull elasticsearch:5.5.2
    wget https://s3.amazonaws.com/ahalterman-geo/geonames_index.tar.gz --output-file=wget_log.txt
    tar -xzf geonames_index.tar.gz
    docker run -d -p 127.0.0.1:9200:9200 -v $(pwd)/geonames_index/:/usr/share/elasticsearch/data elasticsearch:5.5.2
    
  4. Install requirements
    pip install -r requirements.txt

  5. Run live twitter scraper/classifier
    python stream_twitter.py

  6. Run live dashboard
    python app.py

To train model

allennlp train experiments/l2_balanced/config.json --serialization-dir experiments/l2_balanced/out

Shoutouts

These following repositories made my life much easier with working examples of the different components I needed for this project.

About

(Covid-19 EMNLP 2020) Real-time Classification, Geolocation and Interactive Visualization of COVID-19 Information Shared on Social Media to Better Understand Global Developments

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published