disaster-tracker

Study print media rhetoric relating to the Syrian refugee crisis. Run one of the _ex.py files to get started.

Corpus Parser: Reads the JSON articles in a specified directory and stores them as strings.
Unigram Stats: Counts the number of appearances of each word in a corpus.
- unigram_stats_ex.py path/to/archive/ path/to/stop-word-list
N-gram Stats: Counts the number of appearances of any phrase in a corpus.
- ngram_stats_ex.py path/to/archive/ path/to/stop-word-list (n-gram length)
Proximate Unigrams: Lists the words near a given word in a corpus.
- proximate_words_ex.py path/to/archive/ path/to/stop-word-list (offset distance) (word to look for)
Proximate N-grams: Lists the phrases near a given word in a corpus in order of frequency.
- proximate_ngrams_ex.py path/to/archive/ path/to/stop-word-list (n-gram length) (word to look for) (offset)
Naive Sentiment:
- naive_sentiment_ex.py path/to/archive/ path/to/stop-word-list (n-gram length) (word to look for) (offset)
Aggregate Attributes: Lists the number of words with a given attribute in the corpus in order of frequency.
- attribute_agg_ex.py path/to/archive/ path/to/stop-word-list
Attribute Dictionary: For each attribute, lists the number of words in a corpus with that attribute.
- attribute_dict_ex.py path/to/archive/ path/to/stop-word-list (attribute)
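
The counting tools above can be illustrated with a minimal sketch. This is not the code in src; the function names (`tokenize`, `ngram_counts`, `proximate_words`) and the sample articles are hypothetical, and it assumes stop words are removed before n-grams are formed and that counts are gathered one article at a time.

```python
from collections import Counter
import re


def tokenize(text, stop_words=frozenset()):
    """Lowercase the text, split it into words, and drop stop words."""
    words = re.findall(r"[a-z']+", text.lower())
    return [w for w in words if w not in stop_words]


def ngram_counts(tokens, n=1):
    """Count every run of n consecutive tokens (n=1 gives unigram counts)."""
    grams = zip(*(tokens[i:] for i in range(n)))
    return Counter(" ".join(g) for g in grams)


def proximate_words(tokens, target, offset):
    """Count the words within `offset` positions of each occurrence of `target`."""
    nearby = Counter()
    for i, word in enumerate(tokens):
        if word == target:
            window = tokens[max(0, i - offset):i] + tokens[i + 1:i + 1 + offset]
            nearby.update(window)
    return nearby


# Hypothetical sample data standing in for the parsed JSON articles.
articles = [
    "Refugees crossed the border as the refugees sought safety.",
    "The border camps sheltered refugees through the winter.",
]
stop_words = {"the", "as", "through"}

unigrams, bigrams, near_border = Counter(), Counter(), Counter()
for text in articles:
    # Process each article separately so n-grams never span two articles.
    tokens = tokenize(text, stop_words)
    unigrams.update(ngram_counts(tokens, 1))
    bigrams.update(ngram_counts(tokens, 2))
    near_border.update(proximate_words(tokens, "border", 1))

print(unigrams.most_common(2))
print(bigrams["crossed border"])
print(sorted(near_border.items()))
```

Because stop words are dropped first, "crossed the border" contributes the bigram "crossed border". Whether the scripts in this repository filter before or after forming n-grams is not stated here, so treat that ordering as an assumption.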

A few notes:

Naive Sentiment requires the files ./lexicons/positive-words.txt and ./lexicons/negative-words.txt.
A stop word list must be present. Many such lists are available online.
The Harvard Inquirer Excel file must be saved as a CSV file at ./lexicons/inquirerbasic.csv.
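
As a rough illustration of how positive and negative word lists can drive a lexicon-based score, here is a minimal sketch. It is not the repository's implementation: `load_lexicon` and `naive_sentiment` are hypothetical names, the lexicon format (one word per line, with lines starting with `;` treated as comments) is an assumption, and the word lists below are made-up samples rather than the contents of the real files.

```python
import re


def load_lexicon(lines):
    """Build a set of words from an iterable of lines, one word per line.

    Assumption: blank lines and lines starting with ';' are not entries.
    """
    words = set()
    for line in lines:
        entry = line.strip().lower()
        if entry and not entry.startswith(";"):
            words.add(entry)
    return words


def naive_sentiment(text, positive, negative):
    """Return (positive hits, negative hits, net score) for a text."""
    tokens = re.findall(r"[a-z'-]+", text.lower())
    pos = sum(1 for t in tokens if t in positive)
    neg = sum(1 for t in tokens if t in negative)
    return pos, neg, pos - neg


# With the files in place, the lines would come from disk instead, e.g.
#   with open("./lexicons/positive-words.txt") as f:
#       positive = load_lexicon(f)
positive = load_lexicon(["; sample header", "safe", "welcome", ""])
negative = load_lexicon(["crisis", "flee", "danger"])

result = naive_sentiment(
    "Families flee danger during the crisis and hope for a welcome.",
    positive,
    negative,
)
print(result)  # (1, 3, -2)
```

A net score like this ignores negation and context ("not safe" still counts as positive), which is what makes the approach naive.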



thinkmpink/disaster-tracker
