A collection of scripts to parse and analyze exported Google Voice data.
So far it just parses the data and runs a few Naive Bayes classifiers, but I'm looking to expand.
-
Download and extract your Google Voice data.
-
Install the dependencies with something like
pip install -r requirements.txt
. -
Copy
settings_example.py
tosettings.py
and configure appropriately. -
Run
gv_to_db.py
to load everything into a SQLite database. -
Run
who_from.py
to run a few simple analyzers using Naive Bayes.
-
people_with_many_texts(n)
classifies the texts of people who have sent more thann
texts. -
recipient_is(name)
classifies texts into eithername
ornot_name
. -
split_me_not_me()
classifies texts into eitherme
ornot_me
depending on the sender.
-
Get your training and test sets from one of the classifiers.
-
Run
build_classifier(training_set)
. -
Run
interactive
on your classifier. -
Enter some messages.
MIT/X11 licensed, except for emoticons.py
and twokenize.py
.
emoticons.py
and twokenize.py
are copyright Brendan O'Connor, Michel Krieger, and David Ahn and licensed under the Apache License 2.0.