Pythia is Lab41's exploration of approaches to novel content detection. We are interested in making it easier to tell when a document coming into a corpus has something new to say. We welcome your contributions (see our contributor guidelines) and attention.
docker build -t lab41/pythia . # runs tests and builds project image
docker run -it lab41/pythia experiments/experiments.py with 'XGB=True' 'BOW_APPEND=True' 'BOW_PRODUCT=True'
Our code is written in Python 3. envs/make_envs.sh will install the necessary dependencies on a Debian/Ubuntu system with Anaconda installed.