Nokore - A Collection of Scripts for Detecting Spammy, Fake or Otherwise Dangerous Communications Online

This repository applies SIMOn, a character-level CNN + bidirectional LSTM modeling library for text classification, to email spam classification and to detecting other forms of dangerous, fake, or spammy communication on social media.

The repository achieves this via a collection of scripts, with the eventual goal of comparing the SIMOn-based model to the emerging giants BERT and/or ELMo.

The name "Nokore" is a word meaning "Truth" in Twi, a language of the Akan people of Ghana.

The architecture is described at https://arxiv.org/abs/1901.08456

For the SIMOn library itself, review https://github.com/algorine/simon

Also see the Texas AI Summit talk video: https://youtu.be/SmIsWF1xBeI
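
To make "character-level" concrete, here is a minimal sketch of how text might be turned into fixed-length sequences of character indices before being fed to a character-level CNN. It is an illustration only and is not SIMOn's actual preprocessing: the alphabet, the `encode_chars` helper, and the `max_len` default are all assumptions made for this example.

```python
import string

# Hypothetical alphabet for illustration; SIMOn's own character set may differ.
ALPHABET = string.ascii_lowercase + string.digits + string.punctuation + " "

# Index 0 is reserved for padding, so known characters start at 1.
CHAR_TO_INDEX = {ch: i + 1 for i, ch in enumerate(ALPHABET)}

# Any character outside the alphabet maps to one shared "unknown" index.
UNKNOWN_INDEX = len(ALPHABET) + 1


def encode_chars(text, max_len=20):
    """Map text to a list of exactly max_len character indices.

    The text is lowercased, truncated to max_len characters,
    and right-padded with zeros when it is shorter.
    """
    indices = [CHAR_TO_INDEX.get(ch, UNKNOWN_INDEX) for ch in text.lower()[:max_len]]
    return indices + [0] * (max_len - len(indices))


if __name__ == "__main__":
    print(encode_chars("Win $$$ now!", max_len=16))
```

Working at the character level means misspellings and obfuscated tokens that are common in spam still produce usable input, since no word vocabulary is required.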

Getting Started

To get started, make sure you are using Python 3.5 or later, then install SIMOn with pip:

pip3 install git+https://github.com/algorine/simon

Then, install keras-bert with pip install -q keras-bert.
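
After both installs, a quick check like the following can confirm that the environment is ready. This is a sketch under assumptions: the import names `Simon` and `keras_bert` are what these packages are assumed to expose, and `check_environment` is a hypothetical helper, not part of either library. Adjust the names if your installation differs.

```python
import importlib.util
import sys

# Assumed import names for the two packages installed above.
REQUIRED_MODULES = ("Simon", "keras_bert")


def check_environment(modules=REQUIRED_MODULES, min_python=(3, 5)):
    """Return a dict reporting the Python version check and module availability."""
    report = {"python_ok": sys.version_info[:2] >= min_python}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        report[name] = importlib.util.find_spec(name) is not None
    return report


if __name__ == "__main__":
    for item, ok in check_environment().items():
        print("{}: {}".format(item, "ok" if ok else "MISSING"))
```

The check only locates the modules without importing them, so it stays fast and does not load Keras or its backend.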

Then, study the scripts and pretrained models included in the Nokore/scripts directory.

Rendered Jupyter notebooks are also provided in the Nokore/scripts directory, and they are meant to be self-explanatory.
