TextMining

A platform for text mining purpose to see the trends of words and different phrases across different years that has a web front end. The user can give and input as a text or any type of file with text and can specify the year of that input; a platform gets input and first, processes it by extracting text from the file and tokenizing it into words and phrases using NLTK library. After the tokenization process it stores all the words and phrases in the PostgreSQL database. On the website, the user can also visualize the words and phrases by applying different filters (e.g., year filter). Visualization is represented by charts, word cloud, and different topic clusters during the specific year so that we can monitor the trends of that year. Words and phrases are separated into clusters using k-means algorithm on embeddings that are taken using the BERT model.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
TextMining		TextMining
media/tmp_wordcloud		media/tmp_wordcloud
mysite		mysite
templates		templates
.gitignore		.gitignore
README.md		README.md
db.sqlite3		db.sqlite3
manage.py		manage.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TextMining

TextMining

media/tmp_wordcloud

media/tmp_wordcloud

mysite

mysite

templates

templates

.gitignore

.gitignore

README.md

README.md

db.sqlite3

db.sqlite3

manage.py

manage.py

requirements.txt

requirements.txt

Repository files navigation

TextMining

About

Releases

Packages

Languages

nnarziev/TextMining

Folders and files

Latest commit

History

Repository files navigation

TextMining

About

Resources

Stars

Watchers

Forks

Languages