GitHub - msingh27/nlp_group6: A project on hybrid recommendation system

Hybrid Tag Recommendation System

The project aims to modify existing collaborative-filtering-based recommendation techniques so that they also use content-based information from the data.

Stack Overflow

Each data point consists of a question body and its associated tags.

preprocessor.py: Extracts keywords from the question bodies into a SQLite database and calculates the tf-idf statistic for each keyword.
stopwords.py: Outputs a CSV dataset with stopwords removed, using tf-idf scores.
tagcloud.py: Generates tag clouds from the keyword database.
autoencoder.py:
doc2vec.py:
MatrixFactorization.py:
testing_training.py:
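
The tf-idf step mentioned for preprocessor.py and stopwords.py can be sketched as follows. This is a minimal illustration, not the repository's code: the tokenizer, the exact tf-idf formula (term frequency times log(N / df)), the threshold, and the function names `tokenize`, `tf_idf`, and `low_score_terms` are all assumptions, and the SQLite storage is left out.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(documents):
    """Return one {term: tf-idf score} dict per document.

    tf is the term count divided by the document length;
    idf is log(N / df), where df is the number of documents containing the term.
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Count, for every term, how many documents contain it at least once.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


def low_score_terms(scores, threshold):
    """Collect terms whose best tf-idf score across all documents is <= threshold."""
    best = {}
    for doc_scores in scores:
        for term, score in doc_scores.items():
            best[term] = max(best.get(term, 0.0), score)
    return {term for term, score in best.items() if score <= threshold}


# Toy question bodies, made up for illustration only.
questions = [
    "How do I read a file in Python?",
    "How do I sort a list in Python?",
    "How do I parse JSON in Java?",
]
scores = tf_idf(questions)
stopwords = low_score_terms(scores, threshold=0.0)
# stopwords == {"how", "do", "i", "in"}: these occur in every question, so idf is 0
```

Terms that appear in every question get an idf of zero, so they are the first candidates for removal. A real run over the Stack Overflow data would likely need a threshold above zero, chosen by inspecting the score distribution.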

Repository files (5 commits):
.gitignore
MatrixFactorization.py
README.md
data.csv
data_full.pkl
data_question.pkl
demo.py
demo_preprocess.py
doc2vectorization.py
preprocessor.py
q_tilda_test_d100.pkl
qid_test.pkl
qid_train.pkl
stopwords.py
tag_set.pkl
tag_trained_vec.pkl
tagcloud.py
testing_training.py