Neural Collaborative Filtering with Keras, Pytorch and Gluon

This repo contains an implementation of Xiangnan He, et al, 2017 neural collaborative filtering in Keras (original paper), Gluon and Pytorch. The Keras code is mostly borrowed from the author's original repo, adapted to the new keras 2.2 API and python 3. Of course, I strongly recommend reading their paper.

Everything one needs to run the experiment is in this repo. The code is organized as follows:

The core of the repo are of course the GMF_DLFRAME.py, MLP_DLFRAME.py and NeuMF_DLFRAME.py where DLFRAME is keras, pytorch and gluon
I have also included data_preparation.py and data_comparison.ipynb. The first shows how to prepare the data for the experiment (not included in the author's original repo) and the second simply shows that the results of my data preparation and those of Xiangnan He are consistent.
If you are just interested in a comparison between the results obtained with Keras, Pytorch and Gluon, you can directly go to results_summary.ipynb.

All the experiments run are included in run_net.sh. If you clone this repo you could directly copy and paste the content in that file. For example, the following line will run a GMF model using Gluon, with batch_size 256, learning rate 0.01, 32 dim embeddings for 30 epochs:

python GMF_gluon.py --batch_size 256 --lr 0.01 --n_emb 32 --epochs 30

The best performing GMF and MLP models are included in the dir models.

Given the relative simplicity of the model, I thought this would be a good exercise to illustrate the similarities and differences between the 3 frames . In addition the results obtained turned out to be quite interesting.

The Figure below shows the Hit Ratio (HR) and Normalized Discounted Cumulative Gain (NDCG) at k=10 for the MLP, GMF models and also the training time for the MLP model.

Top: Hit Ratio (HR) and Normalized Discounted Cumulative Gain (NDCG) at k=10 for both the GMF and MLP models vs the number of embeddings. Bottom: training time for the MLP model per number of embeddings.

Overall, the results can be summarized as follows:

Keras (with the Tensorflow backend): the easiest to use and the most "stable" across set ups. Is, in general, the slowest.

Gluon: I see a lot of potential in this package and I clearly see myself using it in the future. Shows some "strange" behavior but that could be me, since this is my first time using it. Is, in general, faster than Keras and overall, the success metrics are comparable to (or better than) those obtained with Pytorch.

Pytorch: The success metrics are very good, its behavior in sensible and is the fastest of the three.

For more details, go to results_summary.ipynb

Any suggestion, email me at: jrzaurin@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Data_Javier		Data_Javier
Data_Xiangnan		Data_Xiangnan
Data_raw/ml-1m		Data_raw/ml-1m
docs/images		docs/images
models		models
Dataset.py		Dataset.py
GMF_gluon.py		GMF_gluon.py
GMF_keras.py		GMF_keras.py
GMF_pytorch.py		GMF_pytorch.py
MLP_gluon.py		MLP_gluon.py
MLP_keras.py		MLP_keras.py
MLP_pytorch.py		MLP_pytorch.py
NeuMF_gluon.py		NeuMF_gluon.py
NeuMF_keras.py		NeuMF_keras.py
NeuMF_pytorch.py		NeuMF_pytorch.py
README.md		README.md
data_comparison.ipynb		data_comparison.ipynb
data_preparation.py		data_preparation.py
evaluate_keras.py		evaluate_keras.py
plot_utils.py		plot_utils.py
results_summary.ipynb		results_summary.ipynb
run_net.sh		run_net.sh
select_NeuMF_models.py		select_NeuMF_models.py
utils.py		utils.py

kjyjxy/neural_cf

Folders and files

Latest commit

History

Repository files navigation

Neural Collaborative Filtering with Keras, Pytorch and Gluon

About

Resources

Stars

Watchers

Forks

Languages