Skip to content

mindis/Personalized-Search-Results

 
 

Repository files navigation

Personalized-Search-Results

Personalized Search Results using Spark, Mongo, LSH, AWS S3

  • 1+ Billion record are loaded into Spark dataframe from Mongodb
  • Popular items are loaded from user behavioural logs stored in S3
  • Popular items are used to find most similar items from Mongodb database.
  • Similarity is computed using Local Sensitive Hashing concept
  • Code is develoepd using Spark ML framework
  • Code can run on AWS EMR Cluster or Google Cloud Platfor's Dataprocs.

About

Personalized Search Results using Spark, Mongo, LSH

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.8%
  • Python 1.2%