Skip to content

deepesch/deepesch.github.io

Repository files navigation

Data Science Projects

Introduction

This repository is mainly used to store projects of Data Science which I have followed & worked for a period of time. You can find all the codes for the corresponding projects in the repository. The following is a list of the projects. When I set up to do these work, I was just a beginner of this field, so may be there are many questions remained that I have not noticed. Welcome to point them out and if you have any problems with my work welcome to contact me by e-mail: deepesch@gmail.com

List of the projects.

  • Twitter Cluster to stream:

    Demo of the most common techniques used in industry from clustering to streaming.

  • Wikimapper:

    Wikipedia Data Engineering project on Databricks cloud-based Spark platform. Wikimapper helps active contributors to find most searched non-existent wikipedia articles.

  • Visualization libraries in Python:

    There are many different Python plotting packages. Amoung all of the available libraries some may suit your needs better. Some of these are interfaces to existing plotting libraries while others are Python-centered new implementations.

  • Word2Vec - Kaggle Competition:

    Machine Learning Competition(Movie Ratings).

  • Naive Bayes Implementation: Natural Language Processing Project

  • Support Vector Machine Implementation: Machine Learning Project

About

Data science projects portfolio

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages