ahnqirage/avenir

Introduction

A set of predictive and exploratory data mining tools that run on Hadoop and Storm.

Philosophy

  • Simple to use
  • Input and output in CSV format
  • Metadata defined in simple JSON file
  • Extremely configurable with tons of configuration knobs

Solution

  • Exploratory analytics, including correlation and feature subset selection
  • Naive Bayes
  • Discriminant analysis
  • Nearest neighbor
  • Decision tree and Random Forest
  • Association Mining
  • Reinforcement learning

Blogs

The following blog posts of mine are a good source of details on avenir. They are the only source of detailed documentation.

Getting started

The project's resource directory contains tutorial documents for the use cases described in the blogs.

Configuration

All configuration parameters are described in the wiki page https://github.com/pranab/avenir/wiki/Configuration

Build

Please refer to resource/dependency.txt for build-time and run-time dependencies.

For Hadoop 1

  • mvn clean install

For Hadoop 2 (non yarn)

  • git checkout nuovo
  • mvn clean install

For Hadoop 2 (yarn)

  • git checkout nuovo
  • mvn clean install -P yarn
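
The three build variants above could be wrapped in a small helper, sketched here as one possible convenience and not as part of the project. The function name `build_commands` and the target names `hadoop1`, `hadoop2` and `hadoop2-yarn` are assumptions made up for this illustration; only the printed `git` and `mvn` commands come from the lists above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with avenir): prints the build commands
# listed above for a given target. The target names are illustrative only.
build_commands() {
  case "$1" in
    hadoop1)
      # Hadoop 1: build from the current branch
      echo "mvn clean install"
      ;;
    hadoop2)
      # Hadoop 2 without YARN: switch to the nuovo branch first
      echo "git checkout nuovo"
      echo "mvn clean install"
      ;;
    hadoop2-yarn)
      # Hadoop 2 with YARN: nuovo branch plus the yarn Maven profile
      echo "git checkout nuovo"
      echo "mvn clean install -P yarn"
      ;;
    *)
      echo "usage: build_commands hadoop1|hadoop2|hadoop2-yarn" >&2
      return 1
      ;;
  esac
}
```

Running `build_commands hadoop2-yarn` only prints the commands, so they can be reviewed before being run from the repository root, for example with `build_commands hadoop2-yarn | sh`.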

Help

Please feel free to email me at pkghosh99@gmail.com

Contribution

Contributors are welcome. Please email me at pkghosh99@gmail.com

About

Set of machine learning tools based on Hadoop and Storm https://pkghosh.wordpress.com/
