Ruhul964/dsa2

 
 

Check here whether the latest commit is working:

Testing code

 Main:                 test_fast_linux, test_full
 Multi:                test_fast_linux, test_full
 Preprocessors check:  test_preprocess

Looking for contributors

 Help maintain and set up the roadmap of this excellent Data Science / ML repo.
 The goal is to unify Data Science and Machine Learning.
 The basic idea is to have one single dictionary/JSON for
        model, compute, and data definition,
 --> easy to define, easy to track, easy to modify.
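
To make the single-dictionary idea concrete, here is a minimal sketch in Python. The key names (`model`, `compute`, `data` and everything nested under them) and the `update_config` helper are illustrative assumptions, not the repository's actual schema or API.

```python
import json

# Hypothetical layout: every key and value below is an illustrative
# assumption, not the schema used by this repository.
config = {
    "model": {"name": "example_classifier", "params": {"max_depth": 4}},
    "compute": {"n_jobs": 1, "seed": 42},
    "data": {
        "path_train": "data/train.csv",
        "col_target": "label",
        "cols_num": ["num_a", "num_b"],
        "cols_cat": ["cat_a"],
    },
}


def update_config(cfg, dotted_key, value):
    """Return a copy of cfg with one nested value changed (hypothetical helper)."""
    new_cfg = json.loads(json.dumps(cfg))  # deep copy via a JSON round trip
    node = new_cfg
    *parents, leaf = dotted_key.split(".")
    for key in parents:
        node = node[key]
    node[leaf] = value
    return new_cfg


# Easy to modify: change one hyperparameter without touching the original.
tuned = update_config(config, "model.params.max_depth", 8)

# Easy to track: the whole experiment definition serializes to one JSON string.
as_json = json.dumps(tuned, indent=2, sort_keys=True)
```

Because the whole definition is one JSON-serializable object, it can be stored next to the results of a run and compared between runs.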

Install

 git clone 
 cd dsa2
 pip install -r zrequirements.txt

Basic usage

cd dsa2
python  titanic_classifier.py  preprocess    --nsample 1000
python  titanic_classifier.py  train         --nsample 2000
python  titanic_classifier.py  predict
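
The three commands above follow an "action plus options" pattern. The sketch below shows one way such a command line could be dispatched, assuming the standard-library `argparse`. The handler functions and their return values are hypothetical, and the actual `titanic_classifier.py` may use a different mechanism.

```python
import argparse


def build_parser():
    # Mirrors the command shape shown above: <action> [--nsample N]
    parser = argparse.ArgumentParser(prog="titanic_classifier.py")
    parser.add_argument("action", choices=["preprocess", "train", "predict"])
    parser.add_argument("--nsample", type=int, default=None,
                        help="number of rows to use (None means no limit given)")
    return parser


# Hypothetical handlers: each only reports what it was asked to do.
def preprocess(nsample):
    return ("preprocess", nsample)


def train(nsample):
    return ("train", nsample)


def predict(nsample):
    return ("predict", nsample)


def run(argv):
    """Parse argv (a list of strings) and call the matching handler."""
    args = build_parser().parse_args(argv)
    handlers = {"preprocess": preprocess, "train": train, "predict": predict}
    return handlers[args.action](args.nsample)


# Equivalent to: python titanic_classifier.py train --nsample 2000
result = run(["train", "--nsample", "2000"])
```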

How to train on a new dataset?

 https://github.com/arita37/dsa2/issues/200

Examples

 https://github.com/arita37/dsa2/tree/main/example

List of preprocessors

    Data over/under sampling
    Category, Numerical
    Text
    Target label encoding
    Time series

    https://github.com/arita37/dsa2/issues/194
