LoanInterest

The dataset comes from a take-home assignment for a Data Scientist job application. For this assignment, I completed a small end-to-end data science project covering data cleaning, data preparation, feature exploration and engineering, model prototyping and selection, and evaluation.

The goal of the project is to predict the interest rate of loan applications from a mixture of highly heterogeneous data columns. Some columns contain useful features, while others are irrelevant. Some columns contain many missing values that need to be imputed properly.
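As a rough illustration of the imputation step mentioned above, here is a minimal sketch using only the Python standard library. It assumes a simple strategy (median for numeric columns, most frequent value for categorical ones), which is not necessarily what the notebooks do. The column names and values are hypothetical toy data, not taken from loan_interest.xlsx.

```python
from collections import Counter
from statistics import median


def impute_column(values):
    """Fill None entries: median for numeric columns, most frequent value otherwise."""
    present = [v for v in values if v is not None]
    if not present:
        # Nothing to learn a fill value from; return the column unchanged.
        return list(values)
    is_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    if is_numeric:
        fill = median(present)
    else:
        fill = Counter(present).most_common(1)[0][0]
    return [fill if v is None else v for v in values]


# Toy table with hypothetical column names (an assumption for illustration only).
table = {
    "annual_income": [52000.0, None, 61000.0, 48000.0],
    "home_ownership": ["RENT", "OWN", None, "RENT"],
}

imputed = {name: impute_column(column) for name, column in table.items()}
```

Median and mode are used here only because they are simple and robust to outliers. A column that is mostly missing may be better dropped than imputed.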

The Python code and results are organized into the following Jupyter notebooks:

Data Cleaning and Wrangling.ipynb
Missing value imputation.ipynb
Model fitting and evaluation.ipynb
Raw data.ipynb

Other repository contents: code (folder), README.md, interestrate.pdf, loan_interest.xlsx
