The main purpose of this project is to implement two statistical concepts: correlation and hypothesis testing.
- Python
- NumPy
- Pandas
- Matplotlib
- SciPy
The project uses two data sets, lusc-rsem-fpkm-tcga_paired.txt and lusc-rsem-fpkm-tcga-t_paired.txt, healthy and cancerous respectively. They hold the gene expression levels of healthy and cancerous tissue for Lung Squamous Cell Carcinoma (LUSC). The data is paired.
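Because the data is paired, the two tables are expected to line up gene by gene and sample by sample. The check below is a minimal sketch of that idea using tiny made-up tables instead of the real files. The gene names, column labels, values, and the `paired_differences` helper are all hypothetical, and the actual file layout may differ.

```python
import pandas as pd

# Hypothetical miniature tables: rows are genes, columns are samples.
healthy = pd.DataFrame(
    {"sample_1": [5.0, 0.0, 2.5], "sample_2": [6.0, 0.0, 3.5]},
    index=["GENE_A", "GENE_B", "GENE_C"],
)
cancerous = pd.DataFrame(
    {"sample_1": [9.0, 0.0, 2.0], "sample_2": [11.0, 0.0, 3.0]},
    index=["GENE_A", "GENE_B", "GENE_C"],
)


def paired_differences(healthy, cancerous):
    """Return cancerous minus healthy values after checking that the tables are aligned."""
    if not healthy.index.equals(cancerous.index):
        raise ValueError("gene rows do not match between the two tables")
    if not healthy.columns.equals(cancerous.columns):
        raise ValueError("sample columns do not match between the two tables")
    return cancerous - healthy


diff = paired_differences(healthy, cancerous)
print(diff)
```

Checking the alignment first matters because a paired test compares each value with its counterpart, so a shifted row or column would silently compare unrelated measurements.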
The project files are organized as follows:
- helpers.py: contains the functions that compute the correlation and perform the hypothesis tests.
- main.ipynb: processes the data and shows the output after each operation.
- main.py: the same as main.ipynb, but without the visual output after each step.
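As a rough illustration of the kind of function helpers.py might contain, the sketch below computes a Pearson correlation with NumPy and runs a paired t-test with SciPy. It is not the project's actual code: the function names `pearson_correlation` and `paired_t_test` and the sample values are assumptions made for this example.

```python
import numpy as np
from scipy import stats


def pearson_correlation(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()
    dy = y - y.mean()
    denom = np.sqrt(np.sum(dx ** 2) * np.sum(dy ** 2))
    if denom == 0:
        # The coefficient is undefined when either input is constant.
        return float("nan")
    return float(np.sum(dx * dy) / denom)


def paired_t_test(x, y):
    """Paired (dependent samples) t-test; returns (t statistic, p-value)."""
    result = stats.ttest_rel(x, y)
    return float(result.statistic), float(result.pvalue)


# Hypothetical expression values of one gene across five paired samples.
healthy_values = [5.0, 6.0, 5.5, 7.0, 6.5]
cancerous_values = [9.0, 11.0, 9.5, 12.5, 12.0]

r = pearson_correlation(healthy_values, cancerous_values)
t_stat, p_value = paired_t_test(healthy_values, cancerous_values)
print(f"r = {r:.3f}, t = {t_stat:.3f}, p = {p_value:.4f}")
```

A paired test (`scipy.stats.ttest_rel`) is used here because the README states that the data is paired. An independent test such as `scipy.stats.ttest_ind` would ignore that pairing.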
This project is part of the SBE304 course (Bio-Statistics) in the Systems and Biomedical Engineering Department, Cairo University.
Dr. Ibrahim Mohamed Ibrahim
TA. Eslam Adel