lung_cancer_survival_time

Quick-Start

Place these codes in the same directory as the data, putting the training folders "images" and "features" in a folder named "x_train", the training output in a folder named "y_train", and the validation folders "images" and "features" in a "x_test" folder. Run

python model.py

for method 1, and

python cnn_based_model.py

for method 2.

Work done

For this challenge my goal was to build a model using the images and the masks. I first built a model using a CoxPH Fitter, with the data from clinicals.csv and radiomics.csv. I then built a CNN based model, learning from the clinicals, radiomics, and the images, with a coxPH loss function (based on the DeepSurv model).

Model.py (Python Lifelines)

This model first takes the data, transforms it into dummy variables, then uses the CoxPHFitter from the lifelines library in python. It achieves a c-index of 0.74 for the train set, and about 0.64 for the validation set (overfitting). It could be improved through features selection with a correlation matrix or other methods, and data normalization. Most of the time for this challenge was spent on the second method.

Cnn_based_model.py (Python Keras)

This model is a multi-input neural network, taking the clinicals, radiomics as well as the scans. The first two are passed into a few Dense layers, using BatchNormalization and Dropout. The scans are passed into a ResNet50. We use a pre-trained model to balance the fact that the dataset is so small. More than the time-to-event, the model predicts for now the hazard ratio, trying to maximize the Cox Proportional Hazard Model equation. However if this model does indeed learn (as we can see in the figures below), it doesen't output satisfying variables, with a hazard-ratio of 1.0 for each input after 20 epochs.

Model using radiomics+scans after 10 epochs Model using radiomics+scans+clinicals after 20 epoch

This model could mostly be improved by a better implementation of the cox loss function.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
Appendix.md		Appendix.md
README.md		README.md
cnn_based_model.py		cnn_based_model.py
cnn_mode.png		cnn_mode.png
cnn_rad_cli_scan.png		cnn_rad_cli_scan.png
metrics_t9gbvr2.py		metrics_t9gbvr2.py
model.py		model.py
random_submission_0vhlEZN.csv		random_submission_0vhlEZN.csv
submission_alexlacour.csv		submission_alexlacour.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

Appendix.md

Appendix.md

README.md

README.md

cnn_based_model.py

cnn_based_model.py

cnn_mode.png

cnn_mode.png

cnn_rad_cli_scan.png

cnn_rad_cli_scan.png

metrics_t9gbvr2.py

metrics_t9gbvr2.py

model.py

model.py

random_submission_0vhlEZN.csv

random_submission_0vhlEZN.csv

submission_alexlacour.csv

submission_alexlacour.csv

Repository files navigation

lung_cancer_survival_time

Quick-Start

Work done

Model.py (Python Lifelines)

Cnn_based_model.py (Python Keras)

About

Releases

Packages

Languages

AlexLacour/lung_cancer_survival_time

Folders and files

Latest commit

History

Repository files navigation

lung_cancer_survival_time

Quick-Start

Work done

Model.py (Python Lifelines)

Cnn_based_model.py (Python Keras)

About

Resources

Stars

Watchers

Forks

Languages