Skip to content

schlogl2017/deep_learning_genomics_nlp

 
 

Repository files navigation

Learning the Language of the Genome Using RNNs

Epigenetics is the study of how the genome is regulated by external mechanisms. Biological experiments have shown that subsequences of the human genome are regulated by specific proteins. The purpose of this project is to explore how an RNN architecture can be used to learn sequential patterns in genomic sequences. A robust method for modeling the genome can offer insights on genetic patterns related to health and disease.

This project was for Stanford's Deep Learning in NLP course from Spring 2016.

Please see our poster and final report for details. We designed a bidirectional 2-layer RNN for multitask learning using GRUs. You can find code for the TensorFlow model we used here.

About

Final project for CS224d

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 62.6%
  • Python 21.1%
  • TeX 16.3%