Post Translational Modification Prediction

The long-term goal is to develop this project into an easy-to-use library, called ptm_Pred, for rapid prototyping and development of PTM tools.

Capstone project for Senior Year at Tulane University

A full write-up on the use of supervised learning and class imbalance methods can be found here: https://docs.google.com/document/d/1Yi3vMEq4l0SLw95HtiVRHsn010nrVaNZiZlV9pi7TjU/edit?usp=sharing

The supervised methods achieve precision and accuracy in the 80-90% range, but recall of only 10-20%.
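The gap between accuracy and recall is typical of imbalanced data, where true modification sites are rare. The sketch below computes the three metrics from confusion-matrix counts. The counts are made up purely for illustration and are not results from this project; `binary_metrics` is a hypothetical helper, not a function from this repository.

```python
def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, precision, recall) from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Illustrative, invented counts: 1000 candidate sites, 150 of them truly modified.
# The classifier rarely predicts "modified", so it misses most true sites.
accuracy, precision, recall = binary_metrics(tp=22, fp=3, tn=847, fn=128)
print(f"accuracy={accuracy:.3f} precision={precision:.3f} recall={recall:.3f}")
```

Because negatives dominate, a model that almost always predicts "not modified" still scores well on accuracy, which is why recall is worth tracking separately.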

Recently I have started using unsupervised learning methods, with interesting results. The word2vec implementations average around 75% in recall, precision, and accuracy on most post-translational modification tests. This presents a possible solution to the recall issue that has plagued post-translational modification prediction for the last decade.
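As a rough sketch of how a protein sequence can be prepared for a word2vec-style model, the example below splits a sequence into overlapping k-mers ("words") and averages their vectors into one feature vector. The k-mer size, the averaging step, the function names, and the toy embedding are all assumptions made for illustration; they are not taken from `w2vImp.py`. In practice the embedding would come from a trained model (for example gensim's `Word2Vec`) rather than a hand-written dictionary.

```python
def kmers(sequence, k=3):
    """Split a protein sequence into overlapping k-mers ("words")."""
    sequence = sequence.strip().upper()
    # Sequences shorter than k yield an empty list.
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def average_vectors(tokens, embedding):
    """Average the vectors of known tokens; return None if none are known."""
    known = [embedding[t] for t in tokens if t in embedding]
    if not known:
        return None
    dim = len(known[0])
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


# Toy 2-dimensional embedding (hypothetical values, stands in for a trained model).
toy_embedding = {"MKT": [1.0, 0.0], "KTA": [0.0, 1.0]}

tokens = kmers("MKTAYIAK", k=3)
features = average_vectors(tokens, toy_embedding)
print(tokens)
print(features)
```

The averaged vector can then be passed to any standard classifier as a fixed-length feature set.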

TODO:

Write FASTA -> CSV converter for benchmark tests

Integrate benchmarks into the word2vec implementation.

Try prot2vec implementations

Try using exon/intron as an additional feature set.
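The FASTA -> CSV converter from the list above could start from something like the following. This is a minimal sketch using only the standard library; the column names (`id`, `sequence`), the function names, and the choice to keep only the first word of each header as the ID are assumptions, since the benchmark format is not specified here.

```python
import csv
import io


def parse_fasta(handle):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one first.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def fasta_to_csv(in_handle, out_handle):
    """Write one CSV row per FASTA record; return the number of records."""
    writer = csv.writer(out_handle, lineterminator="\n")
    writer.writerow(["id", "sequence"])
    count = 0
    for header, sequence in parse_fasta(in_handle):
        # Assumption: the record ID is the first whitespace-separated token.
        record_id = header.split(maxsplit=1)[0] if header else ""
        writer.writerow([record_id, sequence])
        count += 1
    return count


# Made-up demo records; real use would pass open file objects instead.
demo = io.StringIO(">demo1 hypothetical protein\nMKTAYIAK\nQRQISFVK\n>demo2\nMSLT\n")
out = io.StringIO()
n_records = fasta_to_csv(demo, out)
print(out.getvalue())
```

For files on disk, the same function works with `open(path)` for input and `open(path, "w", newline="")` for output.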

Notes:

The data posted comes from dbptm.mbc.nctu.edu.tw, which is a great resource for protein-related machine learning projects.
