Using Deep Learning to Annotate the Protein Universe

Understanding the relationship between amino acid sequence and protein function is a long-standing problem in molecular biology with far-reaching scientific implications. Despite six decades of progress, state-of-the-art techniques cannot annotate 1/3 of microbial protein sequences, hampering our ability to exploit sequences collected from diverse organisms. In this code, i explore an alternative methodology based on deep learning that learns the relationship between unaligned amino acid sequences and their functional annotations across all 17929 families of the Pfam database.

My study focused on only 600 families out of all the families included in the dataset.

Model Architecture

	#Architecture
Model

Result:

	(Training) Accuracy vs Validation Accuracy	(Training) Loss vs Validation Loss
result

Model Evaluation

Notice:

pre-trainde model: https://drive.google.com/file/d/12ZsTkRlEPG8DL50Wb_tdDmHINv9pKTbj/view?usp=share_link

pre-trainde model weights: https://drive.google.com/file/d/1bj4uJBu7rbO6OaIZg--IkOC5yke_WiLn/view?usp=share_link

Tokenizer: https://drive.google.com/file/d/1-01g2VBsa6hMSCRB-DGylfffJDrCRXu4/view?usp=share_link

References:

https://www.biorxiv.org/content/10.1101/626507v4.full.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
Using_Deep_Learning_to_Annotate_the_Protein_Universe.ipynb		Using_Deep_Learning_to_Annotate_the_Protein_Universe.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using Deep Learning to Annotate the Protein Universe

Model Architecture

Result:

Model Evaluation

Notice:

References:

About

Releases

Packages

Languages

kaledhoshme123/Using-Deep-Learning-to-Annotate-the-Protein-Universe

Folders and files

Latest commit

History

Repository files navigation

Using Deep Learning to Annotate the Protein Universe

Model Architecture

Result:

Model Evaluation

Notice:

References:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages