ProALIGN Directly learning alignments for protein structure prediction via exploiting context-specifice alignment motifs

Introduction

we report a novel deep learning approach ProALIGN that can predict much more ac-curate sequence-template alignment. Like protein sequences consisting of sequence motifs, protein alignments are also composed of frequently-occurring alignment motifs with charac-teristic patterns. Alignment motifs are context-speciﬁc as their characteristic patterns are tightly related to sequence contexts of the aligned regions. Inspired by this observation, we represent a protein alignment as a binary matrix (in which 1 denotes an aligned residue pair) and then use a deep convolutional neural network to predict the optimal alignment from the query protein and its template. The trained neural network implicitly but eﬀectively encodes an alignment scoring function, which reduces inaccuracies in the handcrafted scoring func-tions widely used by the current threading approaches. For a query protein and a template, we apply the neural network to directly infer likelihoods of all possible residue pairs in their entirety, which could eﬀectively consider the correlations among multiple residues. We further construct the alignment with maximum likelihood, and ﬁnally build structure model accord-ing to the alignment.

Publications

No Publication Information

Credits

Dongbo Bu dbu@ict.ac.cn
Investigator
University of Chinese Academy of Sciences, Beijing, China, China

Community Ratings

Usability	Efficiency	Reliability	Rated By
			0 user
Sign in to rate

Summary

Accession	BT007267
Tool Type	Application
Category	Protein structures
Platforms	Linux/Unix
Technologies	C++
User Interface
Input Data	FASTA
Latest Release	1.0 (September 15, 2021)
Download Count	356
Country/Region	China
Submitted By	Shaoliang Peng

Fundings

2018YFC0910400