Published on Mon Oct 07 2019

Weighted graphlets and deep neural networks for protein structure classification

Hongyu Guo, Khalique Newaz, Scott Emrich, Tijana Milenkovic, Jun Li

Analysis of protein structures can help predict protein functions. We develop a weighted network that depicts the protein structures. We also develop a deep neural network (DNN) to use this measure for classification.

0
0
0
Abstract

As proteins with similar structures often have similar functions, analysis of protein structures can help predict protein functions and is thus important. We consider the problem of protein structure classification, which computationally classifies the structures of proteins into pre-defined groups. We develop a weighted network that depicts the protein structures, and more importantly, we propose the first graphlet-based measure that applies to weighted networks. Further, we develop a deep neural network (DNN) composed of both convolutional and recurrent layers to use this measure for classification. Put together, our approach shows dramatic improvements in performance over existing graphlet-based approaches on 36 real datasets. Even comparing with the state-of-the-art approach, it almost halves the classification error. In addition to protein structure networks, our weighted-graphlet measure and DNN classifier can potentially be applied to classification of other weighted networks in computational biology as well as in other domains.

Thu Apr 12 2018
Machine Learning
Network-based protein structural classification
Protein structural classification (PSC) can help determine structural classes of currently unclassified proteins. Existing PSC approaches rely on sequence-based or direct 3-dimensional (3D) structure-based protein features. In contrast, we first model 3D structures of proteins as protein structure
0
0
0
Fri Oct 30 2020
Machine Learning
PersGNN: Applying Topological Data Analysis and Geometric Deep Learning to Structure-Based Protein Function Prediction
Protein structure-function relationships is a key challenge in computational biology. We present an end-to-end trainable deep learning model that combines graph representation learning with topological data analysis to capture a complex set of features.
0
0
0
Fri Jul 24 2020
Machine Learning
Hierarchical Protein Function Prediction with Tail-GNNs
Protein function prediction may be framed as predicting subgraphs (with certain closure properties) of a directed acyclic graph. Graph neural networks (GNNs) are naturally suited for this task. But in contrast with most GNN applications, the graph is not related to the
0
0
0
Tue May 10 2016
Machine Learning
MUST-CNN: A Multilayer Shift-and-Stitch Deep Convolutional Architecture for Sequence-based Protein Structure Prediction
Predicting protein properties is an important task in bioinformatics. We propose a deep convolutional neural network to predict protein properties. We beat the state-of-the-art performance on two large protein property prediction datasets.
0
0
0
Tue Dec 22 2020
Machine Learning
Deep Multi-attribute Graph Representation Learning on Protein Structures
Graphs as a type of data structure have recently attracted significant attention. Proteins are complex macromolecules composed of thousands of atoms. This makes them much harder to model than micro-molecules. We propose a graph neural network architecture to represent the proteins as 3D graphs.
0
0
0
Fri Nov 23 2018
Artificial Intelligence
Spectral Multigraph Networks for Discovering and Fusing Relationships in Molecules
Spectral Graph Convolutional Networks (GCNs) are a generalization of convolutional networks to learning on graph-structured data. Chebyshev GCNsrict graphs to have at most one edge between any pair of nodes.
0
0
0