
Using Alias Sampling Strategy Based on Network Embeddings to Detect Protein Complexes

  • Dalian Maritime University

Research output: Contribution to journal › Article › peer-review

Abstract

Detecting protein complexes from available protein-protein interaction (PPI) data helps to elucidate the mechanisms of biological activity. In recent years, various computational methods have been developed for identifying protein complexes from PPI networks, and most of them rely primarily on topological analysis of those networks. However, most fail to capture, at the same time, the global and local topological structure of a PPI network and the diversity of connectivity patterns between individual nodes. To address this problem, we propose a node-embedding-based alias sampling extension method for detecting protein complexes. Specifically, for a given set of seed nodes, the method first uses an alias sampling strategy based on protein node embedding similarities to select candidate nodes that could be added. It then uses a new conductance measure, which better quantifies the likelihood that a subgraph is a protein complex, to decide whether to extend the current candidate subgraph. Evaluated on six real yeast PPI networks, our method outperforms state-of-the-art methods in detecting protein complexes. Furthermore, the experimental results show that the protein complexes predicted by our method have higher biological significance.
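The two ingredients named in the abstract can be illustrated with a minimal sketch. It assumes that embedding similarities have already been turned into non-negative weights over candidate nodes, and it uses the classical conductance definition as a stand-in, because the abstract does not specify the paper's new conductance measure. The function names (`build_alias_table`, `alias_draw`, `conductance`) and the toy graph are hypothetical, chosen only for illustration.

```python
import random


def build_alias_table(weights):
    """Vose's alias method: O(n) setup that allows O(1) draws afterwards."""
    n = len(weights)
    total = float(sum(weights))
    if n == 0 or total <= 0:
        raise ValueError("weights must be non-empty with a positive sum")
    # Scale so that the average entry equals 1.
    scaled = [w * n / total for w in weights]
    prob = [0.0] * n
    alias = list(range(n))
    small = [i for i, p in enumerate(scaled) if p < 1.0]
    large = [i for i, p in enumerate(scaled) if p >= 1.0]
    while small and large:
        s, l = small.pop(), large.pop()
        prob[s] = scaled[s]   # chance of keeping s when its slot is picked
        alias[s] = l          # otherwise fall through to l
        scaled[l] = scaled[l] + scaled[s] - 1.0
        (small if scaled[l] < 1.0 else large).append(l)
    # Entries left over (by exhaustion or rounding) are always kept.
    for i in small + large:
        prob[i] = 1.0
    return prob, alias


def alias_draw(prob, alias, rng):
    """Draw one index with probability proportional to the original weights."""
    i = rng.randrange(len(prob))
    return i if rng.random() < prob[i] else alias[i]


def conductance(adj, subgraph):
    """Classical conductance: cut edges / min(vol(S), vol(rest)).

    Stand-in only; this is NOT the new measure proposed in the paper.
    """
    s = set(subgraph)
    cut = sum(1 for u in s for v in adj[u] if v not in s)
    vol_s = sum(len(adj[u]) for u in s)
    vol_rest = sum(len(adj[u]) for u in adj if u not in s)
    denom = min(vol_s, vol_rest)
    return cut / denom if denom else 0.0


# Hypothetical similarity weights for three candidate nodes.
prob, alias = build_alias_table([1.0, 0.0, 3.0])
rng = random.Random(0)
draws = [alias_draw(prob, alias, rng) for _ in range(20000)]

# Toy graph: two triangles joined by the single edge c-d.
adj = {
    "a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"},
    "d": {"c", "e", "f"}, "e": {"d", "f"}, "f": {"d", "e"},
}
phi = conductance(adj, {"a", "b", "c"})
```

In an extension loop of the kind the abstract describes, a candidate drawn this way would be added to the current subgraph only if the quality score improves. Low classical conductance indicates a subgraph that is dense inside and sparsely connected to the rest of the network.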

Original language: English
Article number: 9268940
Pages (from-to): 211773-211783
Number of pages: 11
Journal: IEEE Access
Volume: 8
State: Published - 2020

Keywords

  • alias sampling
  • node embedding
  • PPI network
  • Protein complex
