University of Wollongong

Leveraging Stacked Denoising Autoencoder in Prediction of Pathogen-Host Protein-Protein Interactions

conference contribution
posted on 2024-11-14, 10:20 authored by Huaming Chen, Jun Shen, Lei Wang, Jiangning Song
In big data research related to bioinformatics, one of the most critical areas is proteomics. In this paper, we focus on protein-protein interactions, especially pathogen-host protein-protein interactions (PHPPIs), which reveal critical molecular processes in biology. Conventionally, biologists identify these interactions with in-lab methods, including small-scale biochemical, biophysical, and genetic experiments as well as large-scale experimental methods (e.g. yeast two-hybrid analysis). These in-lab methods are time-consuming and labor-intensive. Since interactions between proteins from different species play critical roles in both infectious diseases and drug design, the motivation behind this study is to provide biologists with a basic framework based on big data analytics and deep learning models. Our contribution is to leverage an unsupervised learning model, specifically stacked denoising autoencoders, to achieve more efficient prediction performance on PHPPIs. We further detail this unsupervised-learning framework for PHPPI research and curate a large, imbalanced PHPPI dataset. Our model demonstrates better results with the unsupervised learning model on the PHPPI dataset.

Citation

Chen, H., Shen, J., Wang, L. & Song, J. (2017). Leveraging Stacked Denoising Autoencoder in Prediction of Pathogen-Host Protein-Protein Interactions. IEEE 6th International Congress on Big Data (pp. 368-375). United States: IEEE.

Parent title

Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017

Pagination

368-375

Language

English

RIS ID

112537
