Logo image
Prediction of protein aggregation propensity employing SqFt-based logistic regression model
Journal article   Peer reviewed

Prediction of protein aggregation propensity employing SqFt-based logistic regression model

Fatemeh Eshari, Fahime Momeni, Amirreza Faraj Nezhadi, Soudabeh Shemehsavar and Mehran Habibi-Rezaei
International journal of biological macromolecules, Vol.249, 126036
2023
PMID: 37516225

Abstract

Logistic regression Machine learning Protein aggregation
Here we present a novel machine-learning approach to predict protein aggregation propensity (PAP) which is a key factor in the formation of amyloid fibrils based on logistic regression (LR). Amyloid fibrils are associated with various neurodegenerative diseases (ND) such as Alzheimer's disease (AD) and Parkinson's disease (PD), which are caused by oxidative stress and impaired protein homeostasis. Accordingly, the paper uses a dataset of hexapeptides with known aggregation tendencies and eight physiochemical features to train and test the LR model. Also, it evaluates the performance of the LR model using F-measure and Matthews correlation coefficient (MCC) as metrics and compares it with other existing methods. Moreover, it investigates the effect of combining sequence and feature information in the prediction. In conclusion, the LR model with sequence and feature information achieves high F-measure (0.841) and MCC (0.6692), outperforming other methods and demonstrating its efficiency and reliability for PAP prediction. In addition, the overall performance of the concluded method was higher than the other known servers, for instance, Aggrescan, Metamyl, Foldamyloid, and PASTA 2.0. The LR model can be accessed at: https://github.com/KatherineEshari/Protein-aggregation-prediction. [Display omitted]

Details

UN Sustainable Development Goals (SDGs)

This output has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

Source: InCites

Metrics

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this output

Collaboration types
Domestic collaboration
International collaboration
Citation topics
1 Clinical & Life Sciences
1.52 Neurodegenerative Diseases
1.52.57 Alzheimer's Mechanisms
Web Of Science research areas
Biochemistry & Molecular Biology
Chemistry, Applied
Polymer Science
ESI research areas
Biology & Biochemistry
Logo image