搜索

x
中国物理学会期刊

蛋白质pKa预测模型研究进展

Progress in protein pKa prediction

PDF
HTML
导出引用
  • pH表征溶液的酸碱性, 是许多与人类重大疾病密切相关的生命活动的调控因子. \mathrmpK_\mathrma 决定可滴定基团在一定pH条件下的去质子化平衡, 是研究pH调控的生物化学过程的重要参量. 然而, 由于蛋白质结构的复杂性以及实验条件的限制, 蛋白质 \mathrmpK_\mathrma 通常需要借助理论预测. 近30年, 研究者们开发了各种基于先验知识的 \mathrmpK_\mathrma 预测模型. 随着近几年人工智能技术的快速发展, 人们开始尝试将人工智能算法应用于蛋白质 \mathrmpK_\mathrma 预测工具的开发. 本文介绍 \mathrmpK_\mathrma 理论预测近年来的一些重要研究进展, 主要包括恒定pH分子动力学以及基于泊松-玻尔兹曼方程、经验函数和机器学习的 \mathrmpK_\mathrma 预测模型. 在此基础上, 讨论蛋白质 \mathrmpK_\mathrma 预测模型的未来发展方向和应用前景.

     

    The pH value represents the acidity of the solution and plays a key role in many life events linked to human diseases. For instance, the β-site amyloid precursor protein cleavage enzyme, BACE1, which is a major therapeutic target of treating Alzheimer’s disease, functions within a narrow pH region around 4.5. In addition, the sodium-proton antiporter NhaA from Escherichia coli is activated only when the cytoplasmic pH is higher than 6.5 and the activity reaches a maximum value around pH 8.8. To explore the molecular mechanism of a protein regulated by pH, it is important to measure, typically by nuclear magnetic resonance, the binding affinities of protons to ionizable key residues, namely \mathrmpK_\mathrma values, which determine the deprotonation equilibria under a pH condition. However, wet-lab experiments are often expensive and time consuming. In some cases, owing to the structural complexity of a protein, \mathrmpK_\mathrma measurements become difficult, making theoretical \mathrmpK_\mathrma predictions in a dry laboratory more advantageous. In the past thirty years, many efforts have been made to accurately and fast predict protein \mathrmpK_\mathrma with physics-based methods. Theoretically, constant pH molecular dynamics (CpHMD) method that takes conformational fluctuations into account gives the most accurate predictions, especially the explicit-solvent CpHMD model proposed by Huang and coworkers (2016 J. Chem. Theory Comput. 12 5411) which in principle is applicable to any system that can be described by a force field. However, lengthy molecular simulations are usually necessary for the extensive sampling of conformation. In particular, the computational complexity increases significantly if water molecules are included explicitly in the simulation system. Thus, CpHMD is not suitable for high-throughout computing requested in industry circle. To accelerate \mathrmpK_\mathrma prediction, Poisson-Boltzmann (PB) or empirical equation-based schemes, such as H++ and PropKa, have been developed and widely used where \mathrmpK_\mathrma values are obtained via one-structure calculations. Recently, artificial intelligence (AI) is applied to the area of protein \mathrmpK_\mathrma prediction, which leads to the development of DeepKa by Huang laboratory (2021 ACS Omega 6 34823), the first AI-driven \mathrmpK_\mathrma predictor. In this paper, we review the advances in protein \mathrmpK_\mathrma prediction contributed mainly by CpHMD methods, PB or empirical equation-based schemes, and AI models. Notably, the modeling hypotheses explained in the review would shed light on future development of more powerful protein \mathrmpK_\mathrma predictors.

     

    目录

    /

    返回文章
    返回