- Previous Article
- Next Article
- Table of Contents
Chemical Engineering Science, Vol.121, 340-350, 2015
Predicting recombinant protein expression experiments using molecular dynamics simulation
Soluble expression of de novo-designed proteins in Escherichia coli (E. col remains empirical. For given experimental conditions expression success is determined in part by protein primary sequence. This has been previously explored with varying success using a variety of statistical solubility prediction tools though without taking fold stability into account. In the present study, the three-dimensional structure of proteins in molecular dynamics (MD) simulations is used to predict expression as a new approach with a set of four-helix bundles. Stability-related parameters for ten structures were determined in a thermal unfolding MD simulation and used to build statistical models with a support vector machine (SVM) classifier. The most accurate models were identified by their performance on five independent Foul-helix bundle sequences. The final model provided accurate classification prediction for this test set and was successfully applied in a model challenge with two newly designed sequences. The combination of simulation-derived parameters and an SVM classifier has potential to predict recombinant expression outcome for this set of four-helix bundles. With further development. this approach of utilizing higher-dimensional protein structural information to predict expression may have potential to advance recombinant biotechnology through modern computational and statistical science. (C) 2014 Elsevier Ltd. All rights reserved.