Ab Initio Structure Prediction & Protein Design 7.91 Amy Keating
Ab initio prediction
• Ab initio = “from the beginning”; in the strictest sense, it uses first principles only, not information about other protein structures
• In practice, all methods rely on empirical observations about other structures
  – Force fields
  – Knowledge-based scoring functions
  – Training sets
  – Fragment structures
A good review: Bonneau, R., and D. Baker. "Ab Initio Protein Structure Prediction: Progress and Prospects." Annu Rev Biophys Biomol Struct 30 (2001): 173-89.
Approaches to ab initio folding
• Full MD with explicit solvation (e.g., IBM Blue Gene)
  – VERY expensive
  – May not work
• Reduced-complexity models
  – No side chains (sometimes no main-chain atoms either!)
  – Reduced degrees of freedom
  – On- or off-lattice
  – Generally have a solvation-based score and a knowledge-based residue-residue interaction term
  – Sometimes used as a first step to prune the enormous conformational space; resolution is then increased for later fine-tuning
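To make the reduced-complexity idea concrete, here is a minimal sketch of a one-bead-per-residue chain on a 2D square lattice, scored with a solvation-like term plus a residue-residue contact term. Everything specific in it is an assumption for illustration: the hydrophobic residue set, the made-up `PAIR_ENERGY` table, the weights, and the function names. It is not taken from any published model.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Tuple

Coord = Tuple[int, int]

# Assumed hydrophobic classification (illustrative only).
HYDROPHOBIC = set("AVLIMFWC")

# Made-up contact energies standing in for a knowledge-based table.
PAIR_ENERGY: Dict[FrozenSet[str], float] = {
    frozenset(("L", "L")): -0.5,
    frozenset(("K", "E")): -1.0,
}

NEIGHBOURS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def exposure_penalty(seq: str, coords: List[Coord]) -> float:
    """Solvation-like term: count empty lattice sites next to hydrophobic residues."""
    occupied = set(coords)
    penalty = 0
    for aa, (x, y) in zip(seq, coords):
        if aa in HYDROPHOBIC:
            penalty += sum((x + dx, y + dy) not in occupied for dx, dy in NEIGHBOURS)
    return float(penalty)


def pair_energy(seq: str, coords: List[Coord]) -> float:
    """Contact term: sum table energies over lattice neighbours that are not chain neighbours."""
    total = 0.0
    for i, j in combinations(range(len(seq)), 2):
        if j - i < 2:
            continue  # skip residues adjacent in sequence
        (xi, yi), (xj, yj) = coords[i], coords[j]
        if abs(xi - xj) + abs(yi - yj) == 1:
            total += PAIR_ENERGY.get(frozenset((seq[i], seq[j])), 0.0)
    return total


def score(seq: str, coords: List[Coord], w_solv: float = 1.0, w_pair: float = 1.0) -> float:
    """Lower is better: weighted sum of the two terms."""
    return w_solv * exposure_penalty(seq, coords) + w_pair * pair_energy(seq, coords)


seq = "LGGL"
compact = [(0, 0), (1, 0), (1, 1), (0, 1)]    # chain folded into a square
extended = [(0, 0), (1, 0), (2, 0), (3, 0)]   # fully stretched chain
print(score(seq, compact), score(seq, extended))  # 3.5 6.0
```

The compact conformation scores lower because both leucines lose exposed sites and gain one L-L contact. A cheap score like this can rank many conformations quickly before any higher-resolution refinement.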
ROSETTA - the most successful approach to ab initio prediction
• David Baker, U. Washington, Seattle
• Based on the idea that the possible conformations of any short peptide fragment (3-9 residues) are well represented by the structures it is observed to adopt in the PDB
• Generate a library of different possible structures for each sequence segment
• Search the possible combinations of these for ones that are protein-like by various criteria
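The search over fragment combinations can be sketched as a Monte Carlo fragment-insertion loop. This is a toy illustration under stated assumptions, not the ROSETTA implementation: the conformation is just a list of (phi, psi) angles, the Metropolis acceptance rule and extended starting chain are choices made for this example, and `toy_score`, `fragment_assembly`, and the tiny library are hypothetical.

```python
import math
import random
from typing import Callable, Dict, List, Tuple

Torsions = List[Tuple[float, float]]  # one (phi, psi) pair per residue


def fragment_assembly(
    n_res: int,
    library: Dict[int, List[Torsions]],   # start position -> candidate fragments
    score_fn: Callable[[Torsions], float],
    n_steps: int = 1000,
    temperature: float = 1.0,
    seed: int = 0,
) -> Tuple[Torsions, float]:
    """Repeatedly swap in library fragments; accept by the Metropolis criterion."""
    rng = random.Random(seed)
    current: Torsions = [(-150.0, 150.0)] * n_res  # assumed extended start
    current_score = score_fn(current)
    best, best_score = list(current), current_score
    positions = sorted(library)
    for _ in range(n_steps):
        start = rng.choice(positions)
        frag = rng.choice(library[start])
        if start + len(frag) > n_res:
            continue  # fragment would run off the end of the chain
        trial = list(current)
        trial[start:start + len(frag)] = frag  # replace torsions in this window
        trial_score = score_fn(trial)
        delta = trial_score - current_score
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current, current_score = trial, trial_score
            if current_score < best_score:
                best, best_score = list(current), current_score
    return best, best_score


def toy_score(torsions: Torsions) -> float:
    """Placeholder for 'protein-like' criteria: distance from helical angles."""
    return sum(abs(phi + 60.0) + abs(psi + 45.0) for phi, psi in torsions)


# Hypothetical library: a helical and a strand-like 3-mer at every window.
helix = [(-60.0, -45.0)] * 3
strand = [(-120.0, 130.0)] * 3
toy_library = {start: [helix, strand] for start in range(4)}

model, model_score = fragment_assembly(6, toy_library, toy_score)
print(len(model), model_score)
```

In a real setting the score would combine several protein-likeness criteria, and the library would hold the per-window fragments drawn from known structures.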
ROSETTA fragment libraries
• Remove all homologs of the protein to be modeled (>25% sequence identity)
• For each 9-residue segment in the target, use sequence similarity and secondary structure similarity (compare the predicted secondary structure of the target to the fragment's secondary structure) to select ~25 fragments
• Because secondary structure is influenced by tertiary structure, ensure that the fragments span different secondary structures
• The extent to which the fragments cluster around a consensus structure is correlated with how good a model the fragment is likely to be for the target
[Figure: example 9-residue segment, LSERTVARS]
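The selection step for one 9-residue window can be sketched as follows. This is a simplified illustration, not the actual ROSETTA fragment picker: sequence similarity is reduced to a plain count of identical positions, the `Candidate` record, the `ss_weight` parameter, and the example database entries are hypothetical, and the step that enforces secondary-structure diversity among the chosen fragments is left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Candidate:
    source_id: str             # protein the fragment was cut from
    sequence: str              # fragment sequence
    secondary: str             # fragment secondary structure (H/E/C per residue)
    identity_to_target: float  # % sequence identity of source protein to target


def match_count(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings agree."""
    return sum(x == y for x, y in zip(a, b))


def select_fragments(
    window_seq: str,
    window_ss: str,            # predicted secondary structure for the window
    candidates: List[Candidate],
    n_keep: int = 25,
    max_identity: float = 25.0,
    ss_weight: float = 1.0,    # assumed relative weight of the SS term
) -> List[Candidate]:
    """Drop homologs, then keep the n_keep best-matching fragments."""
    eligible = [
        c for c in candidates
        if c.identity_to_target <= max_identity and len(c.sequence) == len(window_seq)
    ]

    def similarity(c: Candidate) -> float:
        return (match_count(window_seq, c.sequence)
                + ss_weight * match_count(window_ss, c.secondary))

    return sorted(eligible, key=similarity, reverse=True)[:n_keep]


# Hypothetical mini-database; the window's helix prediction is assumed.
database = [
    Candidate("homolog", "LSERTVARS", "HHHHHHHHH", 40.0),  # removed: >25% identity
    Candidate("protA", "LSEKTVAKS", "HHHHHHHHH", 10.0),
    Candidate("protB", "GGGGGGGGG", "CCCCCCCCC", 5.0),
]
picked = select_fragments("LSERTVARS", "HHHHHHHHH", database, n_keep=2)
print([c.source_id for c in picked])  # ['protA', 'protB']
```

Filtering homologs before ranking keeps the exact-match fragment out of the library, so the prediction cannot simply copy a related structure.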