World Library  
Flag as Inappropriate
Email this Article

Protein folding

Article Id: WHEBN0000052085
Reproduction Date:

Title: Protein folding  
Author: World Heritage Encyclopedia
Language: English
Subject: Protein, Prefoldin, Protein engineering, Protein domain, List of biophysicists
Collection: Protein Folding, Protein Structure
Publisher: World Heritage Encyclopedia

Protein folding

Protein before and after folding.
Results of protein folding.

Protein folding is the process by which a protein structure assumes its functional shape or conformation. It is the physical process by which a polypeptide folds into its characteristic and functional three-dimensional structure from random coil.[1] Each protein exists as an unfolded polypeptide or random coil when translated from a sequence of mRNA to a linear chain of amino acids. This polypeptide lacks any stable (long-lasting) three-dimensional structure (the left hand side of the first figure). Amino acids interact with each other to produce a well-defined three-dimensional structure, the folded protein (the right hand side of the figure), known as the native state. The resulting three-dimensional structure is determined by the amino acid sequence (Anfinsen's dogma).[2] Experiments [3] beginning in the 1980s indicate the codon for an amino acid can also influence protein structure.

The correct three-dimensional structure is essential to function, although some parts of functional proteins may remain unfolded.[4] Failure to fold into native structure generally produces inactive proteins, but in some instances misfolded proteins have modified or toxic functionality. Several neurodegenerative and other diseases are believed to result from the accumulation of amyloid fibrils formed by misfolded proteins.[5] Many allergies are caused by incorrect folding of some proteins, for the immune system does not produce antibodies for certain protein structures.[6]


  • Known facts 1
    • Relationship between folding and amino acid sequence 1.1
    • Disruption of the native state 1.2
    • Incorrect protein folding and neurodegenerative disease 1.3
    • Effect of external factors on the folding of proteins 1.4
    • The Levinthal paradox and kinetics 1.5
  • Experimental techniques for studying protein folding 2
    • Protein nuclear magnetic resonance spectroscopy 2.1
    • Circular dichroism 2.2
    • Dual polarisation interferometry 2.3
    • Vibrational circular dichroism of proteins 2.4
    • Studies of folding with high time resolution 2.5
    • Proteolysis 2.6
    • Optical tweezers 2.7
  • Computational methods for studying protein folding 3
    • Energy landscape of protein folding 3.1
    • Modeling of protein folding 3.2
  • See also 4
  • References 5
  • External links 6

Known facts

Relationship between folding and amino acid sequence

Illustration of the main driving force behind protein structure formation. In the compact fold (to the right), the hydrophobic amino acids (shown as black spheres) are in general shielded from the solvent.

The amino-acid sequence of a protein determines its native conformation.[7] A protein molecule folds spontaneously during or after biosynthesis. While these macromolecules may be regarded as "folding themselves", the process also depends on the solvent (water or lipid bilayer),[8] the concentration of salts, the pH, the temperature, the possible presence of cofactors and of molecular chaperones.

Minimizing the number of hydrophobic side-chains exposed to water is an important driving force behind the folding process.[9] Formation of intramolecular hydrogen bonds provides another important contribution to protein stability.[10] The strength of hydrogen bonds depends on their environment, thus H-bonds enveloped in a hydrophobic core contribute more than H-bonds exposed to the aqueous environment to the stability of the native state.[11]

The process of folding often begins heat shock proteins. Although most globular proteins are able to assume their native state unassisted, chaperone-assisted folding is often necessary in the crowded intracellular environment to prevent aggregation; chaperones are also used to prevent misfolding and aggregation that may occur as a consequence of exposure to heat or other changes in the cellular environment.

There are two models of protein folding that are currently being confirmed:

  • The diffusion collision model, in which a nucleus is formed, then the secondary structure is formed, and finally these secondary structures are collided together and pack tightly together.
  • The nucleation-condensation model, in which the secondary and tertiary structures of the protein are made at the same time.

Recent studies have shown that some proteins show characteristics of both of these folding models.

For the most part, scientists have been able to study many identical molecules folding together en masse. At the coarsest level, it appears that in transitioning to the native state, a given amino acid sequence takes on roughly the same route and proceeds through roughly the same intermediates and transition states. Often folding involves first the establishment of regular secondary and supersecondary structures, in particular alpha helices and beta sheets, and afterward tertiary structure. Formation of quaternary structure usually involves the "assembly" or "coassembly" of subunits that have already folded. The regular alpha helix and beta sheet structures fold rapidly because they are stabilized by intramolecular hydrogen bonds, as was first characterized by Linus Pauling. Protein folding may involve covalent bonding in the form of disulfide bridges formed between two cysteine residues or the formation of metal clusters. Shortly before settling into their more energetically favourable native conformation, molecules may pass through an intermediate "molten globule" state.

The essential fact of folding, however, remains that the amino acid sequence of each protein contains the information that specifies both the native structure and the pathway to attain that state. This is not to say that nearly identical amino acid sequences always fold similarly.[13] Conformations differ based on environmental factors as well; similar proteins fold differently based on where they are found. Folding is a spontaneous process independent of energy inputs from nucleoside triphosphates. The passage of the folded state is mainly guided by hydrophobic interactions, formation of intramolecular hydrogen bonds, and van der Waals forces, and it is opposed by conformational entropy.

Disruption of the native state

Under some conditions proteins will not fold into their biochemically functional forms. Temperatures above or below the range that cells tend to live in will cause thermally unstable proteins to unfold or "denature" (this is why boiling makes an egg white turn opaque). High concentrations of solutes, extremes of pH, mechanical forces, and the presence of chemical denaturants can do the same. Protein thermal stability is far from constant, however. For example, hyperthermophilic bacteria have been found that grow at temperatures as high as 122 °C,[14] which of course requires that their full complement of vital proteins and protein assemblies be stable at that temperature or above.

A fully denatured protein lacks both tertiary and secondary structure, and exists as a so-called random coil. Under certain conditions some proteins can refold; however, in many cases, denaturation is irreversible.[15] Cells sometimes protect their proteins against the denaturing influence of heat with enzymes known as chaperones or heat shock proteins, which assist other proteins both in folding and in remaining folded. Some proteins never fold in cells at all except with the assistance of chaperone molecules, which either isolate individual proteins so that their folding is not interrupted by interactions with other proteins or help to unfold misfolded proteins, giving them a second chance to refold properly. This function is crucial to prevent the risk of precipitation into insoluble amorphous aggregates.

Incorrect protein folding and neurodegenerative disease

Aggregated proteins are associated with prion-related illnesses such as Creutzfeldt-Jakob disease, bovine spongiform encephalopathy (mad cow disease), amyloid-related illnesses such as Alzheimer's disease and familial amyloid cardiomyopathy or polyneuropathy,[16] as well as intracytoplasmic aggregation diseases such as Huntington's and Parkinson's disease.[5][17] These age onset degenerative diseases are associated with the aggregation of misfolded proteins into insoluble, extracellular aggregates and/or intracellular inclusions including cross-beta sheet amyloid fibrils. While it is not completely clear whether the aggregates are the cause or merely a reflection of the loss of protein homeostasis, the balance between synthesis, folding, aggregation and protein turnover, the recent European Medicines Agency approval of Tafamidis or Vyndaqel (a kinetic stabilizer of tetrameric transthyretin) for the treatment of the transthyretin amyloid diseases suggests that it is the process of amyloid fibril formation and not the fibrils themselves that causes the degeneration of post-mitotic tissue in human amyloid diseases.[18] Misfolding and excessive degradation instead of folding and function leads to a number of proteopathy diseases such as antitrypsin-associated emphysema, cystic fibrosis and the lysosomal storage diseases, where loss of function is the origin of the disorder. While protein replacement therapy has historically been used to correct the latter disorders, an emerging approach is to use pharmaceutical chaperones to fold mutated proteins to render them functional.

Effect of external factors on the folding of proteins

Several external factors such as temperature, external fields (electric, magnetic),[19] molecular crowding,[20] and limitation of space could have a big influence on the folding of proteins.[21] Modification of the local minima by external factors can also induce modifications of the folding trajectory.

Protein folding is a very finely tuned process. Hydrogen bonding between different atoms provides the force required. Hydrophobic interactions between hydrophobic amino acids pack the hydrophobic residues

The Levinthal paradox and kinetics

Levinthal's paradox is a thought experiment, also constituting a self-reference in the theory of protein folding. In 1969, Cyrus Levinthal noted that, because of the very large number of degrees of freedom in an unfolded polypeptide chain, the molecule has an astronomical number of possible conformations. An estimate of 3300 or 10143 was made in one of his papers.

The Levinthal paradox[22] observes that if a protein were folded by sequentially sampling of all possible conformations, it would take an astronomical amount of time to do so, even if the conformations were sampled at a rapid rate (on the nanosecond or picosecond scale). Based upon the observation that proteins fold much faster than this, Levinthal then proposed that a random conformational search does not occur, and the protein must, therefore, fold through a series of meta-stable intermediate states.

The duration of the folding process varies dramatically depending on the protein of interest. When studied outside the cell, the slowest folding proteins require many minutes or hours to fold primarily due to proline isomerization, and must pass through a number of intermediate states, like checkpoints, before the process is complete.[23] On the other hand, very small single-domain proteins with lengths of up to a hundred amino acids typically fold in a single step.[24] Time scales of milliseconds are the norm and the very fastest known protein folding reactions are complete within a few microseconds.[25]

Experimental techniques for studying protein folding

While inferences about protein folding can be made through mutation studies; typically, experimental techniques for studying protein folding rely on the gradual unfolding or folding of proteins and observing conformational changes using standard non-crystallographic techniques.

Protein nuclear magnetic resonance spectroscopy

Protein folding is routinely studied using NMR spectroscopy, for example by monitoring hydrogen-deuterium exchange of backbone amide protons of proteins in their native state which provides both the residue-specific stability and overall stability of proteins.[26]

Circular dichroism

Circular dichroism is one of the most general and basic tools to study protein folding. Circular dichroism spectroscopy measures the absorption of circularly polarized light. In proteins, structures such as alpha helices and beta sheets are chiral, and thus absorb such light. The absorption of this light acts as a marker of the degree of foldedness of the protein ensemble. This technique has been used to measure equilibrium unfolding of the protein by measuring the change in this absorption as a function of denaturant concentration or temperature. A denaturant melt measures the free energy of unfolding as well as the protein's m value, or denaturant dependence. A temperature melt measures the melting temperature (Tm) of the protein. This type of spectroscopy can also be combined with fast-mixing devices, such as stopped flow, to measure protein folding kinetics and to generate chevron plots.

Dual polarisation interferometry

Dual polarisation interferometry is a surface based technique for measuring the optical properties of molecular layers. When used to characterise protein folding, it measures the conformation by determining the overall size of a monolayer of the protein and its density in real time at sub-Angstrom resolution . Although real time, measurement of the kinetics of protein folding are limited to processes that occur slower than ~10 Hz. Similar to circular dichroism the stimulus for folding can be a denaturant or temperature.

Vibrational circular dichroism of proteins

The more recent developments of vibrational circular dichroism (VCD) techniques for proteins, currently involving Fourier transform (FFT) instruments, provide powerful means for determining protein conformations in solution even for very large protein molecules. Such VCD studies of proteins are often combined with X-ray diffraction of protein crystals, FT-IR data for protein solutions in heavy water (D2O), or ab initio quantum computations to provide unambiguous structural assignments that are unobtainable from CD.

Studies of folding with high time resolution

The study of protein folding has been greatly advanced in recent years by the development of fast, time-resolved techniques. These are experimental methods for rapidly triggering the folding of a sample of unfolded protein, and then observing the resulting dynamics. Fast techniques in widespread use include neutron scattering,[27] ultrafast mixing of solutions, photochemical methods, and laser temperature jump spectroscopy. Among the many scientists who have contributed to the development of these techniques are Jeremy Cook, Heinrich Roder, Harry Gray, Martin Gruebele, Brian Dyer, William Eaton, Sheena Radford, Chris Dobson, Alan Fersht, Bengt Nölting and Lars Konermann.


Proteolysis is routinely used to probe the fraction unfolded under a wide range of solution conditions (e.g. Fast parallel proteolysis (FASTpp).[28][29]

Optical tweezers

Single molecule techniques, such as optical tweezers and AFM, have been used to understand protein folding mechanisms of isolated proteins as well as proteins with chaperones.[30] Optical tweezers have been used to stretch single protein molecules from their C- and N-termini and unfold them and study the subsequent refolding.[31] The technique allows one to measure folding rates at single-molecule level. For example optical tweezers have been recently applied to study folding and unfolding of proteins involved in blood coagulation. von Willebrand factor (vWF) is a protein with an essential role in blood clot formation process. It is discovered -using single molecule optical tweezers measurement - that calcium-bound vWF acts as a shear force sensor in the blood. Shear force leads to unfolding of the A2 domain of vWF whose refolding rate is dramatically enhanced in the presence of calcium.[32] Recently, it was also shown that the simple src SH3 domain accesses multiple unfolding pathways under force.[33]

Computational methods for studying protein folding

Energy landscape of protein folding

The protein folding phenomenon was largely an experimental endeavor until the formulation of an energy landscape theory of proteins by Joseph Bryngelson and Peter Wolynes in the late 1980s and early 1990s. This approach introduced the principle of minimal frustration.[34] This principle says that nature has chosen amino acid sequences so that the folded state of the protein is very stable. In addition, the undesired interactions between amino acids along the folding pathway are reduced making the acquisition of the folded state a very fast process. Even though nature has reduced the level of frustration in proteins, some degree of it remains up to now as can be observed in the presence of local minima in the energy landscape of proteins. A consequence of these evolutionarily selected sequences is that proteins are generally thought to have globally "funneled energy landscapes" (coined by José Onuchic)[35] that are largely directed toward the native state. This "folding funnel" landscape allows the protein to fold to the native state through any of a large number of pathways and intermediates, rather than being restricted to a single mechanism. The theory is supported by both computational simulations of model proteins and experimental studies,[34] and it has been used to improve methods for protein structure prediction and design.[34] The description of protein folding by the leveling free-energy landscape is also consistent with the 2nd law of thermodynamics.[36] Physically, thinking of landscapes in terms of visualizable potential or total energy surfaces simply with maxima, saddle points, minima, and funnels, rather like geographic landscapes, is perhaps a little misleading. The relevant description is really a high-dimensional phase space in which manifolds might take a variety of more complicated topological forms.[37]

Modeling of protein folding

Folding@home uses Markov state models, like the one diagrammed here, to model the possible shapes and folding pathways a protein can take as it condenses from its initial randomly coiled state (left) into its native 3D structure (right).

De novo or ab initio techniques for computational protein structure prediction are related to, but strictly distinct from experimental studies of protein folding. Molecular Dynamics (MD) is an important tool for studying protein folding and dynamics in silico.[38] First equilibrium folding simulations were done using implicit solvent model and umbrella sampling.[39] Because of computational cost, ab initio MD folding simulations with explicit water are limited to peptides and very small proteins.[40][41] MD simulations of larger proteins remain restricted to dynamics of the experimental structure or its high-temperature unfolding. In order to simulate long-time folding processes (beyond about 1 microsecond), like folding of small-size proteins (about 50 residues) or larger, some approximations or simplifications in protein models may be introduced to speed-up the calculation process.[42][43]

The 40-petaFLOP distributed computing project Folding@home simulates protein folding using the idle processing time of CPUs and GPUs of personal computers from volunteers. The project aims to understand protein misfolding and accelerate drug design for disease research.

Long continuous-trajectory simulations have been performed on Anton, a massively parallel supercomputer designed and built around custom ASICs and interconnects by D. E. Shaw Research. The longest published result of a simulation performed using Anton is a 1.112 millisecond simulation of NTL9 at 355 K.[44]

See also


  1. ^  
  2. ^  
  3. ^ Robert Saunders, Charlotte M. Deane (2010). "Synonymous codon usage influences the local protein structure observed". Nucleic Acids Research (Oxford University Press) 38 (19): 6719–6728.  
  4. ^ Jeremy M. Berg, John L. Tymoczko,  
  5. ^ a b Dennis J. Selkoe (2003). "Folding proteins in fatal ways". Nature 426 (6968): 900–904.  
  6. ^ Alberts, Bruce, Dennis Bray, Karen Hopkin, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Walter. "Protein Structure and Function." Essential Cell Biology. Edition 3. New York: Garland Science, Taylor and Francis Group, LLC, 2010. Pg 120-170.
  7. ^ Anfinsen CB. (20 July 1973). "Principles that Govern the Folding of Protein Chains". Science. 181 (4096): 223–230.  
  8. ^ van den Berg, B., Wain, R.,  
  9. ^ Pace, C., Shirley, B., McNutt, M., Gajiwala, K. (1 January 1996). "Forces contributing to the conformational stability of proteins". FASEB J. 10 (1): 75–83.  
  10. ^ Rose, G., Fleming, P., Banavar, J., Maritan, A. (2006). "A backbone-based theory of protein folding". Proc. Natl. Acad. Sci. U.S.A. 103 (45): 16623–33.  
  11. ^ Deechongkit, S., Nguyen, H., Dawson, P. E., Gruebele, M., Kelly, J. W. (2004). "Context Dependent Contributions of Backbone H-Bonding to β-Sheet Folding Energetics". Nature 403 (45): 101–5.  
  12. ^ Lee, S., Tsai, F. (2005). "Molecular chaperones in protein quality control". J. Biochem. Mol. Biol. 38 (3): 259–65.  
  13. ^ Alexander, P. A., He Y., Chen, Y., Orban, J., Bryan, P. N. (2007). "The design and characterization of two proteins with 88% sequence identity but different structure and function". Proc Natl Acad Sci U S A. 104 (29): 11963–8.  
  14. ^ Takai, K., Nakamura, K., Toki, T., Tsunogai, U., Miyazaki, M., Miyazaki, J., Hirayama, H., Nakagawa, S., Nunoura, T., Horikoshi, K. (2008). "Cell proliferation at 122 °C and isotopically heavy CH4 production by a hyperthermophilic methanogen under high-pressure cultivation". Proc Natl Acad Sci USA 105 (31): 10949–54.  
  15. ^ Shortle, D. (1 January 1996). "The denatured state (the other half of the folding equation) and its role in protein stability". FASEB J. 10 (1): 27–34.  
  16. ^ Hammarstrom, P., et al., Prevention of Transthyretin Amyloid Disease by Changing Protein Misfolding Energetics. Science, 2003. 299(5607): p. 713-716.
  17. ^ Chiti, F.; Dobson, C. (2006). "Protein misfolding, functional amyloid, and human disease". Annual review of biochemistry 75: 333–366.  
  18. ^ Johnson, S.M., et al., Native State Kinetic Stabilization as a Strategy To Ameliorate Protein Misfolding Diseases: A Focus on the Transthyretin Amyloidoses. Acc. Chem. Res., 2005. 38(12): p. 911-921.
  19. ^ Ojeda, P., Garcia, M. (2010). "Electric Field-Driven Disruption of a Native β-Sheet Protein Conformation and Generation of a Helix-Structure". Biophysical Journal 99 (2): 595–599.  
  20. ^ Berg, B., Ellis, J., Dobson, C. (1999). "Effects of macromolecular crowding on protein folding and aggregation". The EMBO Journal 18 (24): 6927–6933.  
  21. ^ Ellis RJ (July 2006). "Molecular chaperones: assisting assembly in addition to folding". Trends in Biochemical Sciences 31 (7): 395–401.  
  22. ^  
  23. ^ Kim, P. S., Baldwin, R. L. (1990). "Intermediates in the folding reactions of small proteins". Annu. Rev. Biochem. 59: 631–60.  
  24. ^ Jackson S. E. (August 1998). "How do small single-domain proteins fold?". Fold Des 3 (4): R81–91.  
  25. ^ Kubelka, J., Hofrichter, J., Eaton, W. A. (February 2004). "The protein folding 'speed limit'". Curr. Opin. Struct. Biol. 14 (1): 76–88.  
  26. ^ Beatrice M.P. Huyghues-Despointes, C. Nick Pace,S. Walter Englander, and J. Martin Scholtz. "Measuring the Conformational Stability of a Protein by Hydrogen Exchange." Methods in Molecular Biology. Kenneth P. Murphy Ed. Humana Press, Totowa, New Jersey, 2001. Pg. 69-92
  27. ^ Bu, Z., Cook, J.,  
  28. ^ Minde, D.P., Maurice, M.M., and Rudiger, S.G.D. (2012). "Determining biophysical protein stability in lysates by a Fast Proteolysis Assay, FASTpp". PLOS One 7 (10): e46147.  
  29. ^ Park, C., and Marqusee, S. (2005). "Pulse proteolysis: a simple method for quantitative determination of protein stability and ligand binding". Nat. Methods 2 (3): 207–212.  
  30. ^ Mashaghi et al. Chaperone Action at the Single-Molecule Level Chemical Reviews Article ASAP (2013) [1]
  31. ^ Jagannathan, B; Marqusee, S (Nov 2013). "Protein folding and unfolding under force". Biopolymers 99: 860–869.  
  32. ^ Jakobi AJ, Mashaghi A, Tans SJ, Huizinga EG. Calcium modulates force sensing by the von Willebrand factor A2 domain. Nature Commun. 2011 Jul 12;2:385. [2]
  33. ^ Jagannathan, B; Marqusee, S (2012). "Direct observation of a force-induced switch in the anisotropic mechanical unfolding pathway of a protein". PNAS 109 (44): 17820–17825.  
  34. ^ a b c Bryngelson, J. D.,  
  35. ^ Leopold, P. E., Montal, M. and  
  36. ^ Sharma, V., Kaila, V. R. I., and Annila, A. (2009). "Protein folding as an evolutionary process". Physica A 388 (6): 851–862.  
  37. ^ Robson, B, Vaithilingham A. (2008). Protein Folding Revisited. Progress in Molecular Biology and Translational Science, Molecular Biology of Protein Folding. 84:161-202, Elsevier Press/Academic Press
  38. ^ Rizzuti, B., and Daggett, V. (2013). "Using simulations to provide the framework for experimental protein folding studies". Arch. Biochem. Biophys. 531 (1-2): 128–135.  
  39. ^ Schaefer, Michael; Bartels, Christian, Karplus, Martin (3 December 1998). "Solution conformations and thermodynamics of structured peptides: molecular dynamics simulation with an implicit solvation model1". Journal of Molecular Biology 284 (3): 835–848.  
  40. ^ "Fragment-based Protein Folding Simulations". 
  41. ^ "Protein folding" (by Molecular Dynamics). 
  42. ^ Kmiecik, S., and Kolinski, A. (2007). "Characterization of protein-folding pathways by reduced-space modeling". Proc. Natl. Acad. Sci. U.S.A. 104 (30): 12330–5.  
  43. ^ Adhikari AN, Freed KF and Sosnick TR (2012). "De novo prediction of protein folding pathways and structure using the principle of sequential stabilization". Proc. Natl. Acad. Sci. U.S.A. 109 (43): 17442–17447.  
  44. ^ Kresten Lindorff-Larsen, Stefano Piana, Ron O. Dror, and David E. Shaw, "How Fast-Folding Proteins Fold," Science, vol. 334, no. 6055, 2011, pp. 517–520. (Abstract)

External links

  • FoldIt - Folding Protein Game
  • Folding@Home
  • Rosetta@Home
  • Human Proteome Folding Project
  • BHAGEERATH-H: Protein tertiary structure prediction server
This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.

Copyright © World Library Foundation. All rights reserved. eBooks from Hawaii eBook Library are sponsored by the World Library Foundation,
a 501c(4) Member's Support Non-Profit Organization, and is NOT affiliated with any governmental agency or department.