Beijing University of Chemical Technology: Organic Chemistry course teaching resources (English lecture notes), Chapter 26 Amino Acids, Peptides, Proteins and Nucleic Acids: Nitrogen-Containing Polymers in Nature

The peptide bond is planar and fairly rigid at room temperature. The N-H hydrogen is almost always located trans to the carbonyl oxygen, and rotation about the C-N bond is slow (the C-N bond has partial double-bond character). Both bonds adjacent to the amide function enjoy free rotation. Polypeptides can therefore assume many different conformations; however, one conformation, the native state of the peptide, usually has a much lower free energy than the others.

Polypeptides are characterized by their sequence of amino acid residues. The amino end, or N-terminal amino acid, is always placed at the left when drawing a polypeptide chain. The C-terminal amino acid is assumed to be on the right, and the configuration at all of the C2 stereocenters is assumed to be S (L). The main chain is the chain incorporating the peptide bonds, and the side chains are the substituents R, R’, R’’, etc. A polypeptide is named by starting at the amino-terminal end and listing the names of the amino acids in sequence. Three-letter abbreviations are often used.

Several small peptides are physiologically significant. Aspartame, a dipeptide, is an artificial sweetener (NutraSweet). Glutathione functions as a biological reducing agent and is unusual in that it contains a γ-glutamyl peptide bond, that is, an amide bond formed through the side-chain γ-carboxy group of glutamic acid. Gramicidin S is a cyclic peptide antibiotic in which two identical pentapeptides have been joined head to tail; it contains the rare amino acid ornithine. Insulin is a hormone that circulates in the blood and helps regulate the concentration of blood glucose. It contains two polypeptide chains and three disulfide bonds.

Proteins fold into pleated sheets and helices: secondary and tertiary structures.

The sequence of amino acids in a peptide chain is called its primary structure. As the peptide chain folds into its most stable conformation, the spatial arrangement of amino acid residues that lie close to one another in the chain is called its secondary structure. Two important types of secondary structure are the pleated sheet, or β-structure, and the α-helix.

In the pleated sheet, two chains line up with the carbonyl groups of one chain opposite the amino (N-H) groups of the other, so that hydrogen bonds can form between them. Additional chains may be bonded to either side to construct a “sheet” of chains connected by hydrogen bonds. This arrangement can also be formed by a single polypeptide chain folding back and forth on itself several times. β-Pleated sheets impart considerable rigidity to the system.

The α-helix contains 3.6 amino acid residues per turn and is held together by hydrogen bonds between carbonyl groups and amino groups within the same chain: the carbonyl group of one amino acid hydrogen bonds with the amino group of the amino acid four residues ahead in the sequence. Two equivalent points in adjacent turns of the helix are 5.4 Å apart. Too much charge of the same kind, or the presence of the amino acid proline, may disrupt secondary structure.

The final overall folding of the entire polypeptide chain is called the tertiary structure of the chain. A variety of forces are involved in stabilizing the tertiary structure:
• Disulfide bridges
• Hydrogen bonds
• London forces
• Electrostatic attraction and repulsion
• Micellar effects (hydrophobic effect)

Pronounced folding is found in the globular proteins (chemical transport, catalysis, etc.). In fibrous proteins (myosin, fibrin, α-keratin), several α-helices are coiled together to form a superhelix.

Enzymes and transport proteins fold in such a way as to produce three-dimensional pockets or grooves on their surfaces called active sites or binding sites. The size and shape of these sites provide a very specific fit for the intended substrate or ligand. The inner surface of an active site generally contains a specific arrangement of side chains of polar amino acids that attract functional groups on the substrate by hydrogen bonding or ionic interactions. Active sites align the functional groups on the enzyme and substrates in such a way as to promote the associated chemical reaction.

A typical example of an enzyme is chymotrypsin, which catalyzes the hydrolysis of specific peptide bonds (those on the carboxy side of phenylalanine, tyrosine, or tryptophan) at physiological temperature and pH.

Exposure of a protein to extremes of heat or pH usually causes denaturation, or breakdown, of its tertiary structure. In some proteins, several polypeptide chains, each with its own tertiary structure, assemble to form a larger structure called a quaternary structure.
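The two α-helix figures quoted earlier in this section can be combined into a small worked calculation. The sketch below assumes the usual reading of those numbers (3.6 residues per turn, 5.4 Å between equivalent points on adjacent turns) and treats the helix as ideal; the function names are made up for illustration.

```python
# Idealized alpha-helix geometry, using only the two figures given in the text.
RESIDUES_PER_TURN = 3.6   # residues in one full turn of the helix
PITCH_ANGSTROM = 5.4      # distance between equivalent points on adjacent turns


def rise_per_residue():
    """Distance (in angstroms) the helix advances along its axis per residue."""
    return PITCH_ANGSTROM / RESIDUES_PER_TURN


def rotation_per_residue():
    """Angle (in degrees) by which each residue is rotated from the previous one."""
    return 360.0 / RESIDUES_PER_TURN


def helix_length(n_residues):
    """Approximate axial length (in angstroms) of an ideal helix of n residues."""
    return n_residues * rise_per_residue()


def hydrogen_bond_pairs(n_residues):
    """Pairs (i, i + 4), 1-based: the C=O of residue i bonds to the N-H of residue i + 4."""
    return [(i, i + 4) for i in range(1, n_residues - 3)]


if __name__ == "__main__":
    print(f"rise per residue: {rise_per_residue():.2f} angstroms")
    print(f"rotation per residue: {rotation_per_residue():.0f} degrees")
    print("hydrogen bonds in a 10-residue helix:", hydrogen_bond_pairs(10))
```

The rise of about 1.5 Å per residue follows directly from 5.4 / 3.6. Real helices deviate from these ideal values, so the lengths are estimates only.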

26-5 Determination of Primary Structure: Amino Acid Sequencing

First, purify the polypeptide. Protein purification involves the successive application of a variety of physical and chemical techniques:
• Dialysis (size)
• Gel filtration (size, shape)
• Ion-exchange chromatography (charge)
• Electrophoresis (charge)
• Affinity chromatography (specific binding ability)

Second, determine which amino acids are present. The peptide is first hydrolyzed with 6 N HCl at 110 °C for 24 hours. The numbers and types of free amino acids present are then determined using an automated amino acid analyzer.

Third, sequence the peptide from the amino (N-terminal) end. The sequence of amino acids in a peptide chain is determined using the Edman degradation. In this process, one amino acid at a time is released from the N-terminus of the polypeptide chain in the form of a phenylthiohydantoin. Since each amino acid produces a different phenylthiohydantoin, the amino acid sequence can be readily determined.

The chopping up of longer chains is achieved with enzymes. The Edman degradation can only be used for relatively short peptides (about 50 residues). For longer peptides it is necessary to break the chains into specific shorter fragments using a selective chemical or enzymatic process. These fragments can then be isolated and individually sequenced.

The order of the fragments within the original peptide must next be determined. A second fragmentation of another sample of the original peptide is carried out using a different chemical or enzymatic process. The two sets of peptide fragments are examined for overlap peptides, which allow both sets of fragments to be correctly assembled.

Given the three peptides produced by trypsin cleavage, it is impossible to tell whether the fragment ending in Arg or the fragment ending in Lys started the original peptide chain. The single Ala fragment can be identified as the C-terminal fragment, since it is the only fragment that does not end in Lys or Arg. Hydrolysis of the B-chain by a different proteolytic enzyme (or determination of the N-terminal amino acid of the intact B-chain) is necessary to completely order all fragments.
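The overlap argument above can be illustrated with a minimal sketch. It assumes idealized, complete cleavage on the carboxy side of Lys and Arg (trypsin) and of Phe, Tyr, and Trp (chymotrypsin), ignoring any sequence-dependent exceptions. The seven-residue peptide and all function names are hypothetical, invented only for this example.

```python
from itertools import permutations

# Residues after which each enzyme is assumed to cut (idealized rules).
TRYPSIN = {"Lys", "Arg"}
CHYMOTRYPSIN = {"Phe", "Tyr", "Trp"}

# Hypothetical peptide, written from the N-terminus to the C-terminus.
peptide = ["Gly", "Phe", "Arg", "Val", "Tyr", "Lys", "Ala"]


def cleave(sequence, after):
    """Split a residue list after every residue whose name is in `after`."""
    fragments, current = [], []
    for residue in sequence:
        current.append(residue)
        if residue in after:
            fragments.append(current)
            current = []
    if current:  # the C-terminal fragment need not end in a cleavage residue
        fragments.append(current)
    return fragments


def spans_junction(first, second, overlap):
    """True if `overlap` occurs in first + second and crosses the join between them."""
    joined = first + second
    cut = len(first)
    n = len(overlap)
    for start in range(len(joined) - n + 1):
        if joined[start:start + n] == overlap and start < cut < start + n:
            return True
    return False


def order_fragments(fragments, overlaps):
    """Brute-force search for an order in which every junction is spanned by an overlap peptide."""
    for candidate in permutations(fragments):
        junctions = zip(candidate, candidate[1:])
        if all(any(spans_junction(a, b, o) for o in overlaps) for a, b in junctions):
            return list(candidate)
    return None  # the overlaps do not determine a complete order


if __name__ == "__main__":
    tryptic = cleave(peptide, TRYPSIN)
    chymotryptic = cleave(peptide, CHYMOTRYPSIN)
    print("tryptic fragments:", tryptic)
    print("chymotryptic fragments:", chymotryptic)
    print("ordered:", order_fragments(list(reversed(tryptic)), chymotryptic))
```

Here the chymotryptic fragment Arg-Val-Tyr bridges the two tryptic fragments ending in Arg and Lys, which fixes their order. Trying every permutation is practical only for a handful of fragments. It is used here for clarity, not as a realistic assembly method.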

Each tRNA contains three bases that are complementary to the group of three bases on the mRNA that specifies a particular amino acid. At another site on the tRNA, that particular amino acid is attached by an enzyme known as an aminoacyl-tRNA synthetase. The function of the ribosome is simply to match specific tRNAs to the groups of three bases on the mRNA and to polymerize the amino acids thus assembled.

26-11 DNA Sequencing and Synthesis: Cornerstones of Gene Technology

Rapid DNA sequencing has deciphered the human genome.

In a method similar to that employed in protein sequencing, the long DNA molecule is first cleaved at specific points into more manageable fragments using enzymes known as restriction endonucleases. There are more than 200 such enzymes. The sequence of each individual shortened fragment of DNA can then be determined by a chemical (Maxam-Gilbert) or enzymatic (Sanger) procedure.

In the Maxam-Gilbert method, a sample of the polynucleotide is first labeled at its 5’ end with radioactive 32P to enable its detection after the next step. Next, four individual DNA samples are each subjected to chemical degradation. Each degradation is specific for one of the four bases present and cleaves the polynucleotide chain at that position. The concentration of the reagent is adjusted so that, on average, each molecule in the sample is cleaved only once. This results in all possible fragments starting at the radioactive label and ending at a particular occurrence of the base being analyzed. Electrophoresis is used to separate the various fragments produced from the four degradations. The distance each fragment migrates in the electric field depends on its length (shorter fragments travel farther), which allows the overall sequence to be determined.

In the Sanger method, the piece of DNA to be analyzed (the template strand, starting from the 3’ end) is replicated many times by the enzyme DNA polymerase using a mixture of all four deoxynucleoside triphosphates as substrates. The process is started by adding a short piece of complementary DNA known as the primer strand, which is then extended by the DNA polymerase. Again, four different experiments are performed. In each experiment, one of the four target bases is selected and a small amount of the corresponding fluorescent-dye-labeled dideoxyribonucleoside triphosphate is added to the mixture. The concentration of the dideoxy compound is chosen so that approximately one molecule is incorporated per newly synthesized DNA strand. The incorporation of a dideoxy molecule terminates the synthesis of the new chain, because it lacks the 3’-hydroxy group needed to attach the next nucleotide. This results in a labeled set of all possible sequences ending with the target base, just as in the Maxam-Gilbert method.
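The way a sequence is read from the four chain-termination experiments can be sketched as follows. This is an idealized model, not a description of any real instrument: it assumes every possible terminated fragment is produced and detected, counts fragment lengths from the first nucleotide added after the primer, and uses a made-up eight-base template. All function names are hypothetical.

```python
# Watson-Crick pairing for DNA.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def new_strand(template_3_to_5):
    """New strand made by the polymerase, written 5'->3', for a template written 3'->5'."""
    return "".join(COMPLEMENT[base] for base in template_3_to_5)


def terminated_lengths(strand, base):
    """Lengths of all fragments of `strand` that end in the dideoxy form of `base`."""
    return [i + 1 for i, b in enumerate(strand) if b == base]


def read_sequence(lengths_by_base):
    """Order all fragments from shortest to longest and read off their final bases."""
    ladder = sorted(
        (length, base)
        for base, lengths in lengths_by_base.items()
        for length in lengths
    )
    return "".join(base for _, base in ladder)


if __name__ == "__main__":
    template = "TACGGATC"          # hypothetical template, written 3'->5'
    strand = new_strand(template)  # what the polymerase synthesizes
    experiments = {base: terminated_lengths(strand, base) for base in "ACGT"}
    for base, lengths in experiments.items():
        print(f"dideoxy-{base} experiment: fragment lengths {lengths}")
    print("sequence read from the ladder:", read_sequence(experiments))
```

Sorting by length plays the role of electrophoresis. Each length occurs in exactly one of the four experiments, so the order of the fragments spells out the newly synthesized strand, and the template is its complement. The same ladder-reading logic applies to the fragments produced by the chemical degradation method.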