Amino Acid Chemistry: Structure, Properties, and Biological Roles
General Structure
All 20 standard amino acids share the same core structure. The alpha carbon (C-alpha) is bonded to four different groups: an amino group (NH2), a carboxyl group (COOH), a hydrogen atom, and a variable side chain called the R group. Because the alpha carbon has four different substituents (except in glycine, where the R group is simply a hydrogen atom), it is a chiral center. All amino acids in proteins are the L-isomer, a convention that was established early in life's evolution and has been conserved across all known organisms.
At physiological pH (approximately 7.4), amino acids exist as zwitterions, meaning the amino group is protonated (NH3+) and the carboxyl group is deprotonated (COO-). This means that even though the molecule carries both a positive and a negative charge, the net charge depends on the pH and the nature of the side chain. The pH at which an amino acid carries no net charge is called its isoelectric point (pI), a property that is important for laboratory techniques such as electrophoresis and isoelectric focusing.
Classification by Side Chain
The 20 standard amino acids are classified into groups based on the chemical properties of their side chains. These properties determine how each amino acid contributes to protein structure, folding, and function.
Nonpolar (hydrophobic) amino acids have side chains composed mainly of carbon and hydrogen atoms. This group includes glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, tryptophan, and methionine. In water-soluble proteins, these residues tend to cluster in the interior of the folded protein, away from water, forming a hydrophobic core that is a major driving force for protein folding. Proline is unique because its side chain forms a ring with the backbone nitrogen, restricting the backbone's rotational freedom and often introducing bends or kinks in the polypeptide chain.
Polar uncharged amino acids have side chains that can form hydrogen bonds with water but do not carry a formal charge at physiological pH. This group includes serine, threonine, asparagine, glutamine, tyrosine, and cysteine. Serine and threonine are frequently phosphorylated by kinases, a covalent modification that regulates protein activity. Cysteine is notable for its sulfhydryl (SH) group, which can form disulfide bonds (S-S) with another cysteine, covalently linking two parts of a protein or two separate protein chains.
Positively charged (basic) amino acids carry a positive charge on their side chain at physiological pH. Lysine has a long, flexible side chain ending in an amino group. Arginine has a guanidinium group that carries a delocalized positive charge and forms multiple hydrogen bonds. Histidine has an imidazole ring with a pKa near 6, close to physiological pH, which means it can switch between protonated and deprotonated states depending on the local environment. This property makes histidine a common participant in enzyme active sites, where it can act as both a proton donor and acceptor during catalysis.
Negatively charged (acidic) amino acids carry a negative charge at physiological pH. Aspartate and glutamate each have a carboxyl group in their side chain that is deprotonated under normal cellular conditions. These residues frequently participate in salt bridges with positively charged residues and in coordinating metal ions in metalloenzymes.
The Peptide Bond
Amino acids are linked together by peptide bonds, formed through a condensation reaction between the carboxyl group of one amino acid and the amino group of the next. This reaction releases a molecule of water and creates a covalent C-N bond. The resulting chain of amino acids is called a polypeptide.
The peptide bond has partial double-bond character because of resonance between the carbonyl oxygen and the amide nitrogen. This means the six atoms of the peptide bond unit (C-alpha, C, O, N, H, and the next C-alpha) lie in a rigid, planar arrangement. Rotation is restricted around the C-N bond itself but permitted around the bonds connecting each peptide unit to its alpha carbons (the phi and psi angles). The specific phi and psi angles adopted by each residue determine the secondary structure of the protein.
In the cell, peptide bond formation does not occur spontaneously because the equilibrium strongly favors hydrolysis. Protein synthesis requires energy input from GTP and is catalyzed by the ribosome, where the peptidyl transferase activity of the large ribosomal subunit (actually a ribozyme, since the catalytic component is rRNA rather than protein) forms each new peptide bond. Peptide bond hydrolysis, the reverse reaction, is catalyzed by proteases and is thermodynamically favorable.
Essential and Nonessential Amino Acids
Human cells can synthesize 11 of the 20 standard amino acids from metabolic intermediates. These are called nonessential amino acids, not because they are unimportant, but because the body does not require them from the diet. The remaining 9 amino acids (histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine) cannot be synthesized by human cells and must be obtained from food. These are the essential amino acids.
Some amino acids are conditionally essential, meaning they become essential under certain physiological conditions. Arginine, for example, is normally nonessential in adults but becomes essential during periods of rapid growth, illness, or recovery from injury because the body's synthetic capacity cannot keep up with demand. Tyrosine is nonessential when phenylalanine is abundant because it is synthesized from phenylalanine by the enzyme phenylalanine hydroxylase, but it becomes essential in individuals with phenylketonuria (PKU) who lack this enzyme.
Complete proteins, found primarily in animal sources such as meat, eggs, and dairy, contain all nine essential amino acids in adequate proportions. Many plant proteins are incomplete, lacking or having insufficient amounts of one or more essential amino acids. Lysine is often the limiting amino acid in grains, while methionine is limiting in legumes. Combining complementary plant protein sources (such as rice and beans) within the overall diet provides all essential amino acids.
Amino Acids Beyond Proteins
Although their primary role is as protein building blocks, amino acids serve many other functions in cellular metabolism. Glutamate is the most abundant excitatory neurotransmitter in the mammalian brain. Gamma-aminobutyric acid (GABA), synthesized from glutamate by the enzyme glutamate decarboxylase, is the major inhibitory neurotransmitter. Tryptophan is the precursor of serotonin, a neurotransmitter involved in mood regulation, and melatonin, a hormone that regulates circadian rhythms.
Glycine participates in the synthesis of heme (the iron-containing group in hemoglobin), purines (components of DNA and RNA), and the antioxidant glutathione. Histidine is the precursor of histamine, a signaling molecule involved in immune responses and stomach acid secretion. Arginine is the substrate for nitric oxide synthase, which produces nitric oxide (NO), a gaseous signaling molecule that relaxes blood vessels and plays roles in immune defense and neural signaling.
Amino acids also serve as metabolic fuels. When dietary protein exceeds the body's needs for protein synthesis, or during fasting when protein is broken down to supply energy, amino acids are deaminated and their carbon skeletons enter central metabolic pathways. Some are glucogenic, meaning their carbons can be converted to glucose through gluconeogenesis. Others are ketogenic, meaning they produce acetyl-CoA or acetoacetate, which can be used for fatty acid synthesis or ketone body production. Several amino acids are both glucogenic and ketogenic.
Post-Translational Modifications
After a protein is synthesized, many of its amino acid residues can be chemically modified, expanding the functional repertoire of the proteome far beyond the 20 standard amino acids. Phosphorylation of serine, threonine, or tyrosine by protein kinases is one of the most common modifications, serving as a molecular switch that turns protein activity on or off. An estimated one-third of all human proteins are phosphorylated at some point during their functional lifetime.
Glycosylation, the attachment of sugar chains to asparagine (N-linked) or serine and threonine (O-linked) residues, is important for protein folding, stability, and cell-cell recognition. Most secreted and membrane-bound proteins are glycosylated. Acetylation of lysine residues on histone proteins plays a critical role in regulating gene expression by altering how tightly DNA is packaged around the histone core. Methylation, ubiquitination, and SUMOylation are additional modifications that regulate protein function, localization, and degradation.
Amino acids are the monomers of proteins, each defined by a unique side chain that determines its chemical properties. Their classification into nonpolar, polar, charged, and special groups explains how proteins fold, function, and interact with their environment. Beyond protein structure, amino acids serve as neurotransmitter precursors, metabolic fuels, and substrates for essential biosynthetic pathways.