B1.2.1
Generalized structure of an amino acid
B1.2.2
Condensation reactions forming dipeptides and longer chains of amino acids
B1.2.3
Dietary requirements for amino acids
B1.2.4
Infinite variety of possible peptide chains
B1.2.1
An amino acid is an organic molecule with both an amine group and a carboxyl group attached to a central carbon atom. It also has a variable side chain, which gives the amino acid its particular chemical properties. Amino acids are the monomers used to build polypeptides and proteins.
The central carbon is the alpha carbon. In a generalized amino acid, this carbon atom is bonded to four different groups:
When drawing the generalized amino acid, place the alpha carbon in the centre and join all four groups to it with single covalent bonds. A clear written form is usually , but in a diagram the amine group, carboxyl group, hydrogen and R-group should all be shown explicitly. The R-group isnât a mysterious extra atom; itâs the part that varies from one amino acid to another.

The carboxyl group can act as an acid, and the amine group can act as a base, so amino acids respond to the pH of their surroundings. This matters later because protein shape depends on charges and interactions within the molecule.
B1.2.2
A condensation reaction is a chemical reaction where two molecules become covalently linked and a small molecule, usually water, is released. Amino acids join in this way.
The word equation is:
amino acid + amino acid dipeptide + water
A dipeptide is a molecule made of two amino acid residues joined by one peptide bond. A peptide bond is a covalent CâN bond formed between the carboxyl group of one amino acid and the amine group of another. The âOH from the carboxyl group and an âH from the amine group are removed, forming water.

Longer chains form as more amino acids are added by further condensation reactions:
peptide + amino acid longer peptide + water
A polypeptide is a chain of many amino acid residues joined by peptide bonds. In cells, ribosomes catalyse peptide bond formation during translation. The chain has direction: one end has a free amine group, called the amino terminus, while the other has a free carboxyl group, called the carboxyl terminus.
When you draw a generalized dipeptide, focus on the backbone. Show the repeating NâCâC pattern, the peptide bond as CâN, and two R-groups projecting from the alpha carbons. In the middle of the molecule, the original amine and carboxyl groups are no longer both intact, because their atoms have been used to make the peptide bond and water.
A residue is the part of an amino acid that remains incorporated in a peptide chain after condensation has removed atoms to form water. The R-group belongs to an amino acid before joining; once the amino acid is in the chain, it is part of an amino acid residue.
B1.2.3
An essential amino acid is an amino acid that an organism cannot synthesize in sufficient quantity and therefore must obtain from food. A non-essential amino acid is an amino acid that an organism can synthesize from other molecules, including other amino acids.
Humans use 20 amino acids in polypeptides built by ribosomes. Some have to come ready-made from the diet. Others are made through metabolic pathways that modify molecules already present in the body. Donât confuse âessentialâ with âmore importantâ. All 20 are needed for protein synthesis. âEssentialâ only tells you whether the diet must supply that amino acid.
Dietary protein is broken down into amino acids and small peptides. These are absorbed and then used to build the bodyâs own proteins. A food may contain plenty of total protein but still be low in one essential amino acid, so protein quality depends on the balance of amino acids, not just the number of grams of protein.
Vegan diets can meet amino acid requirements, but they need some attention because plant foods differ in their amino acid profiles. The practical point is simple: a varied diet with suitable combinations of plant protein sources is needed so that all essential amino acids are supplied over time.
B1.2.4
The genetic code specifies 20 amino acids for protein synthesis. In principle, a ribosome can put any one of these amino acids into any position in a peptide chain. Some chains are tiny. Others contain hundreds or thousands of amino acid residues, and the order can change at every position.
Possible sequences multiply fast as chain length increases. For a chain of amino acid residues,
A dipeptide has possible sequences; a tripeptide has ; a protein-length polypeptide gives a number so large that âeffectively infiniteâ is fair classroom language.

One useful distinction: ribosomes don't make random chains just because many chains are possible. Genetic information determines the sequence. That connects the genome, which is the complete set of genetic information in an organism, with the proteome, which is the complete set of proteins produced by a cell, tissue or organism under particular conditions.
Examples of polypeptides show the range more clearly. Some peptide hormones are short signalling molecules. Insulin is a small protein made from two polypeptide chains. Many enzymes are single polypeptides of several hundred residues. Some structural muscle proteins are enormous chains containing tens of thousands of residues. The same basic peptide-bond chemistry supports all of that variety.
B1.2.5
A protein usually works only if its three-dimensional shape is right. Interactions within the molecule hold that shape, and many of these are weak on their own. Temperature and pH are abiotic factors that can disturb those interactions, changing the proteinâs molecular form.
Denaturation is a structural change in a protein in which its normal three-dimensional shape is disrupted enough to reduce or destroy its function, without necessarily breaking the peptide bonds of the primary chain. A denatured enzyme may therefore still have the same amino acid sequence, but its active site no longer has the right shape.
When temperature rises, atoms and groups within a protein gain kinetic energy. They move more, so the bonds and interactions holding the protein in shape experience more vibration and strain. Above a certain point, the protein unfolds or changes conformation. Many soluble proteins then become insoluble, because hydrophobic regions that were previously hidden inside the protein become exposed to water.
Some proteins from organisms that live in very hot environments are unusually heat-stable, so one temperature should not be treated as a universal denaturation point. Even so, high temperature visibly denatures many familiar proteins. Egg white turning opaque and solid is the kitchen version of the same principle.
Changing pH changes the availability of hydrogen ions. That can alter the charges on acidic and basic groups in a protein, especially in R-groups. When charges change, ionic bonds may break, and new attractions or repulsions may form. The protein may then change shape. Extreme acidity and extreme alkalinity can both denature proteins.
How temperature and pH changes can denature proteins.
| Factor | Native region | Denaturing region | Why structure changes |
|---|---|---|---|
| Temperature / °C | Low to moderate, e.g. 20â40 °C | High, often above ~60 °C for familiar proteins | More kinetic energy strains weak interactions; unfolding can expose hydrophobic regions and cause precipitation. |
| pH / dimensionless | Near the proteinâs optimum, e.g. about pH 7 for many proteins | Extreme acidic or alkaline pH, e.g. <4 or >10 | Hâș changes R-group charges; ionic bonds may break and new attractions or repulsions alter shape. |
One practical way to track denaturation is to measure turbidity. A colorimeter is an instrument that measures how much selected light passes through or is absorbed by a sample. If albumen protein denatures and precipitates, the mixture becomes cloudier. Less light is transmitted, and absorbance increases. In a temperature investigation, protein concentration and heating time should be controlled; in a pH investigation, buffer solutions are used so that pH is the variable being tested.
B1.2.6
After amino acids join to form a polypeptide, the repeating backbone stays broadly similar along the chain. Most of the chemical variation comes from the R-groups. Different R-groups create different local properties, and the order of those properties along the chain helps decide the proteinâs final form and function.
A hydrophobic R-group is a non-polar side chain that tends to avoid contact with water. A hydrophilic R-group is a polar or charged side chain that can interact favourably with water. Some hydrophilic R-groups are uncharged but polar; others carry charge.

Charged R-groups are often described using acid-base behaviour. An acidic R-group is a side chain that can donate a hydrogen ion and become negatively charged. A basic R-group is a side chain that can accept a hydrogen ion and become positively charged. For the syllabus, that level of detail is enough: you donât need to memorize named R-group structures, but you do need to understand why a chain with different R-groups can fold and interact in many different ways.
This chemical diversity is why proteins can act as enzymes, receptors, structural fibres, transport molecules and signals. Same backbone idea; different R-group sequence; different properties.
B1.2.7
Primary structure is the linear sequence of amino acid residues in a polypeptide chain. That doesnât just mean which amino acids are present; it means their exact order and position from one end of the chain to the other.
Conformation is the three-dimensional arrangement of atoms in a protein or polypeptide. Most proteins fold into precise, repeatable conformations because the R-groups in the primary structure interact with one another and with the surrounding environment in predictable ways.

Hereâs a major form-and-function link in biology. Change the primary structure, and you may change where hydrophobic groups cluster, where charged groups attract or repel, and where hydrogen bonds or disulfide bonds can form. Sometimes a small sequence change has little effect. Sometimes it alters the conformation enough to change function.
The genome-proteome relationship fits here. During protein synthesis, DNA sequence determines the amino acid sequence, and that amino acid sequence helps determine protein conformation. Gene expression regulation matters too, because a cellâs proteome changes depending on which polypeptides it is making.
B1.2.8
Secondary structure is a regular local folding pattern in a polypeptide backbone, held in place by hydrogen bonds. These hydrogen bonds form at regular positions between polar and â groups in the peptide backbone, rather than mainly between R-groups.
An alpha helix is a secondary structure where a polypeptide backbone coils into a spiral, stabilized by hydrogen bonds between nearby turns of the coil. The R-groups point outward from the helix, which is useful: they can still interact with the surroundings or with other parts of the protein.
A beta-pleated sheet is a secondary structure where extended sections of polypeptide lie alongside one another, with hydrogen bonds stabilizing the sections. The sheet looks âpleatedâ because the bond angles in the backbone create a folded, zigzag geometry instead of a flat ribbon.

Donât treat secondary structure as the whole protein shape. A protein may contain several alpha helices, several beta-pleated sheets, both, or neither. These regular regions act as building features within the larger three-dimensional structure.
B1.2.9
Tertiary structure is the overall three-dimensional folding of a single polypeptide chain. It comes from interactions between R-groups that may sit far apart in the primary sequence, then end up close together once the chain folds.
Four interaction types matter especially here:

pH matters here. Amine groups in R-groups can accept hydrogen ions and become positively charged. Carboxyl groups in R-groups can release hydrogen ions and become negatively charged. After that, these charged groups can form ionic bonds. If pH changes, the pattern of charges can change as well, so the tertiary structure can change too.
Many proteins contain secondary structures within their tertiary structure, so the levels of structure are not separate boxes in a real molecule. The final shape comes from the amino acid sequence, backbone folding and R-group interactions.
B1.2.10
A polar amino acid is an amino acid with an R-group that has an uneven distribution of charge and can interact with water. A non-polar amino acid is an amino acid with an R-group that lacks strong charge separation and tends to avoid water.
In water-soluble globular proteins, hydrophobic amino acids usually pack into the proteinâs core, away from water. Hydrophilic amino acids tend to sit on the surface instead, where they can form hydrogen bonds or ionic interactions with water and dissolved ions. That pattern helps the folded protein stay stable in an aqueous environment such as cytoplasm or blood plasma.

Integral membrane proteins have to fold for a different chemical environment. The centre of a phospholipid bilayer is hydrophobic, so the parts of an integral protein embedded in the membrane often have many hydrophobic amino acids on the outside surface of that region. These amino acids help the protein embed in the membrane.
A transmembrane channel protein shows this arrangement neatly. Its outer surface, where it touches the membrane core, is hydrophobic. The channel lining, though, can be hydrophilic, allowing polar molecules or ions to pass through a route that would otherwise be blocked by the hydrophobic membrane interior.
B1.2.11
Quaternary structure describes the three-dimensional arrangement of multiple subunits in a protein. These subunits may be polypeptide chains; some proteins also include non-polypeptide components.
A non-conjugated protein contains only polypeptide subunits. Insulin is non-conjugated because it has two polypeptide chains held together, including by disulfide bonds. Collagen is non-conjugated too: it is made from three polypeptide chains wound into a strong rope-like structure.
A conjugated protein includes one or more non-polypeptide components as well as its polypeptide chains. Haemoglobin is the key example. It has four polypeptide subunits, each associated with a haem group. The haem group contains iron and binds oxygen, so this non-polypeptide component is essential for haemoglobinâs transport function.

Protein molecules are far smaller than the unaided eye can see, and many are too small for ordinary light microscopy. Cryogenic electron microscopy rapidly freezes biological molecules and images them with electrons, allowing scientists to study very small protein structures and protein interactions.

This works well as a Nature of Science example: technology changes what counts as observable. Once scientists can image single protein molecules and their interactions, they can link structure to function much more directly, including changes in conformation during molecular activity.
B1.2.12
A globular protein is a protein with a compact, roughly rounded three-dimensional shape produced by folding of one or more polypeptide chains. Because globular proteins often have precise binding sites or active sites, small details of conformation matter.
A fibrous protein is a protein with an elongated shape in which polypeptide chains form fibres or filaments suited to structural roles. These proteins are generally built for strength, support or elasticity, rather than for binding a specific small molecule at a pocket.

Insulin shows how globular form links to function. Its specific conformation lets it bind to the insulin receptor on target cells. Binding happens because the shape and chemical properties of insulin match the receptorâs binding site closely enough to send a clear signal.
Collagen shows the fibrous pattern. Three polypeptide chains wind together into a triple helix, making a rope-like molecule with high tensile strength. That shape suits collagen for resisting pulling forces in tissues such as skin, tendons and ligaments.
So the exam sentence you should be able to build is not just âshape affects functionâ. Be specific: globular proteins fold into compact shapes with precise binding surfaces, while fibrous proteins form elongated structures whose shape gives mechanical strength.