B1.2: Proteins

Exam prep

Exemplars

Review

HOT

Tutoring

Knowledge Hub

We're hiring a TikTok Content Creator (paid opportunity). Click here to learn more.

Master IB Biology B1.2: Proteins with notes created by examiners and strictly aligned with the syllabus.

Verified by Fatima F.

IB Syllabus Requirements for Proteins

B1.2.1

Generalized structure of an amino acid

B1.2.2

Condensation reactions forming dipeptides and longer chains of amino acids

B1.2.3

Dietary requirements for amino acids

B1.2.4

Infinite variety of possible peptide chains

B1.2.1

Generalized structure of an amino acid

The common plan of an amino acid

An amino acid is an organic molecule with both an amine group and a carboxyl group attached to a central carbon atom. It also has a variable side chain, which gives the amino acid its particular chemical properties. Amino acids are the monomers used to build polypeptides and proteins.

The central carbon is the alpha carbon. In a generalized amino acid, this carbon atom is bonded to four different groups:

an amine group, which is a nitrogen-containing group that can accept a hydrogen ion;
a carboxyl group, which is a carbon-containing acidic group that can donate a hydrogen ion;
a hydrogen atom;
an R-group, which is a variable side chain that differs between amino acids.

When drawing the generalized amino acid, place the alpha carbon in the centre and join all four groups to it with single covalent bonds. A clear written form is usually $H_2N-CH(R)-COOH$ , but in a diagram the amine group, carboxyl group, hydrogen and R-group should all be shown explicitly. The R-group isn’t a mysterious extra atom; it’s the part that varies from one amino acid to another.

The carboxyl group can act as an acid, and the amine group can act as a base, so amino acids respond to the pH of their surroundings. This matters later because protein shape depends on charges and interactions within the molecule.

B1.2.2

Condensation reactions forming dipeptides and longer chains of amino acids

Making peptide bonds

A condensation reaction is a chemical reaction where two molecules become covalently linked and a small molecule, usually water, is released. Amino acids join in this way.

The word equation is:

amino acid + amino acid $\to$ dipeptide + water

A dipeptide is a molecule made of two amino acid residues joined by one peptide bond. A peptide bond is a covalent C–N bond formed between the carboxyl group of one amino acid and the amine group of another. The –OH from the carboxyl group and an –H from the amine group are removed, forming water.

Longer chains form as more amino acids are added by further condensation reactions:

peptide + amino acid $\to$ longer peptide + water

A polypeptide is a chain of many amino acid residues joined by peptide bonds. In cells, ribosomes catalyse peptide bond formation during translation. The chain has direction: one end has a free amine group, called the amino terminus, while the other has a free carboxyl group, called the carboxyl terminus.

What to show in a generalized dipeptide

When you draw a generalized dipeptide, focus on the backbone. Show the repeating N–C–C pattern, the peptide bond as C–N, and two R-groups projecting from the alpha carbons. In the middle of the molecule, the original amine and carboxyl groups are no longer both intact, because their atoms have been used to make the peptide bond and water.

A residue is the part of an amino acid that remains incorporated in a peptide chain after condensation has removed atoms to form water. The R-group belongs to an amino acid before joining; once the amino acid is in the chain, it is part of an amino acid residue.

B1.2.3

Dietary requirements for amino acids

Essential and non-essential amino acids

An essential amino acid is an amino acid that an organism cannot synthesize in sufficient quantity and therefore must obtain from food. A non-essential amino acid is an amino acid that an organism can synthesize from other molecules, including other amino acids.

Humans use 20 amino acids in polypeptides built by ribosomes. Some have to come ready-made from the diet. Others are made through metabolic pathways that modify molecules already present in the body. Don’t confuse “essential” with “more important”. All 20 are needed for protein synthesis. “Essential” only tells you whether the diet must supply that amino acid.

Dietary protein is broken down into amino acids and small peptides. These are absorbed and then used to build the body’s own proteins. A food may contain plenty of total protein but still be low in one essential amino acid, so protein quality depends on the balance of amino acids, not just the number of grams of protein.

Vegan diets can meet amino acid requirements, but they need some attention because plant foods differ in their amino acid profiles. The practical point is simple: a varied diet with suitable combinations of plant protein sources is needed so that all essential amino acids are supplied over time.

B1.2.4

Infinite variety of possible peptide chains

Why peptide variety is effectively enormous

The genetic code specifies 20 amino acids for protein synthesis. In principle, a ribosome can put any one of these amino acids into any position in a peptide chain. Some chains are tiny. Others contain hundreds or thousands of amino acid residues, and the order can change at every position.

Possible sequences multiply fast as chain length increases. For a chain of $n$ amino acid residues,

$S = 20^n$

A dipeptide has $20^2$ possible sequences; a tripeptide has $20^3$ ; a protein-length polypeptide gives a number so large that “effectively infinite” is fair classroom language.

One useful distinction: ribosomes don't make random chains just because many chains are possible. Genetic information determines the sequence. That connects the genome, which is the complete set of genetic information in an organism, with the proteome, which is the complete set of proteins produced by a cell, tissue or organism under particular conditions.

Examples of polypeptides show the range more clearly. Some peptide hormones are short signalling molecules. Insulin is a small protein made from two polypeptide chains. Many enzymes are single polypeptides of several hundred residues. Some structural muscle proteins are enormous chains containing tens of thousands of residues. The same basic peptide-bond chemistry supports all of that variety.

B1.2.5

Effect of pH and temperature on protein structure

Protein shape depends on weak interactions

A protein usually works only if its three-dimensional shape is right. Interactions within the molecule hold that shape, and many of these are weak on their own. Temperature and pH are abiotic factors that can disturb those interactions, changing the protein’s molecular form.

Denaturation is a structural change in a protein in which its normal three-dimensional shape is disrupted enough to reduce or destroy its function, without necessarily breaking the peptide bonds of the primary chain. A denatured enzyme may therefore still have the same amino acid sequence, but its active site no longer has the right shape.

Temperature

When temperature rises, atoms and groups within a protein gain kinetic energy. They move more, so the bonds and interactions holding the protein in shape experience more vibration and strain. Above a certain point, the protein unfolds or changes conformation. Many soluble proteins then become insoluble, because hydrophobic regions that were previously hidden inside the protein become exposed to water.

Some proteins from organisms that live in very hot environments are unusually heat-stable, so one temperature should not be treated as a universal denaturation point. Even so, high temperature visibly denatures many familiar proteins. Egg white turning opaque and solid is the kitchen version of the same principle.

pH

Changing pH changes the availability of hydrogen ions. That can alter the charges on acidic and basic groups in a protein, especially in R-groups. When charges change, ionic bonds may break, and new attractions or repulsions may form. The protein may then change shape. Extreme acidity and extreme alkalinity can both denature proteins.

How temperature and pH changes can denature proteins.

Factor	Native region	Denaturing region	Why structure changes
Temperature / °C	Low to moderate, e.g. 20–40 °C	High, often above ~60 °C for familiar proteins	More kinetic energy strains weak interactions; unfolding can expose hydrophobic regions and cause precipitation.
pH / dimensionless	Near the protein’s optimum, e.g. about pH 7 for many proteins	Extreme acidic or alkaline pH, e.g. <4 or >10	H⁺ changes R-group charges; ionic bonds may break and new attractions or repulsions alter shape.

One practical way to track denaturation is to measure turbidity. A colorimeter is an instrument that measures how much selected light passes through or is absorbed by a sample. If albumen protein denatures and precipitates, the mixture becomes cloudier. Less light is transmitted, and absorbance increases. In a temperature investigation, protein concentration and heating time should be controlled; in a pH investigation, buffer solutions are used so that pH is the variable being tested.

B1.2.6

Chemical diversity in the R-groups of amino acids as a basis for the immense diversity in protein form and functionHL

R-groups make amino acids chemically different

After amino acids join to form a polypeptide, the repeating backbone stays broadly similar along the chain. Most of the chemical variation comes from the R-groups. Different R-groups create different local properties, and the order of those properties along the chain helps decide the protein’s final form and function.

A hydrophobic R-group is a non-polar side chain that tends to avoid contact with water. A hydrophilic R-group is a polar or charged side chain that can interact favourably with water. Some hydrophilic R-groups are uncharged but polar; others carry charge.

Charged R-groups are often described using acid-base behaviour. An acidic R-group is a side chain that can donate a hydrogen ion and become negatively charged. A basic R-group is a side chain that can accept a hydrogen ion and become positively charged. For the syllabus, that level of detail is enough: you don’t need to memorize named R-group structures, but you do need to understand why a chain with different R-groups can fold and interact in many different ways.

This chemical diversity is why proteins can act as enzymes, receptors, structural fibres, transport molecules and signals. Same backbone idea; different R-group sequence; different properties.

B1.2.7

Impact of primary structure on the conformation of proteinsHL

Sequence determines shape

Primary structure is the linear sequence of amino acid residues in a polypeptide chain. That doesn’t just mean which amino acids are present; it means their exact order and position from one end of the chain to the other.

Conformation is the three-dimensional arrangement of atoms in a protein or polypeptide. Most proteins fold into precise, repeatable conformations because the R-groups in the primary structure interact with one another and with the surrounding environment in predictable ways.

Here’s a major form-and-function link in biology. Change the primary structure, and you may change where hydrophobic groups cluster, where charged groups attract or repel, and where hydrogen bonds or disulfide bonds can form. Sometimes a small sequence change has little effect. Sometimes it alters the conformation enough to change function.

The genome-proteome relationship fits here. During protein synthesis, DNA sequence determines the amino acid sequence, and that amino acid sequence helps determine protein conformation. Gene expression regulation matters too, because a cell’s proteome changes depending on which polypeptides it is making.

B1.2.8

Pleating and coiling of secondary structure of proteinsHL

Regular folding patterns in the backbone

Secondary structure is a regular local folding pattern in a polypeptide backbone, held in place by hydrogen bonds. These hydrogen bonds form at regular positions between polar $C=O$ and $N$ – $H$ groups in the peptide backbone, rather than mainly between R-groups.

An alpha helix is a secondary structure where a polypeptide backbone coils into a spiral, stabilized by hydrogen bonds between nearby turns of the coil. The R-groups point outward from the helix, which is useful: they can still interact with the surroundings or with other parts of the protein.

A beta-pleated sheet is a secondary structure where extended sections of polypeptide lie alongside one another, with hydrogen bonds stabilizing the sections. The sheet looks “pleated” because the bond angles in the backbone create a folded, zigzag geometry instead of a flat ribbon.

Don’t treat secondary structure as the whole protein shape. A protein may contain several alpha helices, several beta-pleated sheets, both, or neither. These regular regions act as building features within the larger three-dimensional structure.

B1.2.9

Dependence of tertiary structure on hydrogen bonds, ionic bonds, disulfide covalent bonds and hydrophobic interactionsHL

Folding the whole polypeptide

Tertiary structure is the overall three-dimensional folding of a single polypeptide chain. It comes from interactions between R-groups that may sit far apart in the primary sequence, then end up close together once the chain folds.

Four interaction types matter especially here:

A hydrogen bond is an attraction in which a hydrogen atom covalently bonded to an electronegative atom is attracted to another electronegative atom. In tertiary structure, hydrogen bonds can form between polar R-groups.
An ionic bond is an electrostatic attraction between oppositely charged groups. In proteins, these can form between positively and negatively charged R-groups.
A disulfide bond is a covalent bond between sulfur atoms in two cysteine residues. It is stronger than the other interactions listed here and can hold parts of a polypeptide, or different polypeptides, together firmly.
A hydrophobic interaction is the clustering of non-polar groups away from water. It is not a single bond in the same sense as a covalent bond; instead, hydrophobic R-groups tend to pack together when water is present.

pH matters here. Amine groups in R-groups can accept hydrogen ions and become positively charged. Carboxyl groups in R-groups can release hydrogen ions and become negatively charged. After that, these charged groups can form ionic bonds. If pH changes, the pattern of charges can change as well, so the tertiary structure can change too.

Many proteins contain secondary structures within their tertiary structure, so the levels of structure are not separate boxes in a real molecule. The final shape comes from the amino acid sequence, backbone folding and R-group interactions.

B1.2.10

Effect of polar and non-polar amino acids on tertiary structure of proteinsHL

Water-soluble proteins and membrane proteins fold differently

A polar amino acid is an amino acid with an R-group that has an uneven distribution of charge and can interact with water. A non-polar amino acid is an amino acid with an R-group that lacks strong charge separation and tends to avoid water.

In water-soluble globular proteins, hydrophobic amino acids usually pack into the protein’s core, away from water. Hydrophilic amino acids tend to sit on the surface instead, where they can form hydrogen bonds or ionic interactions with water and dissolved ions. That pattern helps the folded protein stay stable in an aqueous environment such as cytoplasm or blood plasma.

Integral membrane proteins have to fold for a different chemical environment. The centre of a phospholipid bilayer is hydrophobic, so the parts of an integral protein embedded in the membrane often have many hydrophobic amino acids on the outside surface of that region. These amino acids help the protein embed in the membrane.

A transmembrane channel protein shows this arrangement neatly. Its outer surface, where it touches the membrane core, is hydrophobic. The channel lining, though, can be hydrophilic, allowing polar molecules or ions to pass through a route that would otherwise be blocked by the hydrophobic membrane interior.

B1.2.11

Quaternary structure of non-conjugated and conjugated proteinsHL

Proteins with more than one part

Quaternary structure describes the three-dimensional arrangement of multiple subunits in a protein. These subunits may be polypeptide chains; some proteins also include non-polypeptide components.

A non-conjugated protein contains only polypeptide subunits. Insulin is non-conjugated because it has two polypeptide chains held together, including by disulfide bonds. Collagen is non-conjugated too: it is made from three polypeptide chains wound into a strong rope-like structure.

A conjugated protein includes one or more non-polypeptide components as well as its polypeptide chains. Haemoglobin is the key example. It has four polypeptide subunits, each associated with a haem group. The haem group contains iron and binds oxygen, so this non-polypeptide component is essential for haemoglobin’s transport function.

Technology and seeing protein structure

Protein molecules are far smaller than the unaided eye can see, and many are too small for ordinary light microscopy. Cryogenic electron microscopy rapidly freezes biological molecules and images them with electrons, allowing scientists to study very small protein structures and protein interactions.

This works well as a Nature of Science example: technology changes what counts as observable. Once scientists can image single protein molecules and their interactions, they can link structure to function much more directly, including changes in conformation during molecular activity.

B1.2.12

Relationship of form and function in globular and fibrous proteinsHL

Shape suits job

A globular protein is a protein with a compact, roughly rounded three-dimensional shape produced by folding of one or more polypeptide chains. Because globular proteins often have precise binding sites or active sites, small details of conformation matter.

A fibrous protein is a protein with an elongated shape in which polypeptide chains form fibres or filaments suited to structural roles. These proteins are generally built for strength, support or elasticity, rather than for binding a specific small molecule at a pocket.

Insulin shows how globular form links to function. Its specific conformation lets it bind to the insulin receptor on target cells. Binding happens because the shape and chemical properties of insulin match the receptor’s binding site closely enough to send a clear signal.

Collagen shows the fibrous pattern. Three polypeptide chains wind together into a triple helix, making a rope-like molecule with high tensile strength. That shape suits collagen for resisting pulling forces in tissues such as skin, tendons and ligaments.

So the exam sentence you should be able to build is not just “shape affects function”. Be specific: globular proteins fold into compact shapes with precise binding surfaces, while fibrous proteins form elongated structures whose shape gives mechanical strength.

B1.1 Carbohydrates and lipids

B2.1 Membranes and membrane transport

The Fast Track To Your
Best IB Coursework & College Essays

Products

New

Exemplars

IA Exemplars

EE Exemplars

TOK Exemplars

Common App Essay Exemplars

Supplements Essay Exemplars

Company

Legal

All content on this website has been developed independently from and is not endorsed by the International Baccalaureate Organization. International Baccalaureate and IB are registered trademarks owned by the International Baccalaureate Organization.