Clastify logo
Clastify logo
Exam prep
Exemplars
Review
HOT
Back to D: Change

D1.2: Protein synthesis

Master IB Biology D1.2: Protein synthesis with notes created by examiners and strictly aligned with the syllabus.

IB Syllabus Requirements for Protein synthesis

D1.2.1 Transcription as the synthesis of RNA using a DNA template

D1.2.2 Role of hydrogen bonding and complementary base pairing in transcription

D1.2.3 Stability of DNA templates

D1.2.4 Transcription as a process required for the expression of genes

DNA is a nucleic acid polymer made of deoxyribonucleotides, with genetic information stored in its base sequence. RNA is a nucleic acid polymer made of ribonucleotides, and it can carry or use genetic information inside a cell. Transcription is the enzyme-catalysed synthesis of RNA using one DNA strand as a template.

A template strand is the DNA strand copied by complementary RNA nucleotides during transcription. For any particular gene, only one of the two DNA strands is used. The other strand is often called the coding strand, because its base sequence matches the RNA transcript except that DNA has thymine where RNA has uracil.

RNA polymerase is the enzyme that catalyses transcription. It locally separates the DNA strands, lines up RNA nucleotides against the template strand, and joins them into a sugar-phosphate backbone. As it moves along the DNA template, the new RNA strand grows nucleotide by nucleotide. When it reaches the end of the transcribed region, the RNA molecule is released and the DNA double helix reforms. Image

A gene is a sequence of DNA bases with a functional product, usually a polypeptide or a functional RNA. The same gene can be transcribed many times, so one stable DNA sequence can produce many RNA copies. In this way, cells turn stored genetic information into working molecules without using up the original DNA.

Hydrogen bonding is a weak intermolecular attraction where a hydrogen atom covalently bonded to an electronegative atom is attracted to another electronegative atom. In nucleic acids, hydrogen bonds let bases pair in specific ways.

Complementary base pairing is the specific pairing of nucleic acid bases caused by matching hydrogen-bonding patterns. In transcription, cytosine pairs with guanine, guanine pairs with cytosine, thymine on the DNA template pairs with adenine on the RNA strand, and adenine on the DNA template pairs with uracil on the RNA strand. Say the last one carefully: DNA template A gives RNA U, not RNA T. Image

Transcription relies on these temporary hydrogen bonds. Together, they’re strong enough to hold the correct RNA nucleotide in place while RNA polymerase forms the covalent backbone, but weak enough for the RNA transcript to separate from DNA once it has been made.

Here, hydrogen bonding matters because it makes information copying accurate while still allowing the strands to separate when the process has finished.

During transcription, the DNA template is read, not rewritten. Its base sequence is the same before and after RNA polymerase moves along it. The two DNA strands separate only briefly; complementary base pairing then restores the double-stranded structure.

That matters because the same DNA sequence can be transcribed again and again. If transcription changed the template, later RNA copies would carry new errors. Over time, a cell would lose reliable access to the information it needs to make proteins.

This is especially serious in somatic cells, the body cells not used directly to form gametes. Some somatic cells do not divide, so they must conserve important DNA sequences for the whole life of the cell. Genetic continuity between cell generations, and within long-lived cells, depends on DNA acting as a stable information store while RNA serves as a temporary working copy.

Gene expression is the process where information in a gene is used to make a functional product that affects the cell or the organism. In most protein-coding genes, expression starts with transcription and then moves on to translation.

A cell doesn’t express every gene all the time. A liver cell, a neurone and a pancreatic beta cell have broadly the same genome, but they use different sets of genes. Part of a cell’s identity depends on which genes are being transcribed and which remain silent.

Transcription is the first stage in expression of a protein-coding gene, so it becomes a major control point. Switch transcription of a gene off, and no mRNA for that gene is made; the corresponding polypeptide cannot be produced. Switch it on, and mRNA is available for translation. Selective expression lets cells with the same inherited DNA become different in structure and function.

Messenger RNA (mRNA) is an RNA molecule that carries a base sequence copied from DNA to a ribosome, where it is used for polypeptide synthesis. Translation is the synthesis of a polypeptide, using the base sequence of mRNA to set the amino acid sequence.

A polypeptide is a polymer of amino acids joined by peptide bonds. During translation, the cell changes molecular “language”: a sequence of nucleotide bases in mRNA is read to build a sequence of amino acids in a polypeptide.

In eukaryotic cells, transcription occurs in the nucleus. The mRNA is then exported to the cytoplasm, where ribosomes translate it. Prokaryotic cells have no nucleus, but the same basic idea still applies: mRNA carries the information that directs the order of amino acids.

Translation depends on three main components working together.

mRNA provides the sequence of bases that will be read. It binds to the small subunit of a ribosome, so its base sequence sits in the right position for decoding.

A ribosome is a ribonucleoprotein complex that catalyses polypeptide synthesis by holding mRNA and tRNA molecules in position. Its small subunit binds mRNA, while its large subunit binds tRNA molecules. During translation, two tRNAs can bind to the large subunit at the same time, which lets the ribosome hold the growing chain and the next amino acid in the same catalytic space.

Transfer RNA (tRNA) is an RNA molecule that carries a specific amino acid and has a base triplet that pairs with mRNA. One end of the tRNA carries an amino acid; another part of the molecule contains the bases that recognize the mRNA sequence. The tRNA must be loaded with the correct amino acid, because the ribosome checks base pairing, not the chemical identity of the amino acid. Image

In plain classroom terms: mRNA is the message, tRNA is the adaptor, and the ribosome is the workbench and catalyst. Protein diversity in a cell starts here, since different mRNA sequences produce different amino acid sequences.

A codon is a three-base sequence in mRNA that specifies an amino acid or a stop signal during translation. An anticodon is a three-base sequence in tRNA that is complementary to an mRNA codon.

During translation, a tRNA anticodon pairs with the mRNA codon through complementary base pairing. In RNA-RNA pairing, adenine pairs with uracil, and cytosine pairs with guanine. If the anticodon does not match the codon closely enough, the ribosome does not hold the tRNA in the correct position. Image

Hydrogen bonding does another precise job here. It lets a tRNA check whether it fits the next codon, then lets that tRNA detach later as the ribosome moves along. Transcription and translation both use the same basic chemical principle: complementary bases can recognize one another reversibly.

The genetic code is the set of rules that mRNA codons use to specify amino acids or stop signals during translation.

RNA has four possible bases: adenine, uracil, cytosine and guanine. If the code used one base at a time, it could give only four meanings. Two bases would give sixteen. Cells must code for twenty amino acids, so the code has to use triplets. With three bases per codon, sixty-four codons are possible, enough for all the amino acids plus stop signals.

Most codons specify amino acids. A small number are stop signals; they don't code for an amino acid, but they help end translation.

Degeneracy is a feature of the genetic code in which more than one codon can specify the same amino acid. Because of this, some base substitutions do not change a polypeptide: the altered codon may still code for the same amino acid.

Universality is a feature of the genetic code in which almost all organisms use the same codon meanings. There are minor exceptions, but the broad pattern is shared across life. This universality provides evidence for common ancestry and helps explain why genes can sometimes be expressed in a different species.

A genetic code table uses mRNA codons. Not DNA triplets, and not tRNA anticodons. Check that first, before you start. Standard mRNA codon table. Cells show mRNA codon → amino acid; Stop is not an amino acid.

1st base3rd base2nd base U2nd base C2nd base A2nd base G
UUUUU → PheUCU → SerUAU → TyrUGU → Cys
UCUUC → PheUCC → SerUAC → TyrUGC → Cys
UAUUA → LeuUCA → SerUAA → StopUGA → Stop
UGUUG → LeuUCG → SerUAG → StopUGG → Trp
CUCUU → LeuCCU → ProCAU → HisCGU → Arg
CCCUC → LeuCCC → ProCAC → HisCGC → Arg
CACUA → LeuCCA → ProCAA → GlnCGA → Arg
CGCUG → LeuCCG → ProCAG → GlnCGG → Arg
AUAUU → IleACU → ThrAAU → AsnAGU → Ser
ACAUC → IleACC → ThrAAC → AsnAGC → Ser
AAAUA → IleACA → ThrAAA → LysAGA → Arg
AGAUG → Met/startACG → ThrAAG → LysAGG → Arg
GUGUU → ValGCU → AlaGAU → AspGGU → Gly
GCGUC → ValGCC → AlaGAC → AspGGC → Gly
GAGUA → ValGCA → AlaGAA → GluGGA → Gly
GGGUG → ValGCG → AlaGAG → GluGGG → Gly

To work out an amino acid sequence from mRNA, split the mRNA sequence into triplets from the given starting point. Then read the codons in order and use the table to find the amino acid or stop signal. For example, if an mRNA sequence begins AUG, the table gives methionine, and AUG also acts as a start codon in normal translation.

If the question gives you a DNA template strand and asks for mRNA, apply complementary base pairing: DNA template A gives RNA U, DNA T gives RNA A, DNA C gives RNA G, and DNA G gives RNA C. If it gives the coding strand of DNA instead, the mRNA sequence matches it except that U replaces T. Students often lose marks here not because the idea is difficult, but because they read the wrong molecule.

When you deduce a polypeptide, stop at a stop codon if one appears in the reading frame. A stop codon is not an amino acid, so don’t include it in the amino acid sequence.

Elongation is the repeated stage of translation where the polypeptide lengthens, one amino acid at a time. With each cycle, the ribosome moves along the mRNA by one codon, keeping the reading frame in groups of three bases.

A charged tRNA carrying the next amino acid binds to the ribosome through codon-anticodon pairing. The ribosome then catalyses a peptide bond between the amino acid on this newly arrived tRNA and the growing polypeptide chain. The chain is passed onto the tRNA in the next position.

Once the peptide bond has formed, the ribosome shifts one codon along the mRNA. The tRNA now carrying the growing polypeptide moves into the position where the chain is held ready for the next cycle. The empty tRNA leaves the ribosome and can be reused after another amino acid is loaded onto it. Image

For elongation, keep your attention on three linked events: codon recognition, peptide bond formation, and ribosome movement. This stepwise movement depends on direction, since the ribosome has to read the mRNA in one consistent direction to keep the amino acids in the correct order.

A mutation is a change in the base sequence of genetic material. A point mutation affects one nucleotide position in DNA.

After transcription, a point mutation can change an mRNA codon. If the new codon specifies a different amino acid, translation makes a polypeptide with an altered primary structure. Sometimes one amino acid change has little effect. In other cases, it can change protein folding, bonding, solubility or the active site.

A clear example occurs in the beta-globin gene. A DNA base substitution changes a codon, so the mRNA codon specifies valine instead of glutamic acid in the beta-globin polypeptide. The surface properties of haemoglobin change as a result. In low oxygen conditions, the altered haemoglobin molecules can stick together and form fibres, which distort red blood cells into a sickle shape. Image

This example connects protein synthesis with genetic continuity. If the mutation is present in a germ-line cell that forms a gamete, it can pass to the next generation. The inherited DNA sequence then affects the mRNA sequence, amino acid sequence, protein structure and, finally, the phenotype.

Directionality is the property of a nucleic acid strand having chemically different ends that determine the direction in which enzymes work. DNA and RNA strands have a 5′ end and a 3′ end because the sugar-phosphate backbone is asymmetric.

During 5′ to 3′ transcription, RNA polymerase builds the RNA strand by adding each incoming RNA nucleotide to the 3′ end of the growing RNA. The RNA transcript therefore grows in the 5′ to 3′ direction, while the DNA template is read the other way.

In 5′ to 3′ translation, the ribosome travels along the mRNA from its 5′ end towards its 3′ end. Codons are read in that order, so direction affects the amino acid sequence. If the same bases were read in the opposite direction, they would not produce the same polypeptide. Image

This answers the linking question about biological mechanisms relying on directionality. Protein synthesis has direction at both stages: RNA is made 5′ to 3′, and mRNA is translated 5′ to 3′. Directionality stops the information from being read ambiguously.

A promoter is a DNA sequence near a gene that gives the proteins needed to start transcription a place to bind. It is not translated into a polypeptide; it regulates the gene.

Transcription starts when RNA polymerase is recruited to the promoter. Often, transcription factors help control whether RNA polymerase can bind and begin transcription. These regulatory proteins bind specific DNA sequences. Some make transcription more likely; others reduce it or block it. Image

This is one way cells switch genes on and off at the transcription stage. The cell doesn’t have to change the gene itself to change gene expression. Instead, it can alter which transcription factors are present or active, and therefore which promoters are used.

A non-coding sequence is a DNA base sequence that is not translated into a polypeptide. Non-coding doesn’t mean useless. Some non-coding sequences have essential structural, regulatory or RNA-producing roles.

The examples to know are limited and specific:

  • Regulators of gene expression are non-coding DNA sequences that influence whether genes are transcribed, such as promoters and other regulatory binding regions.
  • Introns are non-coding sequences within eukaryotic genes that are transcribed into pre-mRNA but removed before translation.
  • Telomeres are repetitive non-coding DNA sequences at the ends of eukaryotic chromosomes that help protect chromosome ends.
  • Genes for rRNA and tRNA are DNA sequences whose products are functional RNAs, not polypeptides.

A genome, then, is not just a list of protein recipes. It also contains switches, spacers, chromosome-end protection and genes for RNA molecules that help make proteins.

Post-transcriptional modification is the processing of an RNA transcript after transcription and before it is used in translation. In eukaryotic cells, this happens before mature mRNA leaves the nucleus.

A newly transcribed eukaryotic RNA molecule from a protein-coding gene is often called pre-mRNA. For this topic, it undergoes three main modifications.

First, a 5′ cap is added to the 5′ end. The 5′ cap is a modified nucleotide structure attached to the 5′ end of eukaryotic mRNA that helps protect the transcript and supports later translation.

Next, a polyA tail is added to the 3′ end. The polyA tail is a sequence of many adenine nucleotides added to the 3′ end of eukaryotic mRNA that helps stabilize the transcript.

Finally, introns are removed and exons are joined. An exon is a sequence in a eukaryotic gene that remains in mature mRNA after splicing and can contribute to the translated sequence. Splicing is the RNA-processing step in which introns are removed and exons are joined to form mature mRNA. Image

The cap and tail protect mRNA from rapid breakdown. Splicing creates a continuous coding sequence for translation. Because transcription happens in the nucleus and translation happens in the cytoplasm, eukaryotic cells have time and space to process RNA before ribosomes read it.

Alternative splicing is a form of RNA processing where different combinations of exons from the same pre-mRNA are joined, producing different mature mRNA molecules.

The key point is straightforward: one gene can give rise to more than one polypeptide. The same DNA sequence is transcribed, but the mature mRNA changes depending on which exons are kept. Those different mature mRNAs are then translated into related polypeptides with different amino acid sequences. Image

This raises protein diversity without needing a separate gene for every polypeptide variant. It helps answer the linking question about how protein diversity contributes to cell function: by changing RNA processing, cells can make protein variants suited to different tissues, developmental stages or conditions.

Translation initiation sets the reading frame and puts the first tRNAs in the right position. Start in the wrong place, and every codon after that point is read incorrectly.

The small ribosomal subunit attaches to the 5′ terminal region of the mRNA. As it moves along the mRNA to find the start codon, usually AUG, an initiator tRNA carrying methionine pairs with it. The initiator tRNA has an anticodon complementary to AUG.

After the start codon is recognized, the large ribosomal subunit attaches. A second tRNA can then bind at the next codon, and elongation begins. Image

The large ribosomal subunit has three tRNA binding sites. The A site is the aminoacyl site, where the next tRNA carrying an amino acid enters. The P site is the peptidyl site, where the tRNA holding the growing polypeptide is positioned. The E site is the exit site, where a tRNA leaves after its amino acid has been transferred.

During elongation, tRNAs pass through these sites in order: A, then P, then E. This ordered movement gives another example of directionality in protein synthesis.

Many polypeptides don’t work straight after translation. Post-translational modification is the chemical or structural processing of a polypeptide after translation to produce its functional form.

A polypeptide may have amino acids removed, fold into a stable three-dimensional shape, form disulfide bonds, gain chemical groups or carbohydrate chains, or assemble with other polypeptides or non-polypeptide components. The exact modification depends on the protein.

Insulin is the required example. Pre-proinsulin is the initial inactive polypeptide translated from insulin mRNA. After removal of a signal sequence, it becomes proinsulin, an inactive precursor. Proinsulin folds and forms disulfide bonds. It is then cut again to remove a connecting segment, leaving the A chain and B chain held together by disulfide bonds. This final product is functional insulin. Image

The key point is the two-stage conversion: pre-proinsulin to proinsulin, then proinsulin to insulin. Translation supplies the amino acid chain, but modification produces the active hormone.

A proteome is the full set of proteins present in a cell, tissue or organism at a particular time. Keeping a proteome functional takes more than making proteins; cells also remove proteins that are damaged, misfolded or no longer needed.

A proteasome is a protein complex that breaks selected proteins into short peptides, so their amino acids can be recycled. Proteins marked for destruction are recognized, unfolded and fed into the proteasome, where protease active sites break peptide bonds. Image

The short peptides produced are broken down further into amino acids in the cytoplasm. The cell can then use those amino acids to synthesize new proteins.

This constant turnover isn’t wasteful. It’s quality control. Cells need continual protein breakdown and synthesis to keep the proteome matched to current needs. Protein diversity helps cells function, but that diversity has to be managed, refreshed and repaired throughout the life of the cell.

Were those notes helpful?

D1.1 DNA replication

D1.3 Mutations and gene editing