A3.2.1 Need for classification of organisms
A3.2.2 Difficulties classifying organisms into the traditional hierarchy of taxa
A3.2.3 Advantages of classification corresponding to evolutionary relationships
A3.2.4 Clades as groups of organisms with common ancestry and shared characteristics
Classification is a system for arranging organisms into groups according to shared traits or shared evolutionary origin. Biology needs it because life is so diverse: millions of species have already been described, and many more are discovered or detected every year.
A good classification system does more than keep things neatly sorted. Once an organism has been classified, its name and group membership help biologists find stored information about it, compare it with related organisms, identify unknown specimens more efficiently, and make sensible starting predictions about its anatomy, physiology, ecology and evolution.
Take an unknown animal. If it is placed step by step into broader-to-narrower groups, each decision cuts down the number of possible identities. Eukaryote animal mammal carnivore mustelid genus species is far more useful than starting with “some animal”. That’s the practical power of classification: it turns overwhelming diversity into searchable, testable order.

There is a nice philosophical edge here. Humans invent classification systems, but in biology the aim is to make them match something real: the history of life. A cloud classification is useful if it predicts weather; a biological classification is strongest if it reflects ancestry and therefore helps us predict shared characteristics.
So the guiding question “What tools are used to classify organisms into taxonomic groups?” has more than one answer. Traditionally, visible morphology was the main tool. Today, morphology still matters, especially for fossils, but DNA base sequences and protein amino acid sequences provide more objective evidence for many living organisms.
A taxon is a named classification group containing organisms judged to belong together. Taxonomy is the branch of biology that identifies, names and classifies organisms into taxa.
The familiar hierarchy uses fixed ranks:
A genus contains one or more species; a family contains one or more genera, and the pattern continues upward. As you move up the hierarchy, each taxon usually includes more species, but those species tend to share fewer traits.
Same seven fixed taxonomic ranks applied to two contrasting organisms.
| Rank | Human | Maize plant |
|---|---|---|
| Kingdom | Animalia | Plantae |
| Phylum | Chordata | Magnoliophyta |
| Class | Mammalia | Liliopsida |
| Order | Primates | Poales |
| Family | Hominidae | Poaceae |
| Genus | Homo | Zea |
| Species | Homo sapiens | Zea mays |
Evolution doesn’t produce neat, evenly spaced steps called “family”, “order” and “class”. Lineages diverge gradually. At first, a group of species may seem similar enough to stay in one genus. Later, after more divergence, taxonomists may split them into separate genera. There is no objective instant when “genus-level difference” turns into “family-level difference”.
This is the boundary paradox: a classification problem where gradual evolutionary divergence has to be fitted into sharp named ranks. Two taxonomists may agree about which species are related, yet disagree about the rank that relationship deserves. That isn’t bad science. It shows that the ranking system is partly arbitrary.

A paradigm shift is a major change in the framework scientists use to explain and investigate evidence. The shift from fixed ranked taxonomy to cladistic classification is a good example.
Cladistics is a method of classification that groups organisms according to common ancestry and shared derived characteristics. Instead of asking, “Is this group different enough to be a family?”, cladistics asks, “Which organisms form a branch of evolutionary history?” Cladistics can use unranked clades, so every group does not have to be squeezed into kingdom, phylum, class and the rest.
Traditional taxonomy is still useful. It remains widely used and convenient. Its fixed ranks, though, do not always match the patterns of divergence generated by evolution, which is why cladistics offers a different approach rather than just a minor tweak.
Classification works best when it reflects evolutionary relationships. In an ideal taxonomic group, every member evolved from a common ancestor, and the group includes all descendants of that ancestor.
That matters because inherited characteristics don't appear at random. Organisms with a recent common ancestor are likely to share many structural, biochemical and developmental features. Shared derived characteristics are traits inherited from a common ancestor that help identify a related group.

Classification has real power when it helps us predict. If a newly discovered organism is confidently classified as a mammal, we can predict features such as mammary glands, hair, internal fertilization and a four-chambered heart. If a newly described plant is placed in a genus known to produce a particular class of defensive chemicals, researchers may investigate related species for similar compounds.
Notice the wording here: classification allows predictions, not guarantees. Evolution can involve loss of traits, convergent similarities, and unusual exceptions. Still, classification based on ancestry gives a much better first hypothesis than classification based only on a few superficial similarities.
This links back to persistence and extinction from the previous topic. Related species may share adaptations that help them persist in similar environments, but they may share vulnerabilities too. Classification can therefore guide ecological and conservation research.
A clade is a group of organisms that includes a common ancestor and all of its descendants. Learn that wording carefully: a clade does not just mean “similar organisms”. It means an ancestry group.
Clades can be huge, or they can be small. A large clade may contain thousands of living species plus many extinct species. A small clade may contain only a few living species. Extinct organisms still count if they descended from the same common ancestor; we just often have limited evidence for them, usually from fossils.
The most objective evidence for placing organisms in the same clade usually comes from molecular sequences. A base sequence is the order of nucleotide bases in a DNA or RNA molecule. An amino acid sequence is the order of amino acids in a polypeptide or protein. Closely related organisms tend to have more similar sequences because less time has passed for differences to accumulate since divergence.
Morphology still matters. A morphological trait is a structural feature of an organism’s body or body parts. These traits are especially useful when classifying extinct species, because DNA and protein sequences are often unavailable from fossils.
Every organism belongs to many clades at the same time. Smaller clades fit inside larger clades, like folders inside folders. For example, a species may sit in a clade with its closest relatives, then in a larger clade containing a wider group, and then in a still larger clade containing all members of a major lineage.

This nesting is one way cladistics differs from traditional taxonomy. Cladistics does not require every branch to have a fixed rank label. What matters is the branching pattern of common ancestry.
A mutation is a heritable change in the nucleotide sequence of genetic material. Mutations introduce differences in DNA base sequences; some DNA changes can change the amino acid sequences in proteins too. After two lineages split from a common ancestor, these sequence differences tend to build up over long periods.
A molecular clock is a method for estimating the time since two lineages diverged by comparing the number of sequence differences between them. The idea is fairly simple: two species with more sequence differences probably separated longer ago than two species with fewer differences.

The molecular clock rests on one assumption: sequence changes accumulate at a roughly steady rate. Real organisms don't always work that neatly. Mutation rates and fixation rates vary, and generation time, population size, intensity of selective pressure and other factors can affect them.
With short generation times, there can be more rounds of DNA replication per unit time, so there may be more chances for mutation. Population size affects whether mutations are retained or lost by chance. Strong selection can quickly remove harmful variants or favour beneficial ones, changing the apparent rate of change in particular genes.
Molecular clocks are useful, but they have some wobble in the mechanism. They provide estimates of divergence times, not exact dates. Where possible, the best studies calibrate molecular evidence against independent evidence, such as fossils or well-dated geological events.
A cladogram is a branching diagram that represents a hypothesis about evolutionary relationships among organisms or clades. Sequence data give biologists much of the evidence used to build cladograms, because related organisms inherit DNA from common ancestors.
The basic procedure is fairly simple. Biologists compare the same gene, or the same protein, in different organisms. They align the sequences so equivalent positions can be compared. Fewer differences usually point to a more recent common ancestor; more differences usually point to an older divergence.

In classroom examples, the data may be kept deliberately simple: a few short DNA sequences or amino acid sequences, with differences counted by eye. In research, software compares much longer sequences, often using many genes at once.
Parsimony analysis is a method for choosing among possible cladograms by selecting the one that explains the observed sequence differences with the smallest number of evolutionary changes. It gives a criterion for judgement. It doesn't prove the answer.
Different criteria can produce different hypotheses. One method might prioritize the fewest mutations; another might model different mutation rates, or give extra weight to certain sequence positions. In IB Biology, the key point is that parsimony chooses the simplest explanation that accounts for the data.

Here is the catch I always want students to remember: evolution does not promise to take the simplest route. A base could change, change back, or change again in the same lineage. Parsimony may hide that complexity by counting it as one apparent change. For that reason, a cladogram is best treated as a well-supported hypothesis, especially when several independent genes or proteins give the same branching pattern.
This shows a wider kind of biological disagreement: scientists may accept the same evidence but use different criteria for judgement, leading to different proposed relationships.
When you analyse a cladogram, read the branching pattern first, not the left-to-right order of the names. Terminal labels can often be rotated around nodes without changing the relationships.
A root is the base of a cladogram that represents the hypothetical common ancestor of all the organisms or clades shown. A node is a branching point on a cladogram that represents a hypothetical common ancestor where one lineage split into two or more lineages. A terminal branch is an end branch of a cladogram that represents a taxon being compared, such as a species or a larger clade.

Two taxa are more closely related if they share a more recent node. Trace each taxon back until their branches meet: the more recent the meeting point, the closer the relationship shown by the cladogram.
You should be able to deduce:
Some cladograms use branch lengths proportional to time or number of sequence changes. Others don’t. Never assume distance on the page means time unless the diagram tells you it is scaled.
A cladogram gives a hypothesis about phylogeny, not a photograph of the past. Strong evidence may support it, but researchers still build it from evidence and assumptions. When different genes produce the same pattern, confidence increases. When different datasets conflict, biologists investigate why: incomplete data, convergent evolution, different mutation rates, or limitations of the method may be involved.
Cladistics can test whether a traditional classification really matches evolutionary relationships. Since gene sequencing became widely available, many long-standing groupings have been compared with DNA evidence. Some have held up; others have needed revision.
The key question is whether the named group is a true clade. If a traditional family includes species that do not all descend from a single common ancestor within that group, the classification gives a misleading picture. The reverse can happen too: species that do share a common ancestor may have been placed in separate groups, which can also lead to reclassification.

The figwort family, Scrophulariaceae, gives a good example. It was once treated as a very large flowering plant family, mainly on the basis of morphology. Cladistic analysis showed that the traditional family was not a single clade. Several groups were moved into other families, and some smaller families were merged with the remaining figwort group.
You don't need to memorize the detailed transfers. The point is the principle: cladistic evidence can show that a familiar classification is false, so taxonomy is revised to fit evolutionary history more closely.
Falsifiability is the property of a scientific claim that it can, in principle, be shown to be false by evidence. Traditional classifications based on morphology can be falsified when molecular evidence shows a different pattern of ancestry.
One reason for this is convergent evolution, a process in which distantly related organisms evolve similar traits because they face similar selection pressures, not because they inherited the traits from a recent common ancestor. This addresses the linking question about similarities between distantly related organisms: similar niches can favour similar adaptations. Gliding membranes, streamlined bodies, spines, or insect-eating snouts can evolve independently in separate lineages.
Morphology is useful, but it needs careful interpretation. A similar appearance may point to common ancestry, or it may reflect similar selection pressures. Cladistics helps separate those possibilities.
When referring to organisms in examinations, either the common name or the scientific name is acceptable, provided it is clear which organism you mean.
Ribosomal RNA is an RNA molecule that forms part of ribosomes and takes part in protein synthesis. It helps with deep classification because all cellular organisms have ribosomes. Its sequences are conserved enough for comparison across very different organisms, but still variable enough to show evolutionary divergence.
For many years, organisms were often split into two broad cell types: prokaryotes and eukaryotes. Then rRNA sequencing changed the picture. The organisms once grouped together as “prokaryotes” turned out to include two deeply different lineages.
A domain is the highest widely used taxonomic rank, placed above kingdom, that groups organisms according to very deep evolutionary relationships. The three-domain system classifies all organisms into:

This reclassification, proposed in 1977, was revolutionary because it added a taxonomic level above kingdoms and changed how biologists understood the deepest branches of life. The key evidence was not general appearance. It was comparison of rRNA base sequences.
The three-domain system gives a clear final example of the topic’s main message: classification improves when it follows evolutionary relationships, and molecular evidence can overturn classifications that once seemed obvious.