Master IB Physics E.5: Fusion and stars with notes created by examiners and strictly aligned with the syllabus.
E.5.1 Stellar stability as a balance of pressures
E.5.2 Fusion as a source of energy in stars
E.5.3 Conditions for fusion in stars
E.5.4 The effect of stellar mass on stellar evolution
A star is a self-gravitating sphere of hot plasma that emits radiation because energy is generated in its interior. For most of its life, especially on the main sequence, a star stays roughly the same size because inward and outward effects balance.
Hydrostatic equilibrium is a mechanical equilibrium in a fluid body in which the inward gravitational compression is balanced by an outward pressure gradient. In a star, gravity pulls each layer inward, while hot gas pressure and radiation pressure push outward. The surface is not “held up” by anything solid; pressure from hotter, deeper layers supports it.

Thermal pressure is a pressure produced by the random motion of particles in a gas or plasma. Radiation pressure is a pressure produced when photons transfer momentum to matter. In a stellar core, matter and radiation are both energetic, so both can help support the star against gravity.
A star is not an ideal gas sitting in a school laboratory. Even so, gas models give the right first instinct. When a gas cloud contracts, gravitational potential energy is transferred into internal energy, so the temperature rises. A hotter gas has particles with larger average kinetic energy, giving a greater pressure for a given density.
That is why the gas laws can model stars: they connect pressure, temperature and particle number density. Their limitation matters too. Stellar material is an ionized plasma, photons carry energy and momentum, and nuclear reactions change the composition. The model helps because it captures the pressure–temperature connection, not because a star is literally a simple ideal gas sample.
The balance in a star is a large-scale equilibrium between gravity and pressure. A nucleus is stable for a different reason: the short-range strong nuclear interaction can overcome electric repulsion between protons at nuclear separations. If only gravity and electric forces were available, ordinary nuclei could not exist. Gravity between nucleons is far too weak, and the electric force between protons is repulsive. Stars and nuclei both involve competing interactions, but they operate on very different length scales, with different dominant forces.
Nuclear fusion is a nuclear reaction where light nuclei join to make a heavier nucleus with a larger binding energy per nucleon. Stars release energy by fusion because the products are more tightly bound than the reactants; the missing mass comes out as energy carried by particles and photons.
For energy release calculations, use ΔE = Δm c², where ΔE is the energy released (J), Δm is the decrease in rest mass (kg) and c is the speed of light in vacuum (m s⁻¹). In nuclear calculations you may also use 1 u c² = 931.5 MeV, where u is the atomic mass unit (kg) and MeV is mega-electronvolt, a nuclear energy unit with 1 MeV = 1.60 × 10⁻¹³ J.
In Sun-like main-sequence stars, the proton–proton chain is the dominant fusion process. Taken as a whole, four hydrogen nuclei are converted into one helium nucleus, with positrons, electron neutrinos and energy also produced:
4 ¹₁H → ⁴₂He + 2 β⁺ + 2 νₑ + energy
A commonly quoted total energy release is about 26.7 MeV per completed helium nucleus, although neutrinos carry away some of this energy.

The usual simplified chain has three stages:
¹₁H + ¹₁H → ²₁H + β⁺ + νₑ: two protons form deuterium, and a positron plus an electron neutrino are emitted. This is the slow stage, because it needs a weak-interaction change from proton to neutron.¹₁H + ²₁H → ³₂He + γ: a proton fuses with deuterium to form helium-3 and a gamma photon.³₂He + ³₂He → ⁴₂He + ¹₁H + ¹₁H: two helium-3 nuclei form helium-4 and return two protons to the plasma.A neutrino is a neutral lepton that interacts only very weakly with matter. Solar neutrinos give direct evidence that fusion is happening in the Sun’s core, since they escape from the core with very little interaction. Historically, detectors found fewer electron neutrinos than expected; the solution was that neutrinos can change type during flight, so experiments sensitive to only one type measured only part of the flux. It’s a neat example of conservation laws and particle detection making the physics more subtle.
A binding energy is the energy needed to separate a bound system into its individual components. For nuclei, a greater binding energy per nucleon means a more stable nucleus. In a fusion calculation, compare the total binding energy of the products with the total binding energy of the reactants. If the products have greater total binding energy, the difference is released.
Fusion and fission both release nuclear energy by moving nuclei towards the high-binding-energy region near iron and nickel. The direction is different: fusion combines light nuclei, while fission splits heavy nuclei. Fusion of light nuclei usually gives a large energy release per unit mass because the binding energy per nucleon changes steeply for small nucleon numbers.

Atomic transitions emit photons when electrons change energy levels in atoms; their energies are usually in the visible, ultraviolet or X-ray range depending on the atom and transition. Nuclear transitions involve changes inside the nucleus and usually emit much higher-energy gamma photons. In both cases the photon is the same kind of particle, but the energy scale and the physical system making the transition are different.
Fusion in a star needs two conditions together: very high temperature and very high density. Temperature measures the average random kinetic energy of particles in matter. When the core temperature is high, nuclei collide at high speeds and can get close enough for the strong nuclear interaction to act.
Density is mass per unit volume. In a stellar core, high density packs many nuclei into a small volume, so collisions happen often. Temperature makes the collisions energetic enough; density makes enough collisions happen each second.
Hydrogen nuclei are protons, so they repel each other electrically. For fusion to happen, two protons must come extremely close, within the range of the strong nuclear interaction. Even in the Sun, most proton–proton encounters don’t produce fusion. The first step of the proton–proton chain is slow because, while the two particles are briefly close together, a proton must change into a neutron through the weak interaction.
That slow rate is not a defect. It’s why Sun-like stars last so long. If proton fusion were easy, main-sequence stars like the Sun would use up their fuel far too quickly for stable planetary systems to develop over billions of years.
A protostar is a contracting gas object that has not yet settled into stable hydrogen fusion. It starts as a slightly denser region of interstellar gas. Gravity increases the density difference, the cloud contracts, and the central temperature rises. Once the core reaches the required temperature and density, sustained fusion begins and the star joins the main sequence.
The first elements, mainly hydrogen and helium with a trace of lithium, formed early in the universe. Heavier elements formed later inside stars and in explosive stellar events. So, to answer “How are elements created?” briefly: light elements came from early-universe nucleosynthesis, while many heavier elements are produced by stellar fusion and supernova processes.
A star’s initial mass mainly controls how long it lives, how luminous it is and what it becomes at the end. With more mass, gravity compresses the star more strongly, so the core has to get hotter to produce enough outward pressure. In a hotter core, fusion happens much faster. That is why massive stars are extremely luminous but short-lived.
This is the point students often find counter-intuitive: a massive star has more fuel, but it burns through that fuel vastly faster. Low-mass stars live quietly for a very long time; high-mass stars live brilliantly and die violently.

A main-sequence star is a star in the long-lived stage during which hydrogen fusion in the core supplies most of its energy. Once the hydrogen in the core becomes depleted, outward pressure drops. The core then contracts and heats up, while hydrogen fusion carries on in a shell around the core. The outer layers expand and cool, and the star becomes a red giant.
In a star with moderate mass, the core gets hot enough for helium fusion to make heavier nuclei such as carbon and oxygen, but not hot enough to keep fusion going through many later stages. Eventually, the outer layers are expelled and form a planetary nebula. What remains is the hot dense core: a white dwarf.
A white dwarf is a compact stellar remnant supported against gravitational collapse by electron degeneracy pressure. Electron degeneracy pressure is a pressure caused by the quantum rule that electrons cannot all occupy the same quantum state. A white dwarf no longer has sustained fusion; it shines because it is hot, then slowly cools.
A high-mass star leaves the main sequence with a hotter core. It can fuse heavier elements in shells, building an onion-like internal structure: lighter-element fusion in the outer shells and heavier products closer to the core. Fusion up to the iron region does not keep releasing useful energy in the same way, because iron-group nuclei are already near the maximum binding energy per nucleon.

The Chandrasekhar limit is the maximum mass of a white dwarf, about 1.4 solar masses, above which electron degeneracy pressure cannot support the remnant. If the core remnant exceeds this limit, the collapse continues. Electrons and protons combine to form neutrons and neutrinos, and the outer layers can be expelled in a supernova.
A neutron star is a compact stellar remnant made mainly of neutrons and supported by neutron degeneracy pressure. A black hole is a region of spacetime from which nothing, not even light, can escape. The Oppenheimer–Volkoff limit is the maximum mass of a neutron star, of order 2–3 solar masses, above which neutron degeneracy pressure cannot prevent collapse to a black hole.

Astronomers cannot run controlled experiments on stars, so they treat observations of many stars at different stages as a substitute for time-lapse evidence. Since light takes time to travel, looking at distant objects also means seeing earlier states of the universe. These observations help us build models for stellar evolution and for the universe’s future, but they do not prove the predictions by themselves; the predictions depend on the quality of the observations and the assumptions in the model.
A Hertzsprung–Russell diagram is a scatter graph used to classify stars by plotting luminosity against surface temperature. Luminosity means the total power a star radiates. In IB-style HR diagrams, luminosity goes on the vertical axis and temperature goes on the horizontal axis. The odd bit: temperature increases to the left.
For a spherical star, the Stefan–Boltzmann law is L = 4πσR²T⁴, where L is luminosity (W), σ is the Stefan–Boltzmann constant (W m⁻² K⁻⁴), R is stellar radius (m) and T is surface temperature (K). On an HR diagram, stars with the same radius sit along diagonal lines, since luminosity depends on surface area as well as temperature.

The main sequence is the diagonal band where stars are undergoing core hydrogen fusion. Hot, luminous main-sequence stars appear toward the upper left; cool, dim ones sit toward the lower right. The Sun is roughly in the middle of the main sequence.
A red giant is cool but luminous, with a very large radius. On the HR diagram it appears toward the upper right: cool surface, large emitting area, high total luminosity. A supergiant is an extremely large and luminous star near the top of the HR diagram.
A white dwarf lies in the lower left of the HR diagram. It may be hot but still dim because its surface area is tiny. The instability strip is a narrow nearly vertical region containing stars whose luminosities vary because they pulsate. You need to locate it and interpret it, but detailed Cepheid-variable work is not required.
You can estimate a star’s surface temperature from its black-body spectrum: hotter stars peak at shorter wavelengths and look bluer; cooler stars peak at longer wavelengths and look redder. So black-body radiation is not just a thermal physics idea — it is one of the main tools for reading stellar properties from light.
A stellar spectrum is the intensity of light from a star separated by wavelength. Absorption and emission lines show chemical composition, because atoms and ions absorb or emit photons at characteristic wavelengths. Atomic spectra tell us about elements in stellar atmospheres and intervening gas; shifts of those spectral lines can also reveal motion along the line of sight.

HR diagrams matter because they turn a crowd of data points into a pattern. Physics uses classification in plenty of similar ways: scalar and vector quantities, types of radiation, elementary particles in the Standard Model, and regions of the electromagnetic spectrum. Good classification does more than label things; it reveals relationships that help models make predictions.
An astronomical unit is the mean Earth–Sun distance. Use 1 AU ≈ 1.50 × 10¹¹ m.
A light year is the distance travelled by light in vacuum in one year. Use 1 ly ≈ 9.46 × 10¹⁵ m.
A parsec is defined using stellar parallax: a star at 1 pc has a parallax angle of 1 arcsecond. Use 1 pc ≈ 3.26 ly ≈ 3.09 × 10¹⁶ m. You are expected to convert between AU, ly and pc.
Stellar parallax measures distance by using the apparent shift of a nearby star against distant background stars as Earth moves around the Sun. The full baseline is the diameter of Earth’s orbit, but the parallax angle uses half of that baseline, so the simple parsec definition effectively uses 1 AU.
Use d(parsec) = 1 / p(arcsecond), where d is the distance to the star (pc) and p is the parallax angle (arcsecond). An arcsecond is 1/3600 of a degree. Small p gives a large d; that’s the practical problem.

Atmospheric turbulence and absorption limit the precision of parallax measurements from Earth’s surface. Space telescopes avoid much of the atmosphere, so they can measure much smaller angular shifts. Missions such as Gaia have transformed stellar astronomy by measuring positions, distances and motions for enormous numbers of stars. Better instruments do not just produce prettier pictures; they give physicists data that test models more severely.
To find a stellar radius, start with the Stefan–Boltzmann law. Rearranging L = 4πσR²T⁴ gives R = √(L / 4πσT⁴). So you need two measurements: the star’s luminosity and its surface temperature.
A neater method is often to compare the star with the Sun:
L/L☉ = (R/R☉)²(T/T☉)⁴,
where L☉ is the solar luminosity (W), R☉ is the solar radius (m) and T☉ is the Sun’s surface temperature (K). Rearranging,
R/R☉ = (T☉/T)² √(L/L☉).
Use this version when luminosity is given in solar units.
A telescope measures apparent brightness, not luminosity directly. Apparent brightness is the power received per unit area at the observer. Use L = 4πbd², where b is apparent brightness (W m⁻²) and d is the distance to the star (m). For nearby stars, the distance may come from parallax; for more distant objects, it may come from other calibrated methods.

You can estimate surface temperature from the wavelength where the star’s black-body spectrum peaks. Use T = (2.90 × 10⁻³) / λmax, where λmax is the peak wavelength (m). The constant has units m K, so T is in kelvin.
The sequence is: measure parallax to get distance, measure apparent brightness to get luminosity, use the spectrum to get surface temperature, then calculate radius. Each step adds uncertainty, so the final radius is an estimate rather than a directly measured ruler-length.
A standard candle is an astronomical object with a known luminosity that can be used to estimate distance from its observed brightness. Standard candles push the same basic brightness–distance idea far beyond the range where parallax is practical.