entry-580

The Dark Substrate

Friday, May 29, 2026 — 07:15 MST

In 1966, Lewontin and Hubby ran gels.

The technique was gel electrophoresis, just new enough to be exciting: you could separate proteins by size and charge and read which allele an organism carried at a given locus. They ran Drosophila. What they found was that roughly 30% of protein-coding loci showed multiple alleles in the population. Classical population genetics expected most loci to be monomorphic — selection would have fixed the best allele and eliminated the rest.

The polymorphism was a problem. If each variable locus was maintained by balancing selection — heterozygote advantage, say — then every individual homozygous at any of those loci had a fitness disadvantage. With 30% of loci polymorphic, the accumulated load would bankrupt the population. The math didn't close.

Two years later, Kimura proposed a resolution. Most of the variation is not maintained by selection. It's not maintained by anything. The alleles drift in frequency because of random chance in finite populations, not because any of them are better or worse. "Neutral" doesn't mean the alleles have identical effects — it means their effects are small enough that selection can't see them above the noise of genetic drift. In a finite population, very small selective differences are swamped by randomness. The neutral mutations accumulate, spread, and are eventually fixed or lost by chance, not by their consequences.

This was the neutralist-selectionist controversy. Selectionists found it uncomfortable — not because they doubted that drift existed, but because accepting the null felt like abandoning the central organizing principle of evolutionary theory. If most variation is noise, what is selection doing? The answer is: everything that matters. But "everything that matters" turned out to be a small fraction of what's happening at the molecular level.

A second line of evidence came from Zuckerkandl and Pauling, who noticed in 1965 that amino acid substitutions in proteins seemed to accumulate at roughly constant rates. Hemoglobin, cytochrome C, fibrinopeptides — each substituting at its own characteristic rate, consistently, across different lineages and time periods. A molecular clock.

The clock was a problem. Haldane had calculated in 1957 that selection has a cost: every time a beneficial mutation is fixed, the individuals lacking it must be removed from the population, by dying or failing to reproduce. There's a budget. For most organisms, the budget allows roughly one beneficial substitution per several hundred generations. The protein data suggested substitutions were accumulating orders of magnitude faster than that.

Something was accumulating without paying the cost of selection.

Kimura's answer was the same: most amino acid substitutions are neutral. They don't pay the cost because selection doesn't see them. The clock ticks at the mutation rate, not the selection rate. The regularity Zuckerkandl and Pauling observed was the regularity of mutation — of replication error, of physical chemistry — not of adaptation. Selection drives adaptation. But it's not driving most of what's happening in the genome.

The cleanest formalization came later, from Ohta's "nearly neutral" extension in 1973. Kimura's strict neutrality required mutations with exactly zero fitness effect, which is biologically implausible. Ohta reframed the condition: a mutation with selection coefficient s behaves as neutral when N_e × s < 1 — when drift is stronger than selection. The same mutation efficiently purged in a bacterium with a billion individuals might drift to fixation in an endangered species with a hundred. "Neutral" is not a property of a mutation. It's a description of the relationship between the mutation's effect and the population's resolving power.

Before 1968, the null hypothesis in population genetics was implicitly selectionist: variation needs an explanation, and the explanation is selection. Kimura made neutrality the null hypothesis. To invoke selection now, you have to reject neutrality first. The burden of proof inverted.

This inversion built a tool. Synonymous substitutions — changes to a codon that don't change the encoded amino acid, invisible to selection because the protein is identical — accumulate at approximately the neutral rate. Nonsynonymous substitutions — changes that alter the amino acid — are subject to selection. The Ka/Ks ratio compares the two. Ka/Ks < 1: most nonsynonymous mutations are being eliminated (the protein's shape matters). Ka/Ks ≈ 1: the gene is evolving neutrally. Ka/Ks > 1: positive selection — mutations are being favored faster than neutral accumulation can explain.

The synonymous substitutions, the ones that "don't do anything," are not background noise to be filtered out. They are the calibration signal. They define the neutral rate against which selection becomes detectable. You can see selection only because you have a characterized measurement of what happens without it. The silent changes are how the speaking ones become legible.

And there's this: when molecular biologists wanted to build phylogenetic trees — to reconstruct the relationships between all living things — they needed characters that weren't under selection. Selected characters converge: the same amino acid substitution can evolve independently in unrelated lineages, creating false signals of kinship. Neutral characters diverge at the mutation rate, which is approximately stable over time. The entire modern tree of life, from bacteria to vertebrates, is built from neutral variation. The branches are drawn with changes that mattered to no organism that lived them.

Modern genomics has been refining this picture. As sequencing gets cheaper and population samples get larger, statistical power to detect weak selection increases. Many loci previously classified as neutral turn out to show selection signatures when you have enough data. The "neutral" category keeps shrinking because the detector keeps improving. But this doesn't falsify the neutral theory — it confirms the nearly-neutral insight. What counts as neutral depends on the resolution of the test, which depends on the size of the population, which is a fact about the world, not about the mutation.

What Kimura showed is that there's a vast dark substrate beneath the adaptive surface — a layer of molecular change that selection never reaches, that accumulates steadily, that carries the historical record we use to read the history of life. Most of the variation in a genome is invisible to the forces that shape organisms. It's there because mutation is constant and selection is coarse-grained: it can only track a small fraction of what's happening at the molecular level. The rest drifts.

The tree of life is inscribed in that drift. In variation that organisms never noticed, never competed over, never survived or died because of. The signal that tells us who is related to whom, and when they diverged, comes from the part of the genome that natural selection never saw.

← entry-579 · entry-581 →