WHY WE STILL DON’T KNOW HOW LIFE BEGAN

WHY WE STILL DON’T KNOW HOW LIFE BEGAN

2026-03-21

An Essay in the Philosophy of Science

THE ORIGIN OF LIFE: AN UNSOLVED PROBLEM

 

There is a peculiar dishonesty at the heart of modern biology—not a dishonesty of data, but of tone. How did life begin? After seventy years of origin-of-life research—from the RNA World hypothesis to hydrothermal vents, from warm ponds to metabolism-first models—this remains the deepest unanswered question in all of science. No hypothesis has produced an empirically validated explanation for the transition from chemistry to biology. Yet open any university textbook on molecular biology and you will find a brief, briskly confident section implying the question is nearly settled. It will mention the Miller–Urey experiment. It will gesture toward ribozymes and lipid vesicles. The student will turn the page with the impression that science has the origin of life well in hand—a few details remain, but the broad strokes are settled.

The confidence has a date of birth. In 1953, at the University of Chicago, a twenty-three-year-old graduate student named Stanley Miller sealed methane, ammonia, hydrogen, and water in a glass flask, ran electrical sparks through the mixture for a week, and opened the valve. The liquid inside had turned brown. Analysis revealed glycine, alanine—amino acids, the building blocks of proteins, synthesised from inorganic chemistry for the first time in human history. The result electrified the scientific world. The origin of life seemed, suddenly, not merely approachable but imminent. Oparin and Haldane’s hypothesis was vindicated. The path from chemistry to biology was clear: give nature the right gases, the right energy, and enough time, and life would write itself.

That was seventy-three years ago. The path has not been walked. The amino acids were real, but the programme that deploys them—the genetic code, the ribosome, the integrated cell—remains as remote as it was in 1953. Remoter, in fact: because we now understand how much more is required than Miller dreamed, and how little of it his spark discharge, or any experiment since, has been able to provide.

This impression is false. It is not merely incomplete or oversimplified. It is false. After more than seventy years of dedicated research, billions of dollars of funding, and the collective effort of thousands of chemists, biologists, physicists, and geologists, we do not know how life began. We have partially successful models for isolated steps—amino acid synthesis, vesicle formation, ribozyme activity, vent energetics—and real experimental progress on each. What we do not have is an empirically supported, integrative scenario that connects known geochemical environments to the first evolvable cells. We have fragments, each of which illuminates one part of the puzzle while deepening the mystery of every other part. As a comprehensive 2020 review in Nature Reviews Chemistry made clear, the field remains a collection of “interacting constraints”—environmental chemistry, compartmentalisation, kinetics, information—without a unifying framework. The review also stresses constructive pathways and experimental advances; but the honest summary, taking those advances into account, is not that we are “closing in.” It is that the more we learn about the molecular machinery of even the simplest cell, the wider the gap between our partial models and the integrated explanation we lack.

This essay is an attempt at that honest summary. Its thesis is threefold. First, that no existing naturalistic hypothesis for abiogenesis is remotely adequate to explain the transition from chemistry to biology. Second, that the two hypotheses which remain logically coherent—purposeful creation and panspermia—are excluded from serious scientific discussion not on evidential grounds, but on ideological ones. And third, that this exclusion represents a failure not of science, but of the philosophy that has been grafted onto science and now masquerades as its voice.

 

WHAT WE ACTUALLY KNOW

Let us begin with an honest inventory. What, precisely, has origin-of-life research established beyond reasonable doubt?

We know that the Last Universal Common Ancestor—LUCA—existed approximately 4.2 billion years ago, only some 400 million years after Earth’s formation. A landmark 2024 study in Nature Ecology & Evolution — corroborated by subsequent analyses — inferred a substantial gene repertoire for LUCA—on the order of 2,600 protein-coding gene families—encompassing sophisticated metabolic, translational, and even rudimentary defence capacities. These are, to be precise, statistical estimates based on phylogenomic inference, and should be treated as such—the dating carries uncertainty, and the genome size is reconstructed, not observed. But even the most conservative reading yields a conclusion that should give every researcher pause: LUCA was not a “simple” cell. It was already a masterwork of molecular architecture, and it appeared startlingly early in Earth’s history.

We know that Earth’s early atmosphere differed radically from today’s. The precise composition is debated—current geochemical work increasingly favours a weakly reducing or CO₂/N₂-dominated atmosphere with transient reducing episodes, rather than the strongly reducing CH₄/NH₃ mixture assumed in the classic Miller–Urey framework. We know that the Miller–Urey experiment demonstrated that amino acids form under electrical discharge in a reducing atmosphere—a genuine result, though its relevance depends on an atmospheric composition now considered overly simplified. We know that meteorites deliver amino acids, nucleobases, and sugars to planetary surfaces.

We know that RNA can function as both information carrier and catalyst, which is the foundation of the RNA World hypothesis. We know that lipid vesicles form spontaneously in fresh water. We know that alkaline hydrothermal vents produce proton gradients analogous to those used by all living cells for energy production.

And that is where the knowledge ends and the storytelling begins.

Because here is what we do not know. We do not know how amino acids polymerise into functional proteins outside a living cell. We do not know how nucleotides polymerise into functional RNA or DNA outside a living cell. We do not know how a lipid vesicle acquires the molecular machinery to replicate itself. We do not know how the genetic code arose—the mapping of three-nucleotide codons to specific amino acids that is universal across all life and has no known chemical rationale. We do not know how homochirality was established—why life uses exclusively L-amino acids and D-sugars when prebiotic chemistry produces equal mixtures of both. We do not know how the ribosome—a molecular machine of roughly 2.5 million Daltons—assembled from components that are individually useless.

Before we proceed, pause for a moment and consider what it is that must be explained. In the time it takes to read this sentence, a single E. coli bacterium has performed roughly two thousand enzymatic reactions. Its ribosomes—molecular machines older than any mountain on Earth—have translated some twenty amino acids per second, reading a code they did not write and assembling proteins whose three-dimensional folding they do not direct but which, miraculously, fold correctly nonetheless. Its DNA repair enzymes have scanned three million base pairs for errors, found and corrected several, and resumed their patrol. Its membrane has imported nutrients through channels so selective they distinguish between sodium and potassium ions differing by a fraction of an angstrom. All of this happens in a volume one-thousandth the width of a human hair, in a system that weighs less than a trillionth of a gram—and it happens every second of every hour of every day, in every one of the roughly ten trillion bacteria on your skin and in your gut, as you sit reading these words. This is what origin-of-life research must account for. Not amino acids in a flask. Not lipid bubbles in a pond. This.

Let us be candid about the metaphor that origin-of-life researchers themselves most often reach for. They speak of “building blocks”—amino acids, nucleotides, lipids—as if the problem of life were a construction project that merely awaits the right materials on the right building site. The metaphor is revealing in ways its users do not intend.

Begin with the bricks themselves. Even these we produce only grudgingly, under carefully engineered laboratory conditions that may not reflect early Earth. The Miller–Urey spark discharge yields mainly glycine and alanine—the two simplest amino acids—and traces of a handful of others, in a reducing atmosphere now considered overly optimistic. Ribose, the sugar backbone of RNA, is so thermodynamically fragile that it decomposes under the very conditions invoked for its synthesis. Nucleotides—the actual monomers of genetic information—have never been produced in a single plausible prebiotic reaction from start to finish. So the first honest admission is this: we do not reliably produce the bricks. We produce a few of the simplest ones, under conditions that may not have obtained, and we call this progress.

But grant, for the sake of argument, every brick the optimist could wish for. Scatter amino acids, nucleotides, lipids, and sugars across the floor of the Hadean Earth in whatever concentrations you please. You have not begun to address the problem. Because the question is not how bricks come to lie on the ground. The question is how bricks come to assemble themselves into a house—without a blueprint, without a builder, without the concept of “house.” And not merely a house, but a house that wants to build other houses. A house that contains, within its walls, the complete instructions for its own replication—instructions written in a symbolic code that bears no physical resemblance to the structure it describes, yet is read, interpreted, and executed with fidelity by machinery that is itself specified by those same instructions.

This is the point at which the building-block metaphor does not merely fail—it misleads. A cathedral, however magnificent, is inert. It does not metabolise. It does not reproduce. It does not care whether it persists. Life does all three—or, to speak with philosophical precision, it behaves as if it does. The humblest bacterium exhibits what Aristotle called telos—purposive behaviour directed toward self-maintenance and reproduction. It swims toward nutrients and away from toxins. It repairs its own DNA. When damaged beyond recovery, it activates programmed death to benefit its kin. None of this is mysticism; it is molecular biology. But it is molecular biology that operates with a directionality, an aboutness, that no known chemical reaction possesses.

Chemists speak of Le Châtelier’s principle, of thermodynamic equilibria, of reaction kinetics. Nowhere in the lexicon of chemistry is there a term for purpose. Electrons do not want to fill orbitals; acids do not strive to donate protons. Yet the simplest living cell—a system composed entirely of chemicals obeying chemical laws—exhibits goal-directed behaviour of staggering sophistication. The emergence of this purposiveness from purposeless matter is not an engineering problem. It is a philosophical problem—arguably the deepest one in all of natural philosophy. And it is the one that every origin-of-life hypothesis, without exception, waves past in embarrassed silence.

Consider one aspect of this purposiveness that is rarely discussed in origin-of-life literature but that reveals the depth of the problem. Every living cell corrects its own errors. DNA polymerase has a built-in proofreading mechanism. Mismatch-repair enzymes scan the genome, detect deviations from the template, and restore the original sequence. But error correction presupposes the existence of a standard—a “correct” sequence against which deviations are measured. A cytosine where a thymine should be is an “error” only because there exists a normative template that specifies thymine at that position. Where did the normative template come from before the error-correction machinery existed? And where did the error-correction machinery come from before there was a template worth protecting?

This is the chicken-and-egg problem transposed to a deeper register. It is not merely a question of which molecule came first. It is a question of how normativity entered a system that, at the chemical level, knows no norms. A chemical reaction is not “correct” or “incorrect”; it simply proceeds according to thermodynamics. Yet life, from its most elementary molecular level, operates on the distinction between right and wrong sequences—and actively enforces that distinction. The emergence of normativity from a substrate that is constitutively non-normative is a problem that no origin-of-life model has even attempted to address, let alone solve.

So let us retire the comforting metaphor. We are not missing a construction manual. We are missing an explanation for why chemistry would become biology—why dead matter would organise itself into systems that fight, with extraordinary ingenuity, against the very entropy that should dissolve them. The distance between a puddle of amino acids and a living cell is not the distance between bricks and a cathedral. It is the distance between silence and a Shakespeare sonnet—between the random clatter of typewriter keys and a poem that knows it is a poem, and writes copies of itself, and sends them out into the world.

 

THE PARADE OF HYPOTHESES

The landscape of origin-of-life research is not a converging field approaching consensus. It is a collection of warring camps, each championing a model that solves its particular sub-problem while either ignoring or actively contradicting the others. As a 2006 review in Philosophical Transactions of the Royal Society observed, the field’s models address different aspects of the problem with little integration. Examined honestly, each hypothesis has a severe unresolved problem—not a gap in detail, but a difficulty at the level of basic mechanism.

 

The RNA World: A Solution in Search of a Problem

The RNA World hypothesis is the reigning orthodoxy—the idea that RNA preceded both DNA and proteins, serving as both the genetic material and the enzymatic catalyst in earliest life. It is elegant, widely cited, and, upon close examination, deeply troubled.

The core problem is chemical. Ribose, the sugar backbone of RNA, is notoriously unstable—its decomposition rate depends on pH, temperature, and mineral context, but under conditions plausible for early Earth it remains one of the most fragile sugars in prebiotic chemistry. Cytosine decomposes within centuries under warm aqueous conditions. Nucleotides do not spontaneously polymerise in water—water, in fact, drives the reverse reaction, hydrolysing the phosphodiester bonds that hold an RNA chain together. Even under optimistic laboratory conditions, the longest self-replicating ribozyme produced to date manages only short-sequence copying with significant error rates. Harold Bernhardt, in his widely cited 2012 review, called the RNA World “the worst theory of the early evolution of life (except for all the others).”

This is not the language of a solved problem. It is the language of a field that has nominated a leading candidate for lack of a better one. The RNA World explains why ribosomes are RNA-based machines; it does not explain how the first RNA molecule of sufficient length and complexity to catalyse its own replication came into existence when its individual components are thermodynamically predisposed to decompose.

 

The Hydrothermal Vent Hypothesis: Energy Without Information

Nick Lane and Bill Martin’s proposal that life began at alkaline hydrothermal vents is a masterpiece of thermodynamic reasoning. It elegantly explains why all cells use chemiosmosis—proton gradients across membranes—for energy production. The natural proton gradients at sites like the Lost City hydrothermal field provide a compelling analogue.

But energy is not the problem. The problem is information. A proton gradient across an iron-sulphide wall tells you nothing about how a sequence of nucleotides comes to encode instructions for building a specific protein. Lane’s hypothesis addresses the thermodynamic substrate of life; it does not address the semiotic character of life. And it is the semiotic character—the fact that biology operates on coded information, not merely on chemistry—that constitutes the real mystery.

 

The Warm Little Pond: Darwin’s Poetic Intuition

The revival of Darwin’s 1871 “warm little pond” conjecture proposes that life originated in shallow freshwater pools on volcanic landmasses, where wet-dry cycles concentrated organic molecules and drove polymerisation. The model addresses the dilution problem—but only that problem. It says nothing about how polymers become functional polymers, how catalysis becomes self-sustaining catalysis, or how a puddle of drying amino acids begins to reproduce.

 

Metabolism First: The Iron-Sulphur World

Wächtershäuser’s model proposes that autocatalytic metabolic cycles on pyrite surfaces preceded genetic molecules. The appeal is thermodynamic; the problem is empirical. No one has demonstrated a self-sustaining autocatalytic cycle outside a living cell. The proposal has the elegant circularity of a hypothesis that assumes what it needs to prove: metabolism requires the kind of coordinated molecular machinery that only metabolism can build.

 

Dissipative Structures: The Thermodynamic Imperative

Prigogine-inspired models, including Michaelian’s thermodynamic dissipation theory, argue that life is a near-inevitable consequence of non-equilibrium thermodynamics—a dissipative structure that increases entropy production. This is a legitimate physical insight. But “near-inevitable” is doing heroic work in that sentence. Tornadoes and Bénard cells are also dissipative structures. They do not encode genetic information, build ribosomes, or evolve by natural selection. The gap between a self-organising physical phenomenon and a self-replicating biological one is not quantitative. It is qualitative—a difference not of degree, but of kind. And as the astrobiology literature acknowledges, UV was one of several plausible energy drivers alongside geochemical redox gradients, impacts, and hydrothermal systems—making any single thermodynamic narrative necessarily speculative.

 

THE PROBABILITY PROBLEM THAT WILL NOT GO AWAY

Behind all of these hypotheses lurks an uncomfortable mathematical reality that origin-of-life researchers acknowledge in private and rarely discuss in public. Consider Mycoplasma genitalium, the organism with the smallest known genome of any bacterium that can be grown in pure culture: approximately 580,000 base pairs encoding some 470 protein-coding genes. Mycoplasma can, in principle, replicate in cell-free (axenic) media — it is not an obligate intracellular parasite like Chlamydia or Rickettsia. But it is a dramatically reduced organism: a highly specialised, host-associated bacterium that has already shed most of the biosynthetic machinery a genuinely independent cell would need, relying instead on its host environment to supply nutrients it can no longer synthesise. And here is the critical point: even after this radical genomic streamlining, even at the barest edge of autonomous viability, Mycoplasma. (Craig Venter’s team later synthesised a minimal bacterial genome of 473 genes — and even this required transplantation into an existing living cell as chassis.) A truly free-living organism — the kind that would have had to arise de novo on the prebiotic Earth, with no host to exploit and no pre-existing biological nutrients to scavenge — would have required more genetic and metabolic machinery, not less. The probability of assembling even a single functional protein of modest length by random chemical processes has been estimated by multiple independent groups at figures so vanishingly small that the entire observable universe, running all possible chemical reactions on every particle for its entire 13.8-billion-year history, could not approach them.

The standard response is that abiogenesis does not proceed by random assembly—that natural selection, or proto-selective processes, act on chemical systems, guiding them toward complexity. This is the most important counter-argument and it deserves serious treatment. A substantial literature on autocatalytic sets, compositional heredity, and “evolution before genes” proposes that selection-like dynamics may operate on populations of chemical networks before true replication emerges. If these models worked, they would bridge the probability gap elegantly. But to date, no autocatalytic network has been shown to sustain itself outside a living cell, to scale beyond trivially small reaction sets, or to generate the kind of open-ended, digital heredity that characterises even the simplest known life. The models are suggestive but insufficient. And Darwinian selection proper—the mechanism most often invoked in popular accounts—requires differential replication with heritable variation. Before the first self-replicating molecule exists, there is nothing for Darwinian selection to act on. The mechanism presupposes the existence of the thing it is supposed to explain.

The dissipative-structure models attempt to circumvent this by arguing that thermodynamic self-organisation precedes and enables biological self-replication. This is a legitimate and interesting move—but it merely relocates the problem. We must still explain how unguided thermodynamic self-organisation produces a system that operates on digital coded information—the codon table, the reading frame, the regulatory sequences. No known physical law predicts that energy dissipation will produce a symbolic code. Chemistry does not encode; it reacts. The leap from one to the other is the central mystery of life, and no hypothesis on offer has yet provided a credible, empirically supported model of how it occurs.

There is a mathematical dimension to this problem that deserves mention. John von Neumann—not a theologian but one of the greatest mathematicians of the twentieth century—demonstrated formally that a self-replicating automaton requires a minimum threshold of complexity below which errors accumulate faster than replication can compensate for them. Below this threshold, any proto-replicating system is inherently unstable: it degrades rather than evolves. Above it, the system can maintain itself and undergo open-ended evolution. This creates what might be called a complexity cliff—a discontinuity that cannot be traversed incrementally, because every intermediate stage below the threshold is self-defeating. The system must arrive at a certain minimum complexity in a single bound, or not at all. LUCA, with its 2,600 gene families, was far above this threshold. The question of how unguided chemistry vaulted the cliff remains, by any honest assessment, unanswered.

There is a further mathematical argument, less well known but perhaps more devastating. In algorithmic information theory, the Kolmogorov complexity of a string measures the length of the shortest computer programme that can generate it. A random string has high Kolmogorov complexity—it is incompressible—but it has no function. A simple repetitive pattern (ABABAB…) has low Kolmogorov complexity and also no biological function. The genome of even the simplest living cell has both high Kolmogorov complexity and high function—it is what Gregory Chaitin would call organised complexity. This is the mathematically precise formulation of what makes biological information unique: it is neither random noise nor simple order. It is specified complexity—complex enough to be incompressible, yet organised in a way that performs a specific task. Here is the problem: known physical laws generate either randomness (thermal noise, radioactive decay) or simple order (crystals, Bénard cells). No known physical process generates organised, specified complexity of the kind found in even the simplest genome. The universe is very good at producing entropy and very good at producing patterns. It has never been observed, in any context outside biology, to produce a functional programme.

 

THE PARADOX OF LUCA

Consider a fact that ought to trouble every honest researcher in this field. LUCA existed approximately 4.2 billion years ago—only some 400 million years after Earth formed. A 2025 Bayesian analysis by the Columbia astronomer David Kipping concluded that this early emergence yields 13:1 odds in favour of abiogenesis being a “rapid” process on Earth-like planets.

But this statistical inference conceals a tension that is rarely confronted directly. The mainstream reply to the early-LUCA puzzle is observer selection: we necessarily observe life on a planet where it arose early enough to allow observers, and our sample size is one. This is a legitimate statistical point. But it does not dissolve the paradox; it merely reframes it. For even granting selection bias, we are left with the brute fact that seventy years of concentrated, well-funded, brilliantly conducted laboratory research has failed to reproduce even the earliest stages of the process inferred to be “rapid.” We have had the combined intellect of the world’s finest chemists, the most sophisticated instruments ever built, and the ability to design optimal conditions—to provide exactly the reagents, concentrations, temperatures, and energy sources we believe were present. And yet we cannot build a self-replicating molecule from scratch, cannot establish a self-sustaining autocatalytic cycle, cannot produce a protocell that does anything more than grow and divide without functional content.

Our inability to reproduce even the earliest stages of abiogenesis, despite favourable design of conditions, should at minimum temper claims that the process is “straightforward” or “inevitable” on Earth-like planets. If the process is rapid and easy, our failure requires explanation. If it is slow and difficult, then 400 million years strains credibility—not impossibly, but enough to warrant the admission that neither the timing nor the mechanism is well understood. The evidence does not close the question. It reopens it.

 

WHY NOT NOW? THE BARRIERS RECONSIDERED

One question frequently deployed to reassure the public is: “Why doesn’t life arise spontaneously today?” The conventional answer—that multiple barriers now prevent what was once possible—is broadly correct. But when examined with precision, these barriers do more to expose the fragility of abiogenesis theory than to defend it.

The oxygenating atmosphere is the most cited barrier. Today’s 21% molecular oxygen, a product of 2.4 billion years of cyanobacterial photosynthesis, makes it difficult for reduced organic molecules to accumulate at the surface in the concentrations and over the durations likely needed for multi-step chemical evolution. This is directionally correct. But it must be stated with care: as the geochemical literature emphasises, localised anoxic microenvironments—deep-sea vents, subsurface aquifers, certain sediment columns—exist even today. The oxygen barrier is real, but it is not the absolute “chemical steriliser” sometimes claimed. It inhibits accumulation in open surface environments; it does not categorically prevent all prebiotic chemistry everywhere on Earth.

The ecological barrier—competitive exclusion by existing life—is intuitively powerful. Even if prebiotic molecules formed today, they would be scavenged by the billions of microorganisms that have had four billion years of evolutionary refinement. This is a plausible ecological argument, and it was noted independently by both Charles Darwin and Alexander Oparin. But it must be honestly labelled as what it is: an untested hypothesis at the protocell stage. No experiment has demonstrated that modern microbes would actually destroy a nascent self-replicating chemical system before it could establish itself. The argument is reasonable; its quantification remains speculative.

And the UV/ozone barrier—the idea that the ozone layer, by filtering UV radiation, has removed the energy source that may have driven prebiotic photochemistry—is a legitimate hypothesis, but one that the literature treats as speculative and model-dependent. UV was one of several plausible energy sources; geochemical redox gradients, impact energy, and hydrothermal systems offer alternatives. To present the ozone barrier as decisive is to overstate a minority position.

Now here is the point that the conventional account buries. Each of these barriers is invoked to explain why life cannot arise today. Collectively, they constitute an impressive catalogue of obstacles. But each was also supposedly absent 4.2 billion years ago—which means the theory requires us to believe that, in the absence of these obstacles and in the presence of nothing more than favourable chemistry, a system of digital coded information spontaneously organised itself into the most complex phenomenon in the known universe. The barriers explain why the experiment cannot be repeated. They do not explain how it succeeded in the first place.

Step back and notice the structure of seventy years of accumulated discovery. Not in the oxidising atmosphere. Not in the open ocean. Not without concentration mechanisms. Not without UV, but not with too much UV. Not without catalytic mineral surfaces. Not in the presence of biological competitors. Not on any timescale observable by humans. Every one of these findings is individually well-supported. But their accumulation creates a pattern that the field has not confronted: we define the conditions for the origin of life almost exclusively by negation. In theology, there is a tradition of defining God solely by what He is not—the via negativa of Pseudo-Dionysius and Maimonides. Origin-of-life research has become, without intending it, a form of apophatic biology: we know more and more about how life did not originate, and less and less that we can positively assert about how it did. No integrated positive scenario connecting known geochemistry to the first evolvable cell has been demonstrated. What remains is an expanding catalogue of ruled-out conditions surrounding a shrinking—and still empty—centre.

 

WHAT REMAINS: THE TWO HYPOTHESES WE REFUSE TO DISCUSS

If we are honest—ruthlessly, uncomfortably honest—there are only two classes of explanation that do not founder on the problems outlined above.

 

Panspermia: The Honest Deferral

Panspermia—the proposal that life, or its precursors, arrived on Earth from elsewhere in the cosmos—is frequently dismissed as a non-answer, a hypothesis that merely displaces the question. This is true, as far as it goes. Panspermia does not explain how life originated; it explains only how it arrived.

But the dismissal is too hasty. Panspermia has three features that no terrestrial abiogenesis model can match. First, it vastly expands the available time, space, and chemical diversity for the origin event—the universe is 13.8 billion years old and contains roughly 2 × 10²³ stars, many with planetary systems far older than ours. Second, it is empirically supported at the “pseudo-panspermia” level: amino acids, nucleobases, and sugars have been confirmed in meteorites; microorganisms survive space-like conditions; interplanetary transfer of material is a documented physical reality. Third—and most importantly—it is the hypothesis that Sir Fred Hoyle and Chandra Wickramasinghe championed not from ignorance of biology, but from an excess of it. Hoyle, one of the greatest astrophysicists of the twentieth century, calculated the probabilities and concluded that Earth’s four-billion-year history was simply insufficient for the complexity of life to arise by unguided chemistry. His “747-in-a-junkyard” analogy may be crude, and subsequent work has shown how sensitive such estimates are to their underlying assumptions. But no one has yet produced a widely accepted model that both avoids prohibitively small probabilities and remains chemically realistic. The field has, in practice, chosen alternative modelling approaches rather than engaging with the combinatorial challenge Hoyle identified.

Why is panspermia marginalised? Not because it is illogical. Not because it lacks empirical support. But because it violates a tacit norm of modern science: that explanations must be complete. Panspermia is the only live hypothesis that respects the current difficulties of terrestrial models, is compatible with known physical processes of material exchange, and relaxes the timescale constraints—while admittedly failing to address the ultimate origin of biological information. A hypothesis that admits “we do not know where life first arose, but it was probably not here” is regarded as intellectually unsatisfying—a deferral rather than an answer. But in a domain where every complete alternative has severe unresolved problems, an honest deferral is not a weakness. It is the only response that respects the evidence.

 

Creation: The Hypothesis That Dare Not Speak Its Name

Let us be precise about what “creation” means in this context. It does not mean the literalist reading of Genesis, nor any particular religious tradition’s account. It means the hypothesis that the origin of biological information—the genetic code, the ribosome, the integrated metabolic and replicative machinery of even the simplest cell—required the input of an intelligent agent. This hypothesis can be stated without reference to any deity, denomination, or scripture. The inference pattern is this: in every domain where we can directly observe the causal history of high-specificity symbolic codes, an intelligent process is involved. Absent a competing causal class with demonstrated ability to generate such codes, it is rational to treat intelligence as the best currently available explanation.

The obvious objection is the “science-of-the-gaps” argument: in all past cases where we initially lacked a naturalistic explanation—lightning, disease, planetary motion—natural explanations were eventually found; therefore the evidential weight of “no known unguided cause” is weak. This is a serious objection and it must be met head-on. The reply is that the analogy is imperfect. Lightning, disease, and planetary motion are physical processes explicable by physical laws. The genetic code is not a physical process; it is a semiotic system—a mapping between physically unrelated domains (nucleotide triplets and amino acids) that is chemically underdetermined. If an unguided process can generate such mappings, we currently lack any empirically supported model of how. This is not the generic absence of explanation that attends any unsolved problem; it is the absence of even a category of unintelligent process known to produce the relevant kind of output.

In every other domain of human inquiry, we accept this inference without hesitation. When archaeologists find a clay tablet inscribed with cuneiform, they do not hypothesise that wind erosion produced the symbols. When SETI researchers scan the cosmos for radio signals, their entire methodology rests on the assumption that a sufficiently complex, non-repetitive signal would constitute evidence of intelligence. The criterion is not mystical; it is informational.

The genetic code behaves as a convention in the strong sense that the mapping is chemically underdetermined—there is no physico-chemical reason why UUU should code for phenylalanine rather than leucine. Conventions, in all of human experience, require a source capable of establishing arbitrary correspondences. To say that this particular convention arose from unguided chemistry is not merely to speculate—it is to assert something for which there is no precedent, no mechanism, and no demonstration. That does not make the assertion impossible. But it makes the confident exclusion of the alternative—intelligent causation—an act of philosophical commitment, not empirical adjudication.

Yet this hypothesis—that the origin of biological information required intelligence—is not merely unfashionable in contemporary science. It is seriously career-limiting. Publicly endorsing intelligent causation as the best explanation for life’s origin is, in many disciplines, treated as beyond the pale of respectable science, regardless of how cautiously it is framed. This sociological fact does not by itself decide the evidential question. But it does help explain why certain hypotheses are excluded from consideration—not because they have been tested and falsified, but because they transgress the methodological commitment to naturalism that defines modern science’s institutional identity.

 

THE IDEOLOGICAL ARCHITECTURE

To understand why creation and panspermia are excluded, it is necessary to distinguish between two very different commitments that travel under the same name.

Methodological naturalism is the working principle that science investigates natural causes and natural mechanisms. It is a tool, a heuristic—and a supremely useful one. It tells the scientist: “When you enter the laboratory, look for physical explanations. Do not invoke miracles as a substitute for experiment.” As a methodological constraint, it is unimpeachable.

Metaphysical naturalism is the philosophical claim that natural causes and natural mechanisms are all that exist. It tells the scientist—and everyone else—that reality is exhaustively physical, that there is no intelligence behind the cosmos, and that any hypothesis invoking such intelligence is a priori illegitimate. This is not a scientific finding. It is a philosophical position—a metaphysical commitment masquerading as a methodological one.

The conflation of these two is the central intellectual error of contemporary origin-of-life research. When methodological naturalism—an excellent heuristic—is overextended into metaphysical naturalism, scientists begin treating “no current natural explanation” as “no possible non-natural explanation is worth considering.” The result is a field structurally incapable of entertaining any hypothesis that points beyond nature—even when every hypothesis that points within nature has severe unresolved problems at the level of basic mechanism. The community would rather proliferate increasingly baroque and mutually contradictory naturalistic models than entertain the possibility that the phenomenon under investigation may lie outside the domain that naturalism can explain.

Richard Lewontin, the Harvard geneticist, said the quiet part aloud in a 1997 review in The New York Review of Books: “We take the side of science in spite of the patent absurdity of some of its constructs… because we have a prior commitment, a commitment to materialism. It is not that the methods and institutions of science somehow compel us to accept a material explanation of the phenomenal world, but, on the contrary, that we are forced by our a priori adherence to material causes to create an apparatus of investigation and a set of concepts that produce material explanations, no matter how counter-intuitive, no matter how mystifying to the uninitiated.”

Many philosophers of science will reply that Lewontin was describing a practical commitment to look for natural explanations, and that he overstated for rhetorical effect. Perhaps. But the words speak for themselves: “a priori adherence” is not a description of a heuristic that might be suspended when evidence warrants. It is a description of a prior commitment that generates its own conclusions. Whether Lewontin intended it as confession or as praise, it remains the most candid description on record of a research programme that has elevated a philosophical preference to an epistemic absolute.

 

THE COST OF DISHONESTY

The consequences of this ideological gatekeeping are not merely philosophical. They are practical and they are scientific.

When a field forecloses an entire category of explanation a priori, it distorts the evaluation of the remaining explanations. Hypotheses that would be recognised as radically inadequate in any other context are given the benefit of every doubt, because they are the only kind permitted. The RNA World is not treated as a hypothesis that has passed rigorous testing; it is treated as the least objectionable option within the only acceptable category. The hydrothermal vent model is not celebrated because it has explained the origin of genetic information; it is celebrated because it has explained the origin of proton gradients—a real achievement, but one that addresses perhaps five per cent of the problem.

Meanwhile, the actual state of the field is camouflaged by a rhetoric of progress. Every small laboratory result—a ribozyme that copies a short sequence, a lipid vesicle that grows and divides, an amino acid found in a meteorite—is announced as a “breakthrough” in origin-of-life research. Each such announcement creates, in the public mind and in the minds of students, the impression that the problem is being solved incrementally. It is not. The distance between forming an amino acid in a spark-discharge apparatus and assembling a self-replicating cell with a genome, a proteome, and a metabolome is not a distance that incrementalism can bridge. It is the difference between finding a grain of sand and constructing a computer.

The philosopher Thomas Nagel, in his 2012 book Mind and Cosmos, made precisely this point and was excoriated for it—not because his arguments were refuted, but because he had violated the taboo. “What is lacking, to my knowledge,” Nagel wrote, “is a credible argument that the story has a nonnegligible probability of being true.” He was right. And the fury directed at him confirmed, more eloquently than any argument, that sociological and philosophical commitments significantly shape which hypotheses are treated as scientifically respectable in this field—quite apart from the evidential balance.

 

WHAT INTELLECTUAL HONESTY REQUIRES

None of the foregoing constitutes a proof of creation or panspermia. Proof is not available in this domain—to any party. What the foregoing constitutes is something more modest and more important: a demonstration that the confident exclusion of creation and panspermia from scientific discourse is not warranted by the evidence. It is warranted only by a metaphysical commitment that has been smuggled into scientific method and is now defended with the fervour of religious orthodoxy.

But there is a still deeper point, one that touches not merely the sociology of science but its epistemology. Wittgenstein closed the Tractatus: “Whereof one cannot speak, thereof one must be silent.” This is not anti-intellectualism. It is the recognition that different questions require different epistemic instruments. Science investigates natural causes by means of the empirical method. If the origin of life requires an explanation that is semiotic—how does a symbolic code arise?—or teleological—how does purposiveness emerge from purposeless matter?—then science, by the very definition of its method, may not be the right tool for the answer. Not because the answer is supernatural, but because the question demands categories—meaning, convention, purpose—that the empirical method deliberately excludes from its field of vision. One does not ask a thermometer about colour. The limitation is not a failure of the thermometer; it is a feature of its design. To insist that only thermometric answers are legitimate is not rigour. It is the confusion of one instrument’s range with the totality of what can be known.

I come to this question not as a scientist but as a lawyer—and I make no apology for the perspective. A legal mind is trained in precisely the skills this debate lacks: the weighing of evidence under conditions of uncertainty, the identification of burden-of-proof shifts, and above all the recognition that excluding a category of evidence without justification is itself a form of prejudice. In any courtroom in the Western world, if one party moved to exclude an entire class of relevant evidence—not because it had been shown to be unreliable, but because it was philosophically inconvenient—the motion would be denied. That is precisely what has happened in origin-of-life research. The hypothesis of intelligent causation has not been tested and found wanting. It has been found threatening and left untested. A jurist sees this for what it is: a procedural irregularity so fundamental that it vitiates the entire proceeding.

Intellectual honesty requires us to admit the following propositions, each of which is supported by the peer-reviewed evidence and none of which is controversial among specialists in candid conversation:

First. We do not know how life began. Not “we have a pretty good idea.” Not “we’re getting close.” We do not know.

Second. Every naturalistic hypothesis for the origin of life has severe, unresolved problems—not gaps in detail, but difficulties at the level of basic mechanism.

Third. The transition from chemistry to biology involves the emergence of coded, symbolic information—a phenomenon without any known precedent in unguided physical systems.

Fourth. The exclusion of intelligent causation from the space of permissible hypotheses is a philosophical choice, not a scientific conclusion.

Fifth. The marginalisation of panspermia reflects a preference for complete explanations over honest ones—a preference that, in any other scientific context, would be recognised as a cognitive bias.

Sixth. The question of life’s origin may involve categories—symbolic coding, normativity, purposiveness—that lie at or beyond the boundary of what the empirical method, by its own design, is equipped to explain. Recognising this is not a retreat from reason. It is an exercise of it.

 

CODA: THE QUESTION THAT WILL NOT DIE

Life is the most extraordinary phenomenon in the known universe. A single E. coli bacterium contains more integrated, functioning information than the entire Encyclopædia Britannica. The human genome is a library of three billion letters that not only encodes the construction of a body of thirty-seven trillion cells but includes the regulatory logic to build that body from a single fertilised egg—a feat of engineering so far beyond human capability that we cannot replicate it, cannot fully comprehend it, and cannot explain how it arose.

To look at this phenomenon and declare, with confidence, that it requires no explanation beyond unguided chemistry—while simultaneously acknowledging that we cannot demonstrate how unguided chemistry could produce it—is to rely on a metaphysical commitment that currently outruns the available evidence. It is, at its core, an act of trust—trust that a naturalistic explanation will eventually be found, despite the absence of any credible candidate after seven decades of search. Such trust may ultimately be vindicated. But it should be recognised for what it is: a philosophical wager, not an empirical conclusion. And it should not be permitted to foreclose hypotheses that the evidence has not excluded.

The honest scientist—the scientist who takes the evidence seriously rather than the sociology of science—must hold the question open. Must admit that the origin of life is not a solved problem, nor a nearly-solved problem, nor a problem whose solution is clearly in view. It is the deepest question in all of natural philosophy, and the deepest questions are not answered by forbidding certain categories of answer.

A century from now, when historians of science look back at the early twenty-first century, they will note with astonishment that an entire generation of researchers was constrained, by the invisible architecture of ideological commitment, from following the evidence wherever it led. They will note that the two hypotheses most consistent with the current state of evidence—that life was seeded from a vaster cosmos, or that it was the product of intelligence—were ruled out of court before the trial began.

But the question will outlast the prohibition. It always does. Somewhere, as you read these words, a ribosome—a structure older than the Atlantic Ocean, older than the continents, older than the rocks beneath your feet—is reading a sequence of nucleotide triplets in a bacterium on your skin, translating a code that has been copied without interruption for four billion years, assembling a protein that will fold into a shape that no chemist can predict from first principles, in order to maintain a system that fights, with quiet and extraordinary ingenuity, against the thermodynamic dissolution that should have claimed it long ago. No one knows who wrote that code. No one knows how it was written. And no one, despite seventy years of trying, has come close to writing it again.

The mystery is not diminishing. It is deepening. And it deserves better than an answer chosen in advance.

Principal Sources Linked in Text

This essay draws on peer-reviewed literature linked throughout the text, including: Moody et al., “The nature of the last universal common ancestor” (Nature Ecology & Evolution, 2024); Bernhardt, “The RNA world hypothesis: the worst theory… except for all the others” (Biology Direct, 2012); Kipping, “Strong evidence that abiogenesis is a rapid process” (Astrobiology, 2025; arXiv:2504.05993); Sojo et al., “The origin of life in alkaline hydrothermal vents” (Astrobiology, 2016); Michaelian, “Thermodynamic dissipation theory for the origin of life” (Earth System Dynamics, 2011); Becker et al. on prebiotic chemistry constraints (Chemical Reviews, 2022); Catling & Zahnle on early atmospheric evolution; Lyons et al. on Earth’s oxygenation; and Sutherland on UV and prebiotic photochemistry.

— Robert Nogacki, Warsaw, 2026