Few development interventions enjoy the intuitive appeal of sanitation. The causal chain seems self-evident: latrines reduce open defecation, which reduces fecal-oral disease transmission, which reduces diarrhea, stunting, and child mortality. Billions of dollars in development finance, from the Millennium Development Goals through Sustainable Development Goal 6, have flowed on the strength of this logic.

Yet over the past fifteen years, a series of rigorous randomized controlled trials has delivered an uncomfortable verdict. Large-scale sanitation programs, even when they successfully increase latrine coverage, have repeatedly failed to produce the expected reductions in child diarrhea, enteric infection, or linear growth. The disconnect between coverage gains and health outcomes constitutes one of the more consequential evidence puzzles in contemporary development economics.

This gap deserves careful examination, not because sanitation lacks intrinsic value, but because policy design built on weak causal evidence wastes scarce resources and crowds out interventions with stronger empirical foundations. Understanding why sanitation RCTs have underperformed expectations requires attention to behavioral economics, externalities in disease transmission, and the methodological choices that shape what trials can detect. The emerging revised theories of change suggest that achieving health impact from sanitation may require thresholds and complementarities far more demanding than universal coverage goals acknowledge.

WASH RCT Findings: Coverage Without Consequence

The WASH Benefits trials in Bangladesh and Kenya, alongside the SHINE trial in Zimbabwe, represent the most methodologically rigorous attempts to date to estimate causal effects of sanitation interventions on child health. Their combined sample sizes exceed 15,000 households, their measurement protocols are exacting, and their results converge on a finding the sector did not expect.

Across these trials, sanitation arms produced negligible effects on linear growth, the primary endpoint chosen precisely because chronic enteric exposure is hypothesized to drive environmental enteric dysfunction and stunting. Diarrhea reductions, where detected, were modest and often statistically fragile. Combined WASH packages performed no better than individual components, undermining the synergy hypothesis that justified bundled programming.

Earlier observational evidence had suggested far larger effects, sometimes attributing 30 to 40 percent reductions in child mortality to improved sanitation. The RCT findings expose how much of that estimated impact reflected confounding by income, maternal education, and unobserved household characteristics rather than the causal contribution of latrines themselves.

Critics have raised legitimate questions about trial design. Coverage increases in intervention arms were sometimes modest, compliance imperfect, and follow-up periods potentially too short to capture growth trajectories. Yet sensitivity analyses and per-protocol estimates have largely failed to recover the anticipated effects, suggesting the null results reflect substantive reality rather than statistical artifact.

The implication is sobering. A sector that justified expansion on the strength of expected health returns must now reckon with evidence that those returns, at least at the scale and intensity of programs evaluated, are difficult to demonstrate empirically.

Takeaway

When rigorous evidence contradicts intuitive causal chains, the appropriate response is not to dismiss the evidence but to interrogate the assumptions embedded in the chain itself.

Behavioral Challenges: The Community Threshold Problem

The null findings become more interpretable once we take seriously the epidemiology of fecal-oral transmission. Pathogens do not respect household boundaries. A child living in a household with a pristine latrine remains exposed to fecal contamination from neighbors, livestock, surface water, and shared environmental reservoirs. Sanitation generates health benefits primarily through community-level externalities, not private household consumption.

This means coverage thresholds matter enormously. Models calibrated to recent trial data suggest that meaningful reductions in enteric pathogen exposure may require neighborhood coverage approaching 80 or 90 percent, combined with consistent use. Programs that raise village coverage from 30 to 60 percent, while substantial achievements administratively, may fail to cross the epidemiological threshold at which transmission dynamics change.

Sustained behavior change compounds the problem. Latrine construction is a one-time event measurable through asset surveys. Latrine use is a continuous behavioral choice influenced by privacy preferences, social norms, maintenance burden, and the persistent appeal of open defecation in many contexts. Trials consistently find that constructed facilities go unused, or are used inconsistently, particularly by men and older children whose excreta carry significant pathogen loads.

Community-Led Total Sanitation approaches were designed precisely to address these collective action and norm-shifting dimensions, with mixed empirical success. Where CLTS has worked, it has often relied on intensive facilitation and social pressure mechanisms that prove difficult to scale through government bureaucracies focused on construction subsidies and coverage targets.

The behavioral economics here is unforgiving. Sanitation produces positive externalities that individuals incompletely internalize, requires coordinated adoption to deliver health benefits, and demands sustained behavior change against ingrained preferences. Few development challenges combine these features so completely.

Takeaway

Interventions that generate health benefits through externalities cannot be evaluated on private household uptake alone; the unit of meaningful change is the community, not the household.

Revised Theories of Change: From Coverage to Containment

The empirical disappointments are reshaping how development economists conceptualize sanitation impact pathways. The simple coverage-to-health model is giving way to more demanding frameworks that specify the conditions under which sanitation can plausibly deliver health gains.

Emerging theories of change emphasize fecal sludge containment across the entire sanitation service chain, not just point-of-defecation facilities. A latrine that discharges into a leaky pit contaminating shallow groundwater may register as coverage but generate negligible epidemiological benefit. Safely managed sanitation, as defined under SDG 6.2, requires attention to containment, emptying, transport, and treatment, dimensions largely ignored in earlier programming.

Animal reservoirs are receiving overdue attention. Recent evidence suggests that livestock feces, particularly poultry, contribute substantially to child enteric exposure in many low-income settings. Sanitation interventions focused exclusively on human waste may address only a fraction of the pathogen load children actually encounter, explaining why even high-quality latrine adoption produces limited growth effects.

Some researchers now argue for transformative WASH packages that combine high coverage, animal feces management, food hygiene, and infant feeding interventions. Such packages may be necessary to achieve health impact but raise difficult questions about cost-effectiveness and implementation feasibility at scale. The complexity required for impact may exceed what most development programs can deliver.

Other scholars draw a more radical conclusion: sanitation should be financed and justified on grounds other than child health, including dignity, gender equity, environmental protection, and economic productivity. These benefits are real and important, but justifying multi-billion-dollar programs requires honesty about which outcomes the evidence supports and which it does not.

Takeaway

When the evidence base for a primary justification weakens, intellectual honesty demands either strengthening the intervention to meet that justification or selecting different justifications the evidence can support.

The sanitation evidence gap is not a critique of sanitation as a development priority. Dignity, privacy, and environmental quality are sufficient justifications for substantial investment. The gap is rather a critique of how the sector has marketed itself, claiming health returns that rigorous evaluation has struggled to confirm.

For development economists and program designers, the lesson extends beyond WASH. Interventions justified by intuitively compelling causal chains deserve the same evidentiary scrutiny as those whose mechanisms are less obvious. Confounded observational estimates can generate decades of confident expansion before randomized evidence reveals how much of the apparent impact was selection effects in disguise.

The path forward likely involves more ambitious, integrated programs designed to cross epidemiological thresholds, paired with humility about which outcomes any given intervention can credibly deliver. Better evidence does not always produce easier answers, but it produces more honest ones, and honesty is the foundation on which durable development policy must be built.