For nearly a century, cosmology has rested on a peculiar bargain: general relativity works beautifully at describing the geometry of spacetime, but only if we accept that roughly 95% of the universe's energy content is composed of substances we have never directly detected. Dark matter and dark energy are not observed entities so much as inferred necessities—placeholders inserted to reconcile Einstein's equations with what telescopes actually reveal. The arrangement is elegant in its predictive power and deeply uncomfortable in its ontological implications.
But what if the problem lies not with the inventory of the cosmos, but with the theory we use to take that inventory? Modified gravity theories propose exactly this inversion. Rather than populating the universe with invisible scaffolding, they ask whether the law of gravitation itself changes character at galactic and cosmological scales—regimes far removed from the solar system experiments where general relativity was first confirmed. It is a radical hypothesis, but one with a respectable intellectual pedigree stretching back to the early 1980s.
The stakes are considerable. If any modified gravity framework succeeds, it would not merely eliminate dark matter or dark energy from the cosmic ledger; it would fundamentally rewrite our understanding of what gravity is and how spacetime responds to matter and energy at the largest scales. If these frameworks fail—and many already have in specific domains—the manner of their failure teaches us something equally profound about why general relativity, supplemented by dark components, has proven so stubbornly resilient. Either way, the exercise sharpens our picture of reality.
MOND and Its Extensions
Modified Newtonian Dynamics, or MOND, began in 1983 as Mordehai Milgrom's deceptively simple proposal: below a critical acceleration threshold of approximately 1.2 × 10⁻¹⁰ m/s², the effective gravitational force transitions from the familiar Newtonian inverse-square law to a different regime where gravitational attraction declines as 1/r rather than 1/r². This single empirical parameter—the acceleration scale a₀—generates flat rotation curves in spiral galaxies without invoking any dark matter halo. The success is not merely qualitative; MOND predicts the detailed shape of rotation curves from the observed baryonic mass distribution alone, a feat that cold dark matter models achieve only through halo fitting with additional free parameters.
The deeper surprise is the baryonic Tully-Fisher relation. MOND predicts a tight, scatter-free correlation between a galaxy's total baryonic mass and the fourth power of its asymptotic rotational velocity. Observationally, this relation holds with remarkable precision across several orders of magnitude in galaxy mass. In the standard ΛCDM framework, this tightness is unexpected—it emerges only through a fine-tuned interplay between baryonic physics and dark matter halo properties that has no obvious fundamental explanation. MOND renders it a direct consequence of the gravitational law itself.
Yet MOND's triumphs at the galactic scale give way to significant difficulties elsewhere. Galaxy clusters remain problematic: MOND reduces but does not eliminate the mass discrepancy in clusters, still requiring some form of unseen matter—perhaps massive neutrinos or residual baryonic dark matter. More critically, the original non-relativistic formulation cannot address gravitational lensing, cosmological structure formation, or the cosmic microwave background power spectrum, phenomena where ΛCDM excels with quantitative precision.
Relativistic extensions have attempted to bridge this gap. Jacob Bekenstein's Tensor-Vector-Scalar gravity (TeVeS), introduced in 2004, embedded MOND-like phenomenology within a covariant framework involving additional dynamical fields—a scalar field and a timelike vector field alongside the metric tensor. TeVeS could produce gravitational lensing consistent with some observations, but it struggled to reproduce the acoustic peaks of the CMB without supplementary dark matter. More recent incarnations, including extensions by Skordis and Złośnik (dubbed RMOND), have made remarkable progress, constructing relativistic MOND theories that can fit the full CMB power spectrum and the matter power spectrum simultaneously—though at the cost of introducing field content that begins to resemble, in complexity if not identity, the dark sector it sought to replace.
The philosophical tension is stark. MOND's galactic-scale predictions remain its strongest empirical calling card, and no dark matter simulation has convincingly explained why the acceleration scale a₀ appears so universally in galactic dynamics. But the theory's inability to stand alone at cosmological scales—without either new fields or residual dark matter—raises a persistent question: is MOND revealing a genuine modification of gravity, or is it an effective description of emergent behavior within a more conventional dark matter framework that we have not yet fully understood?
TakeawayA theory's greatest strength can also be its most confining boundary. MOND's uncanny success at galactic scales is precisely what makes its cosmological failures so instructive—sometimes a pattern that works perfectly in one regime is not a fundamental law, but a clue pointing toward something deeper.
f(R) Gravity and Beyond
General relativity derives from the Einstein-Hilbert action, where the gravitational Lagrangian density is simply the Ricci scalar R—a measure of spacetime curvature. The most natural theoretical generalization is to replace R with an arbitrary function f(R), allowing higher-order curvature terms to modify gravitational dynamics. This is not an ad hoc move; it reflects the expectation from quantum gravity that the Einstein-Hilbert action is merely the leading term in an effective field theory expansion, with corrections becoming significant at extreme curvatures or very large scales.
The cosmological appeal of f(R) gravity is immediate. Certain functional forms—particularly the Starobinsky model, where f(R) = R + αR²—naturally produce accelerated expansion in both the early universe (inflation) and the late universe (mimicking dark energy), without requiring a cosmological constant or scalar inflaton field. The additional degree of freedom in f(R) theories manifests as a scalaron—an effective scalar particle mediating a fifth force. The Starobinsky model remains one of the best fits to Planck satellite observations of primordial perturbations, a remarkable achievement for a theory proposed in 1980, before inflation was even widely accepted.
However, the scalaron introduces a fundamental tension. A fifth force operating at cosmological scales would, if unchecked, produce detectable deviations from general relativity within the solar system and in laboratory experiments—deviations that are not observed. Viable f(R) models must therefore incorporate a chameleon mechanism or similar screening effect, whereby the scalaron acquires a large effective mass in high-density environments, suppressing the fifth force locally while allowing it to operate freely in the low-density cosmic web. Models like the Hu-Sawicki f(R) gravity achieve this, but the screening requirement severely constrains the parameter space, limiting the observable cosmological signatures.
Beyond f(R), the theoretical landscape broadens considerably. Scalar-tensor theories—where a scalar field is explicitly coupled to gravity—encompass Brans-Dicke theory, Horndeski gravity (the most general scalar-tensor theory with second-order equations of motion), and beyond-Horndeski extensions. These frameworks can reproduce virtually any expansion history, which is both their power and their weakness: with enough freedom, fitting data becomes trivial, and the theories risk becoming unfalsifiable unless specific predictions for growth of structure or gravitational wave propagation are extracted and tested. The detection of GW170817 and its electromagnetic counterpart GRB 170817A already delivered a devastating blow to many of these models by establishing that gravitational waves travel at the speed of light to extraordinary precision, eliminating large classes of Horndeski and beyond-Horndeski theories in a single observation.
What remains is a highly constrained but non-trivial subspace of modified gravity theories. f(R) models with chameleon screening, certain shift-symmetric Horndeski theories, and specific cuscuton-like constructions survive current observational bounds. But survival is not triumph. These theories must now confront an increasingly precise observational program—from the Euclid satellite's weak lensing surveys to next-generation CMB experiments—that will measure the growth rate of cosmic structure with percent-level accuracy, precisely the regime where surviving modified gravity models predict subtle but measurable deviations from ΛCDM.
TakeawayThe freedom to modify a fundamental theory is not the same as the freedom to predict anything you want. Each additional degree of freedom must survive not only cosmological tests but local ones—and the universe has proven remarkably efficient at closing loopholes.
Testing Gravity at Cosmic Scales
The decisive question is no longer whether modified gravity theories exist that can mimic dark matter or dark energy—they clearly do—but whether observations can distinguish them from the standard ΛCDM paradigm. The critical insight is that while modified gravity models can often be tuned to reproduce the same expansion history H(z) as ΛCDM, they generically predict different growth histories for cosmic structure. In general relativity with dark energy, the growth rate of matter perturbations is uniquely determined by the expansion rate. In modified gravity, the relationship between expansion and growth is broken, because the modified gravitational coupling alters how matter clumps under its own attraction.
This decoupling is observationally accessible. Redshift-space distortions in galaxy surveys measure the quantity fσ₈—the product of the linear growth rate and the amplitude of matter fluctuations—directly probing how fast structure assembles at different epochs. Current measurements from BOSS, eBOSS, and the Dark Energy Spectroscopic Instrument (DESI) are beginning to reach the precision needed to detect the few-percent-level deviations that viable f(R) or Horndeski models predict. Simultaneously, weak gravitational lensing offers a complementary probe: it measures the sum of the two metric potentials (Φ + Ψ), while galaxy dynamics measure only one (Ψ). In general relativity, Φ = Ψ in the absence of anisotropic stress. Modified gravity theories generically introduce a slip between these potentials—a gravitational slip parameter η ≠ 1—that lensing and dynamical mass estimates can jointly constrain.
Gravitational wave astronomy has opened an entirely orthogonal testing ground. The propagation speed of gravitational waves, their damping rate over cosmological distances, and the presence of additional polarization modes all carry signatures of modified gravity. The GW170817/GRB 170817A coincidence established |c_gw/c − 1| < 10⁻¹⁵, immediately ruling out theories where the gravitational wave speed deviates from light speed at low redshifts. Future detections from LISA and third-generation ground-based detectors will extend these constraints to higher redshifts and probe the gravitational wave luminosity distance—which, in theories with extra dimensions or running Planck mass, can differ from the electromagnetic luminosity distance, providing a clean, model-independent test.
Perhaps the most powerful near-term discriminant will come from combined analyses. The Euclid satellite, the Vera C. Rubin Observatory's LSST, and CMB-S4 will simultaneously constrain the expansion history, the growth rate of perturbations, the lensing potential, and the matter power spectrum across a wide range of scales and redshifts. When these datasets are analyzed jointly within a parameterized framework—typically using the (μ, Σ) parameterization, where μ captures modifications to the Poisson equation and Σ captures modifications to lensing—any deviation from the general relativistic values μ = Σ = 1 would constitute direct evidence for modified gravity.
The observational program is, in a sense, theory-agnostic. It does not test MOND or f(R) gravity individually; it tests general relativity itself as the correct description of gravity at cosmic scales. If GR passes these tests at percent-level precision across multiple independent probes, the case for modified gravity as an alternative to the dark sector will become extraordinarily difficult to sustain. If it fails—if the data reveal a consistent, correlated pattern of deviations—we will confront one of the most consequential revisions in the history of physics: the possibility that gravity, as Einstein described it, is incomplete.
TakeawayThe most powerful test of a theory is not whether it can explain what we already see, but whether independent observations, designed to probe different aspects of the same physics, converge on the same answer. Consistency across probes is the signature of truth; discrepancy is the signature of discovery.
Modified gravity theories occupy a singular position in modern cosmology: they are simultaneously the most radical challenge to the standard paradigm and, through the precision of their failures, among the strongest indirect confirmations of ΛCDM's internal consistency. MOND reveals patterns in galactic dynamics that dark matter models struggle to explain naturally. f(R) gravity and its extensions demonstrate that alternative gravitational frameworks can reproduce cosmic expansion—but only within increasingly narrow observational corridors.
The coming decade will likely be decisive. Euclid, DESI, Rubin, LISA, and CMB-S4 will collectively probe gravity across scales and epochs with unprecedented precision. The question is no longer philosophical but empirical: does gravity behave as Einstein prescribed at every scale the universe presents?
If it does, modified gravity will have served as the most productive null hypothesis in cosmological history—sharpening our understanding of exactly what general relativity predicts and why it works. If it doesn't, the cracks will not merely modify a theory. They will redefine what we mean by gravity itself.