Authenticated Encryption: Why Encrypt-Then-MAC Wins

8 min read

Three ways to combine encryption and authentication exist — encrypt-then-MAC, MAC-then-encrypt, and encrypt-and-MAC — but only encrypt-then-MAC generically achieves both CCA-security and ciphertext integrity.

MAC-then-encrypt forces decryption before authentication verification, creating structural side channels that enabled real-world attacks like Lucky Thirteen and BEAST against TLS.

Modern AEAD modes like AES-GCM and ChaCha20-Poly1305 internalize the encrypt-then-MAC principle, packaging correct composition into single-primitive constructions with formal security proofs.

Nonce reuse in GCM catastrophically exposes the GHASH authentication key through polynomial factorization, destroying both confidentiality and integrity from a single repeated value.

Nonce-misuse resistant constructions like AES-GCM-SIV use synthetic initialization vectors derived from plaintext to guarantee graceful degradation when nonce uniqueness fails.

Encryption provides confidentiality. A message authentication code provides integrity. Combining the two should be entirely mechanical — apply both primitives to your data and transmit the result. But the precise order in which you compose encryption and authentication carries deep, non-obvious security implications that remain invisible until you attempt a formal reduction proof. This is one of those domains where cryptographic intuition reliably misleads, and only rigorous mathematical analysis reveals what is and what isn't actually secure.

Three natural composition strategies present themselves: encrypt-then-MAC, MAC-then-encrypt, and encrypt-and-MAC. Each applies the same pair of underlying primitives. Each produces a ciphertext bundled with an authentication tag. Yet only one of these compositions generically preserves the full security guarantees of both underlying schemes. The other two introduce vulnerabilities subtle enough to survive years of real-world deployment — vulnerabilities that eventually enabled devastating attacks against TLS, SSH, and IPsec.

Modern authenticated encryption with associated data modes internalize these hard-won composition lessons, packaging confidentiality and integrity into single-primitive constructions governed by formal security definitions. Understanding why encrypt-then-MAC wins the provable security argument — and how widely deployed schemes like AES-GCM and ChaCha20-Poly1305 build on that theoretical foundation — is essential for anyone designing or auditing protocol-level cryptographic security. And when those same AEAD modes encounter nonce misuse, the catastrophic failures that result reveal exactly how precise and fragile the underlying security conditions really are.

Composition Order Matters

Consider three ways to combine a CPA-secure encryption scheme E and a strongly unforgeable MAC M. Encrypt-then-MAC first encrypts the plaintext to produce ciphertext, then computes the authentication tag over that ciphertext. MAC-then-encrypt computes the tag over the plaintext first, then encrypts the plaintext and tag together. Encrypt-and-MAC encrypts the plaintext and independently MACs the plaintext, transmitting both outputs side by side. Same primitives, three orderings, vastly different security outcomes.

The foundational result comes from Bellare and Namprempre's 2000 analysis. Encrypt-then-MAC is the only generic composition that simultaneously achieves IND-CCA2 confidentiality and INT-CTXT ciphertext integrity. The proof proceeds by clean reduction: any efficient adversary breaking the CCA-security of the composed scheme implies an efficient adversary that breaks either the CPA-security of E or the strong unforgeability of M. The MAC computed over ciphertext acts as a gatekeeper — the decryption oracle rejects any adversary-crafted ciphertext with overwhelming probability, completely neutralizing the CCA advantage.

MAC-then-encrypt fails this generic guarantee because decryption must occur before tag verification. The decryptor strips the encryption layer, recovers the plaintext-tag pair, and only then checks validity. This ordering creates observable behavioral differences — timing variations, distinct error codes, padding validation responses — that leak information about decrypted content. The Lucky Thirteen attack against TLS exploited CBC-mode padding checks that occurred prior to MAC verification. BEAST leveraged the same structural weakness in TLS's CBC cipher suites. These were not implementation bugs. They were direct, structural consequences of choosing the wrong composition order.

Encrypt-and-MAC fails for a fundamentally different reason. The MAC operates directly on plaintext, and most practical MAC constructions are deterministic. Identical plaintexts therefore produce identical tags, regardless of any randomness the encryption scheme introduces. This directly violates IND-CPA — an adversary submits two chosen messages to a challenge oracle, observes whether repeated encryptions yield matching tags, and trivially distinguishes which message was encrypted. The original SSH Binary Packet Protocol deployed encrypt-and-MAC composition, embedding this generic theoretical deficiency into one of the most widely used secure transport protocols in existence.

The formal hierarchy is unambiguous. Encrypt-then-MAC achieves both IND-CCA2 and INT-CTXT from generic assumptions about its component primitives. MAC-then-encrypt can achieve IND-CCA2 for specific encryption scheme families with particular structural properties, but not generically. Encrypt-and-MAC fails to achieve even basic IND-CPA in the general case. This is not a spectrum of reasonable engineering tradeoffs. It is a strict, provable ordering where composition choice alone determines whether security guarantees survive or collapse entirely.

Takeaway
In cryptographic composition, the order of operations is a provable security boundary. Encrypt-then-MAC is the only generic composition that preserves both confidentiality and integrity — the alternatives carry structural defects no implementation can fully repair.

AEAD Security Model

Authenticated encryption with associated data formalizes the encrypt-then-MAC principle into a single-primitive abstraction. An AEAD scheme accepts a key, a nonce, a plaintext, and optional associated data, then produces a ciphertext that simultaneously guarantees confidentiality of the plaintext and integrity of both plaintext and associated data. The security definition captures this dual guarantee through a pair of formal games: one for indistinguishability, where the adversary cannot distinguish ciphertexts from random strings, and one for integrity, where the adversary cannot produce any valid ciphertext not generated by the encryption oracle.

The standard AEAD security notion assumes nonce uniqueness — the encryption oracle is never queried twice with the same nonce under the same key. Given this assumption, security reduces to proving that the scheme's output is computationally indistinguishable from random bits and that no efficient adversary can forge a valid ciphertext-tag pair. This nonce-respecting condition is not a minor technicality buried in proof appendices. It is a load-bearing precondition on which every line of the security reduction depends. Violate nonce uniqueness, and the entire formal argument collapses.

AES-GCM exemplifies this model cleanly. It pairs AES in counter mode for keystream generation with GHASH — a universal hash function operating over the finite field GF(2¹²⁸) — for authentication. Counter mode provides CPA-security for confidentiality. GHASH, keyed by the AES encryption of the all-zero block under the master key, evaluates a polynomial over the ciphertext and associated data to produce the authentication tag. The internal structure is encrypt-then-MAC: the CTR-mode ciphertext is what GHASH authenticates. Under nonce uniqueness, GCM's concrete security bound ties to the block cipher's PRP advantage and the total volume of data processed.

ChaCha20-Poly1305 follows the same structural pattern with different underlying primitives. ChaCha20 generates the encryption keystream, and the first 32 bytes of that keystream key Poly1305 — a Carter-Wegman MAC that evaluates over the ciphertext and associated data. The composition is again encrypt-then-MAC by construction. Poly1305's one-time security guarantee holds because ChaCha20 produces a fresh, uniform key for each message, provided the nonce is never reused. The resulting construction meets the same AEAD security definition as GCM, with concrete bounds derived from different underlying assumptions.

Both schemes share the same critical structural commitment: the authentication function processes ciphertext, not plaintext. This is encrypt-then-MAC realized within a single algorithm boundary. The AEAD abstraction does not bypass the composition question — it answers it correctly by construction and removes the opportunity for protocol designers to select the wrong ordering. The interface enforces what the theory requires, making the provably secure choice the only choice available.

Takeaway
AEAD modes don't bypass the composition problem — they solve it by hardcoding encrypt-then-MAC internally. The nonce-uniqueness assumption is not a soft guideline; it is the structural precondition on which every security guarantee depends.

Nonce Misuse Consequences

When GCM's nonce-uniqueness assumption fails, the result is not graceful degradation — it is catastrophic, total security collapse. If two messages are encrypted under the same key and nonce, the CTR-mode keystreams are identical. XORing the two ciphertexts yields the XOR of the two plaintexts directly, destroying confidentiality in the classical two-time pad scenario. But the damage extends far beyond leaked plaintext relationships. The authentication layer collapses completely as well.

GHASH computes authentication tags as polynomial evaluations in GF(2¹²⁸), using a secret authentication key H derived from the block cipher key. When two messages share a nonce, the adversary obtains two tag equations — polynomials in the unknown H with known coefficients derived from the ciphertexts and associated data. Subtracting these equations produces a new polynomial whose roots include the secret H. Factoring over GF(2¹²⁸) recovers H efficiently, typically with high probability from a single nonce collision. With H in hand, the adversary forges valid authentication tags on arbitrary messages, breaking INT-CTXT completely and silently.

This failure mode is not theoretical. Nonce collisions arise in practice through counter wraparound, virtual machine snapshot restoration, faulty random nonce generation in high-volume systems, and multi-device key sharing without coordinated nonce partitioning. Joux demonstrated the GHASH key recovery technique in 2006, and real deployments have been shown vulnerable since. The distance between a fully authenticated channel and one with no integrity protection whatsoever is exactly one repeated nonce value.

AES-GCM-SIV, designed by Gueron, Langley, and Lindell, addresses this through the synthetic initialization vector construction. Instead of using the nonce directly as the CTR-mode IV, GCM-SIV derives the IV by applying POLYVAL — a GHASH variant optimized for little-endian architectures — over the plaintext and associated data, then encrypting the result with AES. The IV becomes a deterministic function of the message content. If a nonce repeats but the messages differ, the derived IVs differ, and confidentiality of both distinct messages is preserved. Reusing a nonce with an identical message reveals only that the same plaintext was encrypted twice.

This defines the nonce-misuse resistance security notion: the scheme degrades gracefully, leaking at most plaintext equality when nonces collide. The engineering cost is real — encryption requires two passes over the data, one for IV derivation and one for CTR-mode encryption, making GCM-SIV modestly slower than standard GCM. The construction is also non-online, requiring the full plaintext before emitting any ciphertext byte. But in any environment where nonce uniqueness cannot be absolutely guaranteed, GCM-SIV's bounded and provable failure mode is categorically superior to GCM's silent, total collapse.

Takeaway
Nonce reuse in GCM does not merely leak plaintext — it algebraically recovers the authentication key, annihilating both confidentiality and integrity from a single repeated value. Where nonce guarantees are uncertain, nonce-misuse resistant constructions contain the damage structurally rather than hoping the precondition holds.

The encrypt-then-MAC principle is not a historical convention or a stylistic preference. It is a provable security boundary, formally established through reduction arguments that have withstood decades of cryptanalytic scrutiny. Every composition that deviates from this ordering carries generic security defects — defects that no amount of careful implementation fully eliminates.

AEAD modes represent the field's definitive engineering response: constructions that internalize the correct composition order and present a clean, formally analyzed interface to protocol designers. But even AEAD schemes rest on preconditions. Nonce uniqueness is the most fragile among them, and its failure in GCM produces total, silent security collapse.

The progression from generic composition analysis to AEAD design to nonce-misuse resistance traces a consistent principle in cryptographic engineering: security guarantees must be structural, not aspirational. When a property is critical, the construction itself must enforce it — not delegate responsibility to the hope that every deployment will perfectly satisfy its preconditions.