The promise is seductive: feed thousands of documents into a computer, run sophisticated algorithms, and watch patterns emerge that no individual scholar could detect through traditional reading alone. Digital humanities projects now routinely analyze millions of sources, mapping networks, tracking discourse shifts, and quantifying what historians once could only describe impressionistically. The scale is genuinely transformative.
But here's the methodological tension that animates contemporary historical practice: how do we preserve the interpretive richness of humanistic inquiry while harnessing computational power? The sources historians care about most—diaries, correspondence, bureaucratic records, oral histories—resist easy quantification. Their meaning often lies in ambiguity, in what's left unsaid, in the gap between official language and lived experience.
The emerging answer isn't choosing between computational and traditional methods but learning to move fluidly between them. Mixed methods approaches are reshaping how historians work with contemporary sources, demanding new skills and sparking fierce debates about what counts as rigorous historical knowledge. The stakes extend beyond methodology to fundamental questions about what historical understanding actually is.
Coding as Interpretation: The Hidden Choices in Data Preparation
Before any algorithm runs, someone must decide how to categorize sources. This preparatory work—coding qualitative data for computational analysis—is where interpretive choices get locked into datasets. Every tagging schema embeds assumptions. When you code a letter as expressing "dissent" or "compliance," you've already made analytical decisions that will shape whatever patterns the computer subsequently finds.
Consider a project analyzing Cold War diplomatic correspondence. Researchers must decide: Does irony count as criticism? How do you categorize hedged statements? What about silence on expected topics? These aren't technical questions with technical answers—they're interpretive judgments that require deep contextual knowledge. The coding scheme becomes a theory of meaning, usually implicit, rarely scrutinized in final publications.
The challenge intensifies with scale. Large coding projects typically involve multiple researchers whose individual interpretive tendencies must somehow be harmonized. Inter-coder reliability metrics—borrowed from social science—attempt to measure consistency, but high reliability scores can mask shared biases rather than ensuring validity. A team might consistently misread a source type they all equally misunderstand.
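As a concrete illustration, one widely used such metric is Cohen's kappa, which corrects raw agreement between two coders for chance. The minimal sketch below uses hypothetical codings under a "dissent"/"compliance" scheme; note that a high kappa only shows the coders agree with each other, not that their shared reading is valid.

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa: agreement between two coders, corrected for chance."""
    assert len(labels_a) == len(labels_b)
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    # Expected agreement if each coder labeled at random with their own base rates.
    expected = sum(freq_a[c] * freq_b[c] for c in set(labels_a) | set(labels_b)) / n**2
    return (observed - expected) / (1 - expected)

# Hypothetical codings of ten letters under a "dissent"/"compliance" scheme.
coder_1 = ["dissent", "compliance", "dissent", "dissent", "compliance",
           "compliance", "dissent", "compliance", "dissent", "compliance"]
coder_2 = ["dissent", "compliance", "dissent", "compliance", "compliance",
           "compliance", "dissent", "compliance", "dissent", "compliance"]

print(round(cohens_kappa(coder_1, coder_2), 2))  # prints 0.8: high agreement
# A kappa near 1.0 shows only that the coders agree with each other --
# it cannot show whether both are, say, misreading irony as compliance.
```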
Some historians respond by embracing interpretive coding's subjectivity explicitly, documenting their decision-making processes and acknowledging their schemas as arguments rather than neutral instruments. This approach treats the codebook itself as a historical interpretation requiring justification and critique. The computer doesn't discover patterns—it operationalizes the historian's interpretive framework at scale.
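One way to make the codebook's status as an argument explicit is to keep it as a documented artifact within the project itself, with decision rules, rationales, and a changelog recorded next to the category labels. A sketch, with hypothetical categories and rules:

```python
# A codebook treated as an explicit, citable artifact rather than a neutral instrument.
# Categories, rules, and rationales below are hypothetical illustrations.
CODEBOOK = {
    "version": "2.1",
    "categories": {
        "dissent": {
            "rule": "Explicit disagreement with official policy, including ironic praise "
                    "judged insincere from context.",
            "rationale": "Treating irony as dissent follows our reading of the corpus's "
                         "rhetorical conventions; other readers may reasonably disagree.",
        },
        "compliance": {
            "rule": "Affirmation of policy without hedging or qualification.",
            "rationale": "Hedged affirmations are coded separately to avoid inflating compliance.",
        },
        "silence": {
            "rule": "No mention of a topic the document type normally addresses.",
            "rationale": "Absence is coded only where an expectation of comment can be documented.",
        },
    },
    "changelog": [
        "2.0 -> 2.1: split 'hedged compliance' out of 'compliance' after pilot coding.",
    ],
}
```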
Others argue this transparency isn't enough. If coding decisions fundamentally shape results, then computational findings tell us more about researcher assumptions than about historical reality. The counterargument: traditional close reading involves equally consequential interpretive choices, just less visible ones. Mixed methods at least force those choices into the open.
Takeaway: Every computational analysis of historical sources embeds interpretive choices in its categorization schemes—the question isn't whether to make such choices but whether to acknowledge them.
Scale-Shifting Strategies: Between Distant and Close Reading
With his concept of "distant reading," Franco Moretti proposed that scholars could understand literary systems by computationally analyzing thousands of texts without reading most of them individually. The approach promised to escape the tyranny of the canon—the few texts traditionally deemed worth close attention. For historians working with massive contemporary archives, similar logic applies: computational methods might reveal structures invisible from within any single document.
But the most sophisticated mixed methods projects don't simply choose between scales—they develop systematic protocols for moving between them. A typical workflow might begin with computational analysis identifying clusters or anomalies across thousands of sources, then zoom into representative samples for intensive reading, then return to the full corpus to test interpretations developed through close engagement.
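One concrete version of that workflow is sketched below, assuming a recent scikit-learn and a `documents` list of plain-text strings: cluster the full corpus, pull each cluster's most central documents as a close-reading sample, then test a hunch formed in that reading back against everything. The cluster count, sample size, and helper names are arbitrary placeholders, not a prescribed pipeline.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

def cluster_and_sample(documents, n_clusters=8, n_samples=5):
    """Cluster the corpus, then pick each cluster's most central documents for close reading."""
    vectorizer = TfidfVectorizer(stop_words="english", max_features=20_000)
    X = vectorizer.fit_transform(documents)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(X)
    reading_samples = {}
    for k in range(n_clusters):
        members = np.where(km.labels_ == k)[0]
        # Distance of each member document to its cluster centre; the nearest
        # documents become that cluster's close-reading sample.
        dists = np.linalg.norm(X[members].toarray() - km.cluster_centers_[k], axis=1)
        reading_samples[k] = members[np.argsort(dists)][:n_samples].tolist()
    return km.labels_, reading_samples

def share_matching(documents, phrase):
    """Return to the full corpus: what share of it matches a hunch formed in close reading?"""
    return sum(phrase.lower() in doc.lower() for doc in documents) / len(documents)
```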
The practical challenge is preventing each scale from simply confirming what the other found. Historians risk cherry-picking close reading examples that support computational findings, or running computational analyses designed to validate interpretations already formed through traditional methods. Genuine mixed methods require allowing each approach to challenge the other—letting distant reading defamiliarize assumptions, and close reading complicate algorithmic patterns.
Some historians formalize this through "surprise seeking" protocols: explicitly looking for cases where computational and close reading methods produce different results, then investigating those discrepancies. A topic model might cluster documents together that close reading reveals as substantively different, or separate documents that share important features the algorithm couldn't detect. These failures become the most analytically productive moments.
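A surprise-seeking check can be as simple as comparing the model's cluster assignments with categories assigned through close reading and listing the pairs where the two disagree. A minimal sketch, with hypothetical document IDs and labels:

```python
from itertools import combinations

def find_surprises(doc_ids, cluster_of, hand_label_of):
    """Flag pairs of documents where the algorithm and the close reader disagree.

    cluster_of:    doc_id -> cluster assigned by the model (e.g. a topic model)
    hand_label_of: doc_id -> category assigned through close reading
    """
    lumped, split = [], []
    for a, b in combinations(doc_ids, 2):
        same_cluster = cluster_of[a] == cluster_of[b]
        same_label = hand_label_of[a] == hand_label_of[b]
        if same_cluster and not same_label:
            lumped.append((a, b))   # model lumps together what reading separates
        elif same_label and not same_cluster:
            split.append((a, b))    # model splits what reading treats as alike
    return lumped, split

# Hypothetical assignments for five documents.
docs = ["d1", "d2", "d3", "d4", "d5"]
clusters = {"d1": 0, "d2": 0, "d3": 1, "d4": 1, "d5": 0}
readings = {"d1": "protest", "d2": "petition", "d3": "protest", "d4": "protest", "d5": "protest"}

lumped, split = find_surprises(docs, clusters, readings)
# `lumped` and `split` are the discrepancies worth investigating first.
```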
The temporal dimension adds complexity for contemporary historians. Digital sources often carry extensive metadata—timestamps, edit histories, network traces—enabling computational analysis of how documents changed over time and how they circulated. Close reading can then focus on moments of revision or transmission that computational analysis flagged as significant, asking why changes occurred when they did.
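As an illustration, here is a sketch of the kind of metadata pass that might flag such moments: given the timestamps of a document's edit history, it surfaces windows where revisions cluster unusually tightly. The thresholds and the sample history are hypothetical.

```python
from datetime import datetime, timedelta

def flag_revision_bursts(timestamps, window=timedelta(hours=1), min_edits=3):
    """Flag moments where revisions cluster unusually tightly in time.

    timestamps: datetimes drawn from a document's edit history.
    Returns the start times of windows containing at least `min_edits` edits.
    """
    ts = sorted(timestamps)
    bursts = []
    for i, start in enumerate(ts):
        in_window = [t for t in ts[i:] if t - start <= window]
        if len(in_window) >= min_edits and (not bursts or start - bursts[-1] > window):
            bursts.append(start)
    return bursts

# Hypothetical edit history: a quiet document suddenly revised repeatedly one evening.
history = [
    datetime(2021, 3, 1, 9, 0),
    datetime(2021, 6, 14, 22, 10),
    datetime(2021, 6, 14, 22, 25),
    datetime(2021, 6, 14, 22, 40),
    datetime(2022, 1, 8, 15, 5),
]
print(flag_revision_bursts(history))
# The burst on 2021-06-14 is the kind of moment close reading would then examine.
```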
Takeaway: The power of mixed methods lies not in choosing a scale of analysis but in developing rigorous protocols for moving between computational pattern-detection and interpretive close reading—especially when they disagree.
Reproducibility Standards: Scientific Norms Meet Historical Practice
In the natural and social sciences, reproducibility serves as a crucial quality control mechanism: other researchers should be able to follow your methods and reach similar conclusions. Digital humanities projects increasingly face demands to meet similar standards—to share datasets, document code, and enable replication. The impulse seems democratizing: open methods, verifiable claims, cumulative knowledge-building.
Yet historians have pushed back. Historical interpretation isn't supposed to be reproducible in the scientific sense. Two equally competent historians examining identical sources will—and should—produce different interpretations reflecting their distinct perspectives, questions, and analytical frameworks. This isn't a bug; it's how humanistic knowledge works. Imposing reproducibility standards might privilege the computational components of mixed methods projects while devaluing the interpretive work that gives those computations meaning.
The debate reveals deeper tensions about what digital history aspires to be. Some practitioners see computational methods as finally putting historical claims on firmer empirical footing—not quite scientific, but more rigorous than impressionistic reading. Others view computation as simply another interpretive tool, no more requiring reproducibility than close reading requires other scholars to have identical reactions to texts.
A middle position is emerging around the concept of "transparency" rather than "reproducibility." Mixed methods historians might document their procedures thoroughly—sharing code, explaining coding decisions, identifying sample texts—without claiming that following those procedures would necessarily produce identical interpretations. The goal shifts from replication to evaluation: giving readers enough information to assess the quality of interpretive choices, even if they might choose differently.
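One way such transparency might be operationalized is a machine-readable methods manifest published alongside the argument. The sketch below is illustrative only; every field name and value in it is a hypothetical placeholder rather than an established standard.

```python
import json
import platform
import sys

# A minimal sketch of a "transparency manifest" shipped with a publication:
# enough for readers to evaluate the interpretive choices, with no claim that
# rerunning the pipeline would yield identical interpretations.
# All values here are hypothetical placeholders.
manifest = {
    "corpus": "Diplomatic correspondence, 1961-1968 (access-restricted; see data availability statement)",
    "codebook_version": "2.1",
    "sampling": {"method": "nearest-to-centroid per cluster", "random_seed": 42},
    "sample_texts_read_closely": ["cable_0193", "cable_0877", "cable_1204"],
    "software": {"python": sys.version.split()[0], "platform": platform.platform()},
    "known_limitations": [
        "Irony coded as dissent only where context made insincerity legible to us.",
    ],
}

with open("methods_manifest.json", "w") as f:
    json.dump(manifest, f, indent=2)
```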
For contemporary historians, these debates carry special urgency. Digital sources often can't be shared due to copyright, privacy concerns, or platform terms of service. Computational analyses of Twitter archives, government databases, or corporate records may be impossible to replicate because the data itself is legally or ethically inaccessible. Transparency about methods may be the only available substitute for true reproducibility.
Takeaway: The tension between scientific reproducibility and humanistic interpretation isn't a problem to solve but a productive friction that forces digital historians to articulate what kind of knowledge they're actually producing.
Mixed methods in contemporary history represent neither a rejection of traditional interpretation nor an embrace of computational scientism. They constitute a genuinely new mode of historical practice—one still working out its standards, its training requirements, and its relationship to older disciplinary norms.
The historians who will shape this emerging practice are those who resist easy positions: neither computational evangelists dismissing close reading as obsolete, nor traditionalists treating digital methods as threats to humanistic values. The most productive work happens in the friction between approaches, where each method's limitations become visible through contrast with the other.
What remains constant is the historian's fundamental task: making meaningful arguments about the past that illuminate something worth understanding. The tools are changing. The interpretive responsibility isn't.