¸ŁŔűĽ§

20 May 2026

Sociologists survey the landscape of computational text analysis, showing how methods ranging from topic modeling and word embeddings to large language models can serve as rigorous tools for both thick description and causal inference in the social sciences. 

A table that has some words on it.

Large corpora of text serve as social sensors, capturing what people discuss at a scale and richness that neither qualitative nor survey research designs can match—and that are suited for studying social dynamics and collective phenomena as they unfold in interactive social contexts. Sociological research, the authors argue, is transitioning from using text analysis as an end in itself to treating it as a means of advancing social theory

Treating text-analytic methods as measurement strategies that extract sociologically relevant information from unstructured language data, the authors organize approaches into data-first, theory-first, and theory–data integration paradigms, showing how each balances inductive discovery with theoretical specification. Data-first approaches emphasize openness to discovery, surfacing latent features that are then interpreted through a theoretical lens. Theory-first approaches require researchers to specify in advance which linguistic features or categories to measure, lending themselves to deductive theory testing. Theory–data integration approaches occupy the middle ground, combining theory-driven modeling with openness to unexpected patterns. While the rise of large language models has democratized text analysis, enabling researchers to deploy the same foundation models across all three paradigms, the authors caution that challenges around reproducibility, bias, and prompt sensitivity require researchers to remain anchored in theoretically grounded measurement.

The article demonstrates how computational text analysis expands sociological inquiry along two dimensions. First, these methods enable thick description at scale by combining pattern recognition across large corpora with interpretive close reading, and complementing approaches such as grounded theory, abductive analysis, and forensic social science. Second, the article examines the growing incorporation of text data into causal inference, reviewing strategies for using text as outcome, treatment, or confounder, while highlighting persistent methodological challenges stemming from the high dimensionality of text and the iterative nature of text modeling.

Computational text analysis is, in the authors' view, not merely a technical innovation but a substantive expansion of what sociology can measure, model, and interpret. By bridging quantitative scale with qualitative depth, computational text analysis has the potential to revitalize interpretive approaches in sociology and open new avenues for studying phenomena that have long resisted systematic empirical inquiry.


Read or download the article

Hurtado Bodell, M., Keuschnigg, M., Macanovic, A., and Menshikova, A. (2026). Computational Text Analysis for Building and Testing Social Theory. Kölner Zeitschrift für Soziologie und Sozialpsychologie.

Contact

More about Computational Text Analysis

Latest news from LiU

A view of a space shuttle flying over the audience in the dome.

Anyone can land on the moon with a new immersive film

The film Once Upon the Moon allows the audience to experience the moon landings as if they were there themselves. Authentic footage, astronauts’ own stories and the latest visualisation technology make this possible.

En kopp som stĂĄr pĂĄ ett bord.

LEAD appointed as Swedish accelerator for NATO DIANA

LEAD, an innovation incubator based in Östergötland, has been appointed Swedish accelerator for NATO DIANA – NATO’s innovation programme for the development of new technologies with both civilian and military applications.

A woman standing in front of a laptop computer.

LiU educates elected representatives on AI and societal impact

LiU is launching a learning platform that gathers research-based knowledge about the impact of artificial intelligence and digitalisation on our democracy. The aim is to equip politicians for a new reality where AI is an integral part of society.