Exploiting affinities between topic modeling and the sociological perspective on culture: Application to newspaper coverage of U.S. government arts funding

Paul DiMaggio, Manish Nag, David M. Blei


Structural Topic Modeling For Social Scientists: A Brief Case Study with Social Movement Studies Literature, 2005–2017

Sociologists frequently make use of language as data in their research using methodologies including open-ended surveys, in-depth interviews, and content analyses. Unfortunately, the ability of

Comparative Discourse Analysis Using Topic Models: Contrasting Perspectives on China from Reddit

A comparative analysis identifies the linguistic features that differentiate two China-focused Reddit discussion communities with contrasting perspectives, and describes the rhetorical techniques and discursive frames implied by these features and how each community uses them in discussions of the 2019 Hong Kong protests.

TOPIC MODELING FOR ANALYSIS OF PUBLIC DISCOURSE - Enriching topic modeling with linguistic information to analyze Swedish housing policies

This work investigates how the method of topic modeling can be applied to investigate the public discourse of Swedish housing policies. The data used to represent this discourse is both from the

Combining CDA and topic modeling: Analyzing discursive connections between Islamophobia and anti-feminism on an online forum

In this article we present an analysis of the discursive connections between Islamophobia and anti-feminism on a large Internet forum. We argue that the incipient shift from traditional media toward

Quantitative analysis of large amounts of journalistic texts using topic modelling

A case study of the New York Times coverage of nuclear technology from 1945 to the present shows that LDA is a useful tool for analysing trends and patterns in news content in large digital news archives relatively quickly.

Topic models do not model topics: epistemological remarks and steps towards best practices

  • A. Shadrova
  • Computer Science
    J. Data Min. Digit. Humanit.
  • 2021
It is concluded that topic modeling in its present state of methodological integration does not meet the requirements of an independent research method.

Key Topics in environmental sociology, 1990–2014: results from a computational text analysis

ABSTRACT Environmental sociology is a growing field producing a diverse body of literature while also moving into the mainstream of the larger discipline. The twin goals of this paper are to

Towards Topic Modeling Swedish Housing Policies: Using Linguistically Informed Topic Modeling to Explore Public Discourse

This work investigates what effect linguistically informed preprocessing has on topic modeling, and exemplifies how topic modeling can be used to explore public discourse. The area of Swedish housing policies is chosen, as represented by documents from the Swedish parliament and Swedish news texts.

Agenda-Setting in Cross-National Coverage of COVID-19: An Analysis of Elite Newspapers in US and China with Topic Modeling

  • Kan Wu
  • Sociology
    Online Journal of Communication and Media Technologies
  • 2021
This study examines agenda-setting in US-China elite newspapers coverage of COVID-19 through topic modeling. It attempts to contribute to studies of media agenda first by demonstrating the relevance



How to Analyze Political Attention with Minimal Assumptions and Costs

Previous methods of analyzing the substance of political attention have had to make several restrictive assumptions or been prohibitively costly when applied to large-scale political texts. Here, we

A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases

A statistical model is introduced that attends to the structure of political rhetoric when measuring expressed priorities: statements are naturally organized by author to simultaneously estimate the topics in the texts, as well as the attention political actors allocate to the estimated topics.

Automatic Annotation of Semantic Fields for Political Science Research

Three approaches to the automatic annotation of political texts for semantic fields are presented: unsupervised clustering, dictionary-based approaches, and a method based on relevant experimental data, applied to analyzing Margaret Thatcher's political rhetoric.

Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts

Politics and political conflict often occur in the written and spoken word. Scholars have long recognized this, but the massive costs of analyzing even moderately sized collections of texts have

Topic Modeling for the Social Sciences

Recent work aimed at solving barriers to adoption encountered during a collaboration between the Stanford NLP group and social scientists in the school of education is introduced including the Stanford Topic Modeling Toolbox software.

Finding scientific topics

  • T. Griffiths, M. Steyvers
  • Computer Science
    Proceedings of the National Academy of Sciences of the United States of America
  • 2004
A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.

A Language-based Approach to Measuring Scholarly Impact

This work proposes using changes in the thematic content of documents over time to measure the importance of individual documents within the collection, and describes a dynamic topic model for both quantifying and qualifying the impact of these documents.

Is anyone responsible? How television frames political issues.

A disturbingly cautionary tale, "Is Anyone Responsible?" anchors with powerful evidence suspicions about the way in which television has impoverished political discourse in the United States and at

Measuring Explicit Political Positions of Media

We amass a new, large-scale dataset of newspaper editorials that allows us to calculate fine-grained measures of the political positions of newspaper editorial pages. Collecting and classifying over

Probabilistic Topic Models

  • D. Blei
  • Computer Science
    IEEE Signal Processing Magazine
  • 2010
In this article, we review probabilistic topic models: graphical models that can be used to summarize a large collection of documents with a smaller number of distributions over words. Those