• Corpus ID: 210713851

Recovering Network Structure from Aggregated Relational Data using Penalized Regression

  title={Recovering Network Structure from Aggregated Relational Data using Penalized Regression},
  author={Hossein Alidaee and Eric Auerbach and Michael P. Leung},
  journal={arXiv: Econometrics},
Social network data can be expensive to collect. Breza et al. (2017) propose aggregated relational data (ARD) as a low-cost substitute that can be used to recover the structure of a latent social network when it is generated by a specific parametric random effects model. Our main observation is that many economic network formation models produce networks that are effectively low-rank. As a consequence, network recovery from ARD is generally possible without parametric assumptions using a… 

Tables from this paper

Detecting latent communities in network formation models
It is shown that the latent communities can be exactly recovered when the expected degree of the network is of order log n or higher, where n is the number of nodes in the network.
Spectral goodness-of-fit tests for complete and partial network data
This work addresses the question of how to determine whether a parametric model, such as a stochastic block model or latent space model, fits a dataset well and will extrapolate to similar data and uses recent results in random matrix theory to derive a general goodnessof-fit test for dyadic data.
Estimating Peer Effects Using Partial Network Data
This work provides an empirical application to the study of peer effects on students academic achievement using the widely used Add Health database and shows that network data errors have a first-order downward bias on estimated peer effects.
Identification and Inference of Network Formation Games with Misclassified Links
This paper considers a network formation model when links are potentially measured with error, and derives moment conditions that characterize the identified set for the preference parameters associated with homophily and network externalities.
Reevaluating COVID-19 Mandates using Tensor Completion
We propose a new method that uses tensor completion to estimate causal effects with multivariate longitudinal data—data in which multiple outcomes are observed for each unit and time period. Our
Social Inequality and the Spread of Misinformation
It is shown that the prevalence of misinformation in society is nonmonotone in the level of inequality, which implies that policies that decrease inequality without substantially reducing it can leave society more vulnerable to misinformation.
Inequality in Social Learning
  • Economics
  • 2020
We study the spread of misinformation in a social network characterized by unequal access to learning resources. Agents use social learning combined with their own signals to uncover an unknown state
Policy Targeting under Network Interference
A strong set of guarantees on the utilitarian regret is established, i.e., the difference between the average social welfare attained by the estimated policy function and the maximum attainable welfare.
Policy design in experiments with unknown interference
An experimental design for estimation and inference on welfaremaximizing policies in the presence of spillover effects in a setting where units are organized into a finite number of large clusters and interact in unknown ways within each cluster is proposed.


Latent Surface Models for Networks Using Aggregated Relational Data
A latent surface representation of social network structure for partially observed network data is introduced and a multivariate measure of expected (latent) distance between an observed actor and unobserved actors with given features is derived.
Recovering social networks from panel data: identification, simulations and an application
The method shows how the analysis of social interactions can be usefully extended to economic realms where no network data exists and provides novel insights into the long-standing debate on the relative roles of factor mobility and yardstick competition in driving tax setting behavior across states.
Latent Space Approaches to Social Network Analysis
This work develops a class of models where the probability of a relation between actors depends on the positions of individuals in an unobserved “social space,” and proposes Markov chain Monte Carlo procedures for making inference on latent positions and the effects of observed covariates.
Nuclear norm regularized estimation of panel regression models
In this paper we investigate panel regression models with interactive fixed effects. We propose two new estimation methods that are based on minimizing convex objective functions. The fi rst method
Spectral clustering and the high-dimensional stochastic blockmodel
The asymptotic results in th is paper are the first clustering results that allow the number of clusters in the model to grow with theNumber of nodes, hence the name high-dimensional.
Matrix Completion Methods for Causal Panel Data Models
A class of estimators that uses the observed elements of the matrix of control outcomes corresponding to untreated unit/periods to predict the "missing" elements ofThe matrix, corresponding to treated units/ periods is developed, which generalizes results from the matrix completion literature by allowing the patterns of missing data to have a time series dependency structure.
Statistical Inference on Random Dot Product Graphs: a Survey
This survey paper describes a comprehensive paradigm for statistical inference on random dot product graphs, a paradigm centered on spectral embeddings of adjacency and Laplacian matrices, and investigates several real-world applications, including community detection and classification in large social networks and the determination of functional and biologically relevant network properties from an exploratory data analysis of the Drosophila connectome.
I formulate and study a model of undirected dyadic link formation which allows for assortative matching on observed agent characteristics (homophily) as well as unrestricted agent level heterogeneity
Estimating the Structure of Social Interactions Using Panel Data
I consider settings where outcomes depend on own characteristics and on the characteristics of other individuals in the sample. I propose a method to identify individuals generating spillovers and