Robust Statistical Methods for Noisy Complex Network Data
Presented by Wenrui Li, University of Pennsylvania
Thursday, February 1 2024
3:30 PM-4:30 PM ET
AUST 105
Webex Meeting Link
Coffee will be served at 3:00 pm in the Noether Lounge (AUST 326)
In recent years there has been an explosion of network data from seemingly all corners of science. Such data present unique analytical challenges. In particular, it is widely recognized by practitioners that there is measurement error associated with most network data including, but not limited to, social networks and biological networks and pathways (e.g., gene regulatory pathways and metabolic networks). By ‘measurement error' we mean true edges being observed as non-edges, and vice versa. However, there has been little attention given, at the level of statistical theory and methods, to the problem of noisy networks. In this talk, I will introduce our recent efforts to account for network noise in two settings: Bayesian model for high-dimensional data with noisy network information and causal inference on noisy networks.
In the first project, we propose a graph-guided Bayesian modeling framework to account for network noise in regression models involving structured high-dimensional predictors. Our approach uses two sources of network information, namely, the noisy graph extracted from existing databases and the graph estimated for predictors in the data at hand, to inform the latent true graph. We demonstrate the advantages of our method over existing methods in simulations, and through analyses of -omics datasets for Alzheimer’s disease.
In the second project, we quantify biases and variances of standard estimators of average causal effects in noisy networks and develop a general framework for estimation of true average causal effects. We employ method-of-moments techniques to derive estimators and establish their asymptotic unbiasedness, consistency, and normality. Simulations in the context of social contact networks in British secondary schools suggest that substantial inferential accuracy by our estimators is possible in networks of even modest size when nontrivial noise is present.
Speaker Bio:
Wenrui Li, PhD is a postdoctoral researcher at the University of Pennsylvania. She received her PhD in statistics from Boston University. Her research interests include high-dimensional data analysis, statistics for network data, causal inference under interference, and statistical methods for infectious disease transmission and surveillance.