Skip to main content

Search NYU Steinhardt

Locally Dependent Mixed Membership Estimation for High-dimensional Categorical Data

Yuqi Gu
10 am - 11 am ET
Oct
09
Wed
2nd floor conference room (Room 211)
246 Greene Street, New York, NY 10003

A PRIISM Seminar by Columbia's Yuqi Gu

RSVP for Zoom link

In person RSVP

Join PRIISM and Yuqi Gu to learn about an alternative to Latent Class Analysis, the Grade of Membership model and its estimation with multivariate categorical data.

Abstract

Mixed membership models are popular individual-level mixture models widely used in various fields including network analysis, topic modeling, and multivariate categorical data analysis. This work focuses on mixed membership models for multivariate categorical data, which are also called Grade of Membership (GoM) models. GoM models drastically increase the modeling flexibility of latent class models by allowing each individual to partially belong to multiple extreme latent profiles. However, such flexibility also comes with challenging identifiability and estimation issues, especially for high-dimensional polytomous (categorical with more than two categories) data. Such data take the form of a three-way tensor, with N subjects responding to J items each with C categories. Existing estimation methods based on maximum likelihood or Bayesian MCMC inference are not computationally efficient and lack high-dimensional theoretical guarantees. We propose an SVD-based spectral method for high-dimensional polytomous Models with potential local dependence. We innovatively flatten the three-way tensor into a “fat” matrix and exploit the singular subspace geometry based on the matrix SVD for estimation. We establish fine-grained finite-sample entrywise error bounds for all the parameters. Moreover, we develop a novel two-to-infinity singular subspace perturbation theory under arbitrary local dependent noise, which is of independent interest. Simulations and applications to real-world data in genetics, political science, and single-cell sequencing demonstrate the merit of the proposed method.

Bio

Yuqi Gu is an Assistant Professor in the Department of Statistics at Columbia University. She is also a member of the Data Science Institute. Before joining Columbia in 2021, she spent a year as a postdoc at Duke University, mentored by David B. Dunson. In 2020, she received a Ph.D. in Statistics from the University of Michigan, advised by Gongjun Xu. In 2015, she received a B.S. in Mathematics from Tsinghua University.

NYU provides reasonable accommodations to people with disabilities. Please submit your request for accommodations for events and services at least two weeks before the date of your accommodation need. Although we can't guarantee accommodation requests received less than two weeks before the event, you should still contact us and we will do our best to meet your accommodation need. Please email Susana Toro for assistance.

Related Programs

Applied Statistics for Social Science Research

Learn advanced quantitative research techniques and apply them to critical policy issues across social, behavioral, and health sciences.

Read More

Related Departments

Applied Statistics, Social Science, and Humanities

Want to investigate urgent social trends? Explore our degree offerings in education studies, international education, applied statistics, policy, and sociology.

Read More