Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables

Research output: Contribution to journalJournal articlepeer-review

Documents

  • Rune Christiansen
  • Jonas Peters
Given a response Y and a vector X = (X-1, ...,X-d) of d predictors, we investigate the problem of inferring direct causes of Y among the vector X. Models for Y that use all of its causal covariates as predictors enjoy the property of being invariant across different environments or interventional settings. Given data from such environments, this property has been exploited for causal discovery. Here, we extend this inference principle to situations in which some (discrete-valued) direct causes of Y are unobserved. Such cases naturally give rise to switching regression models. We provide sufficient conditions for the existence, consistency and asymptotic normality of the MLE in linear switching regression models with Gaussian noise, and construct a test for the equality of such models. These results allow us to prove that the proposed causal discovery method obtains asymptotic false discovery control under mild conditions. We provide an algorithm, make available code, and test our method on simulated data. It is robust against model violations and outperforms state-of-the-art approaches. We further apply our method to a real data set, where we show that it does not only output causal predictors, but also a process-based clustering of data points, which could be of additional interest to practitioners.
Original languageEnglish
Article number(41)
JournalJournal of Machine Learning Research
Volume21
Pages (from-to)1-46
ISSN1532-4435
Publication statusPublished - 2020

    Research areas

  • causal discovery, invariance, switching regression models, hidden Markov models, latent variables

Number of downloads are based on statistics from Google Scholar and www.ku.dk


No data available

ID: 243008269