Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables. / Christiansen, Rune; Peters, Jonas.

I: Journal of Machine Learning Research, Bind 21, (41), 2020, s. 1-46.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Christiansen, R & Peters, J 2020, 'Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables', Journal of Machine Learning Research, bind 21, (41), s. 1-46.

APA

Christiansen, R., & Peters, J. (2020). Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables. Journal of Machine Learning Research, 21, 1-46. [(41)].

Vancouver

Christiansen R, Peters J. Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables. Journal of Machine Learning Research. 2020;21:1-46. (41).

Author

Christiansen, Rune ; Peters, Jonas. / Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables. I: Journal of Machine Learning Research. 2020 ; Bind 21. s. 1-46.

Bibtex

@article{e9b480decabd4aacbb70088c0a1b6d51,
title = "Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables",
abstract = "Given a response Y and a vector X = (X-1, ...,X-d) of d predictors, we investigate the problem of inferring direct causes of Y among the vector X. Models for Y that use all of its causal covariates as predictors enjoy the property of being invariant across different environments or interventional settings. Given data from such environments, this property has been exploited for causal discovery. Here, we extend this inference principle to situations in which some (discrete-valued) direct causes of Y are unobserved. Such cases naturally give rise to switching regression models. We provide sufficient conditions for the existence, consistency and asymptotic normality of the MLE in linear switching regression models with Gaussian noise, and construct a test for the equality of such models. These results allow us to prove that the proposed causal discovery method obtains asymptotic false discovery control under mild conditions. We provide an algorithm, make available code, and test our method on simulated data. It is robust against model violations and outperforms state-of-the-art approaches. We further apply our method to a real data set, where we show that it does not only output causal predictors, but also a process-based clustering of data points, which could be of additional interest to practitioners.",
keywords = "causal discovery, invariance, switching regression models, hidden Markov models, latent variables",
author = "Rune Christiansen and Jonas Peters",
year = "2020",
language = "English",
volume = "21",
pages = "1--46",
journal = "Journal of Machine Learning Research",
issn = "1533-7928",
publisher = "MIT Press",

}

RIS

TY - JOUR

T1 - Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables

AU - Christiansen, Rune

AU - Peters, Jonas

PY - 2020

Y1 - 2020

N2 - Given a response Y and a vector X = (X-1, ...,X-d) of d predictors, we investigate the problem of inferring direct causes of Y among the vector X. Models for Y that use all of its causal covariates as predictors enjoy the property of being invariant across different environments or interventional settings. Given data from such environments, this property has been exploited for causal discovery. Here, we extend this inference principle to situations in which some (discrete-valued) direct causes of Y are unobserved. Such cases naturally give rise to switching regression models. We provide sufficient conditions for the existence, consistency and asymptotic normality of the MLE in linear switching regression models with Gaussian noise, and construct a test for the equality of such models. These results allow us to prove that the proposed causal discovery method obtains asymptotic false discovery control under mild conditions. We provide an algorithm, make available code, and test our method on simulated data. It is robust against model violations and outperforms state-of-the-art approaches. We further apply our method to a real data set, where we show that it does not only output causal predictors, but also a process-based clustering of data points, which could be of additional interest to practitioners.

AB - Given a response Y and a vector X = (X-1, ...,X-d) of d predictors, we investigate the problem of inferring direct causes of Y among the vector X. Models for Y that use all of its causal covariates as predictors enjoy the property of being invariant across different environments or interventional settings. Given data from such environments, this property has been exploited for causal discovery. Here, we extend this inference principle to situations in which some (discrete-valued) direct causes of Y are unobserved. Such cases naturally give rise to switching regression models. We provide sufficient conditions for the existence, consistency and asymptotic normality of the MLE in linear switching regression models with Gaussian noise, and construct a test for the equality of such models. These results allow us to prove that the proposed causal discovery method obtains asymptotic false discovery control under mild conditions. We provide an algorithm, make available code, and test our method on simulated data. It is robust against model violations and outperforms state-of-the-art approaches. We further apply our method to a real data set, where we show that it does not only output causal predictors, but also a process-based clustering of data points, which could be of additional interest to practitioners.

KW - causal discovery

KW - invariance

KW - switching regression models

KW - hidden Markov models

KW - latent variables

M3 - Journal article

VL - 21

SP - 1

EP - 46

JO - Journal of Machine Learning Research

JF - Journal of Machine Learning Research

SN - 1533-7928

M1 - (41)

ER -

ID: 243008269