A Causal Framework for Distribution Generalization

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

A Causal Framework for Distribution Generalization. / Christiansen, Rune; Pfister, Niklas Andreas; Emil Jakobsen, Martin; Gnecco, Nicola; Peters, Jonas Martin.

I: I E E E Transactions on Pattern Analysis and Machine Intelligence, Bind 44, Nr. 10, 2022, s. 6614-6630.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Christiansen, R, Pfister, NA, Emil Jakobsen, M, Gnecco, N & Peters, JM 2022, 'A Causal Framework for Distribution Generalization', I E E E Transactions on Pattern Analysis and Machine Intelligence, bind 44, nr. 10, s. 6614-6630. https://doi.org/10.1109/TPAMI.2021.3094760

APA

Christiansen, R., Pfister, N. A., Emil Jakobsen, M., Gnecco, N., & Peters, J. M. (2022). A Causal Framework for Distribution Generalization. I E E E Transactions on Pattern Analysis and Machine Intelligence, 44(10), 6614-6630. https://doi.org/10.1109/TPAMI.2021.3094760

Vancouver

Christiansen R, Pfister NA, Emil Jakobsen M, Gnecco N, Peters JM. A Causal Framework for Distribution Generalization. I E E E Transactions on Pattern Analysis and Machine Intelligence. 2022;44(10):6614-6630. https://doi.org/10.1109/TPAMI.2021.3094760

Author

Christiansen, Rune ; Pfister, Niklas Andreas ; Emil Jakobsen, Martin ; Gnecco, Nicola ; Peters, Jonas Martin. / A Causal Framework for Distribution Generalization. I: I E E E Transactions on Pattern Analysis and Machine Intelligence. 2022 ; Bind 44, Nr. 10. s. 6614-6630.

Bibtex

@article{c63ec7e38cfb465ba928c1f8bf8546f2,
title = "A Causal Framework for Distribution Generalization",
abstract = "We consider the problem of predicting a response Y from a set of covariates X when test- and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain unchanged under arbitrary interventions on the covariates, but they are not always optimal in the above sense. For example, for linear models and bounded interventions, alternative solutions have been shown to be minimax prediction optimal. We introduce the formal framework of distribution generalization that allows us to analyze the above problem in partially observed nonlinear models for both direct interventions on X and interventions that occur indirectly via exogenous variables A . It takes into account that, in practice, minimax solutions need to be identified from data. Our framework allows us to characterize under which class of interventions the causal function is minimax optimal. We prove sufficient conditions for distribution generalization and present corresponding impossibility results. We propose a practical method, NILE, that achieves distribution generalization in a nonlinear IV setting with linear extrapolation. We prove consistency and present empirical results.",
author = "Rune Christiansen and Pfister, {Niklas Andreas} and {Emil Jakobsen}, Martin and Nicola Gnecco and Peters, {Jonas Martin}",
year = "2022",
doi = "10.1109/TPAMI.2021.3094760",
language = "English",
volume = "44",
pages = "6614--6630",
journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",
issn = "0162-8828",
publisher = "Institute of Electrical and Electronics Engineers",
number = "10",

}

RIS

TY - JOUR

T1 - A Causal Framework for Distribution Generalization

AU - Christiansen, Rune

AU - Pfister, Niklas Andreas

AU - Emil Jakobsen, Martin

AU - Gnecco, Nicola

AU - Peters, Jonas Martin

PY - 2022

Y1 - 2022

N2 - We consider the problem of predicting a response Y from a set of covariates X when test- and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain unchanged under arbitrary interventions on the covariates, but they are not always optimal in the above sense. For example, for linear models and bounded interventions, alternative solutions have been shown to be minimax prediction optimal. We introduce the formal framework of distribution generalization that allows us to analyze the above problem in partially observed nonlinear models for both direct interventions on X and interventions that occur indirectly via exogenous variables A . It takes into account that, in practice, minimax solutions need to be identified from data. Our framework allows us to characterize under which class of interventions the causal function is minimax optimal. We prove sufficient conditions for distribution generalization and present corresponding impossibility results. We propose a practical method, NILE, that achieves distribution generalization in a nonlinear IV setting with linear extrapolation. We prove consistency and present empirical results.

AB - We consider the problem of predicting a response Y from a set of covariates X when test- and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain unchanged under arbitrary interventions on the covariates, but they are not always optimal in the above sense. For example, for linear models and bounded interventions, alternative solutions have been shown to be minimax prediction optimal. We introduce the formal framework of distribution generalization that allows us to analyze the above problem in partially observed nonlinear models for both direct interventions on X and interventions that occur indirectly via exogenous variables A . It takes into account that, in practice, minimax solutions need to be identified from data. Our framework allows us to characterize under which class of interventions the causal function is minimax optimal. We prove sufficient conditions for distribution generalization and present corresponding impossibility results. We propose a practical method, NILE, that achieves distribution generalization in a nonlinear IV setting with linear extrapolation. We prove consistency and present empirical results.

U2 - 10.1109/TPAMI.2021.3094760

DO - 10.1109/TPAMI.2021.3094760

M3 - Journal article

C2 - 34232865

VL - 44

SP - 6614

EP - 6630

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

SN - 0162-8828

IS - 10

ER -

ID: 249022081