Illustration of factual vs counterfactual.

Causal ML

Last updated on Fri, May 9, 2025

Slides

Causality provides a formal language for analyzing and understanding often-subtle problems in machine learning; in particular, it can formalize reasonable notions of distribution shift. At its core, causality combines probability with the notion of intervention. Distribution shifts can be viewed as a type of unknown intervention. This project explores how causality can inspire and help analyze core ML robustness problems.
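To make the "distribution shift as intervention" view concrete, here is a minimal sketch using a toy two-variable structural causal model (X causes Y). The model, the function names `sample_scm` and `fit_slope`, and the `x_shift` parameter are all illustrative assumptions, not taken from the papers below. An intervention on the mechanism for X changes the marginal of X across domains, while the mechanism for Y given X stays the same.

```python
import random
import statistics


def sample_scm(n, x_shift=0.0, seed=0):
    """Sample (x, y) pairs from a toy SCM: X -> Y.

    x_shift models an intervention on the mechanism for X (hypothetical).
    The mechanism for Y given X is never modified.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0) + x_shift   # mechanism for X (intervened on)
        y = 2.0 * x + rng.gauss(0.0, 0.1)   # mechanism for Y given X (invariant)
        samples.append((x, y))
    return samples


def fit_slope(samples):
    """Ordinary least-squares slope of y on x."""
    xs, ys = zip(*samples)
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in samples)
    var = sum((x - mx) ** 2 for x in xs)
    return cov / var


# Source domain: no intervention. Target domain: the X mechanism is shifted.
source = sample_scm(5000, x_shift=0.0, seed=0)
target = sample_scm(5000, x_shift=3.0, seed=1)

mean_x_source = statistics.fmean(x for x, _ in source)
mean_x_target = statistics.fmean(x for x, _ in target)

print("mean of X:", mean_x_source, "->", mean_x_target)          # marginal shifts
print("slope of Y on X:", fit_slope(source), fit_slope(target))  # stays near 2
```

In this toy setting the shift is known by construction; in practice the intervention behind a distribution shift is typically unknown, which is what makes the problem hard.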

One recent direction uses domain counterfactuals: counterfactuals between two domains that answer the question, “What would this sample have been like if it had been observed in the other domain or environment?” Our work has shown applications of domain counterfactuals to distribution shift explanations, counterfactual fairness, and out-of-distribution robustness. We have also worked on estimating counterfactuals given only data from the domains by leveraging a sparsity-of-intervention hypothesis.
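As a rough illustration of what a domain counterfactual computes, the sketch below follows the standard abduction, action, prediction recipe on a toy invertible model. It assumes each domain's mechanism is a known affine map of a shared exogenous variable `u`; the `DOMAINS` table and all function names are hypothetical. This is not the estimation method from the papers below, which address the harder case where the mechanisms must be learned from data alone.

```python
# Hypothetical per-domain mechanisms: x = scale * u + shift.
# Both are invertible because scale != 0.
DOMAINS = {
    "A": (1.0, 0.0),
    "B": (2.0, 5.0),
}


def generate(u, domain):
    """Prediction: push exogenous noise u through a domain's mechanism."""
    scale, shift = DOMAINS[domain]
    return scale * u + shift


def abduct(x, domain):
    """Abduction: recover the exogenous noise u that produced x in this domain."""
    scale, shift = DOMAINS[domain]
    return (x - shift) / scale


def domain_counterfactual(x, source, target):
    """What would x, observed in `source`, have looked like in `target`?"""
    u = abduct(x, source)        # abduction: infer the shared noise
    return generate(u, target)   # action + prediction: swap mechanism, regenerate


x_a = generate(1.5, "A")                          # factual sample in domain A
x_b_cf = domain_counterfactual(x_a, "A", "B")     # counterfactual in domain B
x_a_back = domain_counterfactual(x_b_cf, "B", "A")

print(x_a, x_b_cf, x_a_back)
```

Because the mechanisms are invertible, mapping a sample to the other domain and back recovers the original sample, which is a basic consistency check on any counterfactual map of this kind.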

Causality

David I. Inouye

Assistant Professor

I research trustworthy ML methods that are robust to imperfect distributional and computational assumptions using explainability, causality, and collaborative learning.

Publications

(New!) From Invariant Representations to Invariant Data: Provable Robustness to Spurious Correlations via Noisy Counterfactual Matching

Spurious correlations can cause model performance to degrade in new environments. Prior causality-inspired works aim to learn invariant …

Ruqi Bai, Yao Ji, Zeyu Zhou, David I. Inouye. Preprint, 2025.

Preprint Project

Counterfactual Fairness by Combining Factual and Counterfactual Predictions

In high-stakes domains such as healthcare and hiring, the role of machine learning (ML) in decision-making raises significant fairness …

Zeyu Zhou, Tianci Liu, Ruqi Bai, Jing Gao, Murat Kocaoglu, David I. Inouye. Neural Information Processing Systems (NeurIPS), 2024.

Preprint Project Code

Towards Characterizing Domain Counterfactuals For Invertible Latent Causal Models

Answering counterfactual queries has important applications such as explainability, robustness, and fairness but is challenging when …

Zeyu Zhou, Ruqi Bai, Sean Kulinski, Murat Kocaoglu, David I. Inouye. International Conference on Learning Representations (ICLR), 2024.

Preprint PDF Project Code

Towards Explaining Distribution Shifts

A distribution shift can have fundamental consequences such as signaling a change in the operating environment or significantly …

Sean M. Kulinski, David I. Inouye. International Conference on Machine Learning (ICML), 2023.

Preprint PDF Project Code