Methodological Advances in Causal Inference: Experimentation, Identification and Estimation

Akbari, Sina

doi:10.5075/epfl-thesis-10886

doctoral thesis

Methodological Advances in Causal Inference: Experimentation, Identification and Estimation

2025

Causal inference provides a powerful framework for reasoning and decision-making. However, much of its machinery hinges on assumptions that might fail in real-world applications---such as parallel trends, full observability, and known causal structure. This thesis aims to develop new causal methodologies in order to extend the boundaries of what is possible when those assumptions are violated, with contributions across identification theory, semiparametric estimation, algorithmic experiment design, and structure learning.

We begin by developing methods for causal inference in settings with panel and repeated cross-sectional data. Building on the difference-in-differences (DiD) framework, we formalize the identification strategy for the triple difference framework and introduce a class of robust and efficient semiparametric estimators compatible with machine learning-based nuisance function estimators. We then generalize the classical changes-in-changes model to accommodate the triple difference setting, enabling identification of potential outcome distributions, even in settings with high-dimensional outcome variables.

Next, we turn to the challenge of designing experiments for identifying a causal estimand of interest. The existing identification theory answers the question of whether or not the causal query is identifiable using the data at hand. When an effect is not identifiable with the available data, rather than stopping there, the natural next question becomes: What additional data or interventions would make it identifiable? We study the problem of designing the optimal (minimum-cost) interventions to make identification feasible. In parallel, we introduce a new framework for causal effect identification under uncertain causal graphs---such as those learned from data with varying confidence over edges---offering a principled way to reason about identifiability when structure is not known with certainty.

Finally, we address causal discovery in settings with unobserved confounding, selection bias, and nonlinear dependencies. First, we propose L-MARVEL, a recursive, constraint-based discovery algorithm that is both sound and complete and, achieves the tightest known bounds on the number of required conditional independence tests. Then, we present a new transport-based discovery method using monotone triangular maps, which allows causal structures to be inferred from observational data without relying on strong functional form assumptions.

Type

doctoral thesis

DOI

10.5075/epfl-thesis-10886

Author(s)

Akbari, Sina

EPFL

Advisors

Kiyavash, Negar

Jury

Prof. Alexandre Massoud Alahi (président) ; Prof. Negar Kiyavash (directeur de thèse) ; Prof. Mats Stensrud, Prof. Robin Evans, Prof. Qingyuan Zhao (rapporteurs)

Date Issued

2025

Publisher

EPFL

Publisher place

Lausanne

Public defense year

2025-10-10

Thesis number

10886

Total of pages

310

Subjects

causal inference

•

triple difference

•

identification

•

estimation

•

panel data

•

experiment design

•

optimal transport

•

causal discovery

•

latent confounding

EPFL units

Faculty

School

Doctoral School

Available on Infoscience

October 6, 2025

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/254693