Detecting and quantifying causal associations in large nonlinear time series datasets

Jakob Runge, Peer Nowack, Marlene Kretschmer, Seth Flaxman, Dino Sejdinovic

Research output: Contribution to journal › Article › peer-review

436 Citations (Scopus)
37 Downloads (Pure)

Abstract

Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart, and on large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields.

Original language: English
Article number: eaau4996
Number of pages: 16
Journal: Science Advances
Volume: 5
Issue number: 11
DOIs
Publication status: Published - 27 Nov 2019
