Scalable Time Series Causal Discovery with Approximate Causal Ordering

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 14 Oct 2025Embargo end date: 01 Jan 2024 English Publisher:MDPI AGJournal:Mathematics, volume 13, page 3,288 (eissn: 2227-7390,

Copyright policy )Funded by:UKRI | Centre for Spatial Comput..., UKRI | Event-based parallel comp..., UKRI | DART: Design Accelerators... +1 projects

Authors: Ziyang Jiao; Ce Guo; Wayne Luk;

doi: 10.3390/math13203288 , 10.48550/arxiv.2409.05500

arXiv: 2409.05500

Scalable Time Series Causal Discovery with Approximate Causal Ordering

- Summary
- Subjects
- Metrics

Abstract

Causal discovery in time series data presents a significant computational challenge. Standard algorithms are often prohibitively expensive for datasets with many variables or samples. This study introduces and validates a heuristic approximation of the VarLiNGAM algorithm to address this scalability problem. The standard VarLiNGAM method relies on an iterative refinement procedure for causal ordering that is computationally expensive. Our heuristic modifies this procedure by omitting the iterative refinement. This change permits a one-time precomputation of all necessary statistical values. The algorithmic modification reduces the time complexity of VarLiNGAM from O(m3n) to O(m2n+m3) while keeping the space complexity at O(m2), where m is the number of variables and n is the number of samples. While an approximation, our approach retains VarLiNGAM’s essential structure and empirical reliability. On large-scale financial data with up to 400 variables, our algorithm achieves up to a 13.36× speedup over the standard implementation and an approximate 4.5× speedup over a GPU-accelerated version. Evaluations across medical time series analysis, IT service monitoring, and finance demonstrate the heuristic’s robustness and practical scalability. This work offers a validated balance between computational efficiency and discovery quality, making large-scale causal analysis feasible on personal computers.

Related Organizations

Imperial College London
United Kingdom
Department of Computing, Imperial College London
United Kingdom

Keywords

Machine Learning, Performance (cs.PF), FOS: Computer and information sciences, Performance, Computation, Distributed, Parallel, and Cluster Computing, Distributed, Parallel, and Cluster Computing (cs.DC), Computation (stat.CO), Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green

gold

Funded by

UKRI| Centre for Spatial Computational Learning, UKRI| Event-based parallel computing - partially ordered event-triggered systems (POETS), UKRI| DART: Design Accelerators by Regulating Transformations, UKRI| SONNETS: Scalability Oriented Novel Network of Event Triggered Systems