METHODS FOR MINING CAUSUSITY FROM OBSERVATIONS IN ARTIFICIAL INTELLIGENCE

  • М.Y. Georgi Southern Federal University
Keywords: Causal AI, AI fairness, few-shot fine-tuning, counterfactual reasoning

Abstract

The article discusses the importance of capturing causal relationships in machine learning
for decision-making and evaluating real-world impact. It is noted that most current successes in
machine learning are based on pattern recognition and correlation analysis; however, for more
complex tasks, extracting causal relationships is necessary. The problems of explainability of predictions
and causal understanding, even with the use of advanced machine learning techniques
such as LIME, SHAP, TreeSHAP, DeepSHAP, and Shapley Flow, are recognized as fundamental
obstacles in the development of artificial intelligence. The article briefly presents the main philosophical
and mathematical concepts and definitions of causality, including counterfactuals, Bayesian
networks, directed acyclic graphs, and causal formal inference. It concludes that the practical
significance of data-based causal analysis consists in answering a priori formulated questions,
which may reflect a hypothetical relationship between an event (a cause) and a second event (an
effect), where the second event is a direct consequence of the first. A comparative analysis of the
methods and main scenarios for using the Causal Discovery and Causal Inference frameworks is
also carried out. Based on this analysis, it becomes possible to make assumptions about the causal
structure underlying the investigated dataset and to use statistical methods to evaluate the strength
and direction of such relationships. The article also discusses methods and algorithms of causal
analysis and their application in real-world tasks. Representative methods are mentioned, such as
constraint-based models, estimation-based models, functional causal models, (conditional) independence
tests, evaluation functions, and other tools that can be used to solve the problem of extracting
causal relationships from observational data. Most of these methods are implemented in
open-source frameworks such as Microsoft DoWhy, Uber CausalML, causal-learn, Econ-ml, and
many others, which facilitate causal analysis.

References

1. Hassani H., Xu H., Mansi G. Big data and causality, Annals of Data Science, 2018, 5, pp. 133-156.
2. Pearl J., Mackenzie D. The book of why. Penguin Books, 2019.
3. Schölkopf B., Locatello F., Bauer S., Ke N.R., Kalchbrenner N., Goyal A., and Bengio Y. Toward
causal representation learning, Proceedings of the IEEE, 2021, 109 (5), pp. 612-634.
4. Naser M. Causality, causal discovery, and causal inference in structural engineering, arXiv
preprint arXiv:2204.01543, 2022.
5. Hernán M., Robins J. Estimating causal effects from epidemiological data, Journal of Epidemiology
and Community Health, 2006 Jul., 60 (7), pp. 578-586.
6. Lesher J.H.H. Hume's Analysis of "Cause" and the 'Two-Definitions' Dispute, Journal of the
History of Philosophy, 1973, 11 (3), pp. 387-392.
7. Lewis D. Causation, The journal of philosophy, 1973, 70 (17), pp. 556-567.
8. Hofer-Szabó G., Rédei M., Szabó L.E. On Reichenbach's common cause principle and
Reichenbach's notion of common cause, The British Journal for the Philosophy of Science,
1999, 50 (3), pp. 377-399.
9. Pearl J. An Introduction to Causal Inference, The International Journal of Biostatistics, 2010, 6 (2).
10. Glymour M., Pearl J., Jewell N.P. Causal inference in statistics: A primer. John Wiley & Sons.
2016.
11. Bareinboim E., Pearl J. Causal inference and the data-fusion problem, Proceedings of the
National Academy of Sciences, 2016, 113 (27), pp. 7345-7352.
12. Pellet J.P. Effective Causal Analysis: Methods for Structure Learning and Explanations, 2010.
(Doctoral dissertation, ETH Zurich).
13. Triantafillou S., Tsamardinos I. Constraint-based causal discovery from multiple interventions
over overlapping variable sets, The Journal of Machine Learning Research, 2015, 16 (1),
pp. 2147-2205.
14. Glymour C., Zhang K., Spirtes P. Review of causal discovery methods based on graphical
models, Frontiers in genetics, 2019, 10, pp. 524.
15. Nogueira A.R, Pugnana A., Ruggieri S., Pedreschi D., Gama J. Methods and tools for causal
discovery and causal inference, Wiley interdisciplinary reviews: data mining and knowledge
discovery, 2022, 12 (2), e1449.
16. Yao L., Chu Z., Li S., Li Y., Gao J., Zhang A. A survey on causal inference, ACM Transactions
on Knowledge Discovery from Data (TKDD), 2021, 15 (5), pp. 1-46.
17. Rubin D.B. Estimating causal effects of treatments in randomized and nonrandomized studies,
Journal of educational Psychology, 1974, 66 (5), pp. 688.
18. Imbens G.W., Rubin D.B. Causal inference in statistics, social, and biomedical sciences. Cambridge
University Press, 2015.
19. hernozhukov V., Wüthrich K., Zhu Y. An exact and robust conformal inference method for
counterfactual and synthetic controls, Journal of the American Statistical Association, 2021,
116 (536), pp. 1849-1864.
20. Pearl J. Causal diagrams for empirical research, Biometrika, 1995, 82 (4), pp. 669-688.
21. Pearl Judea. The do-calculus revisited, arXiv preprint arXiv:1210.4852, 2012.
22. Vonk M., Malekovic N., Bäck T., Kononova A. Disentangling causality: assumptions in causal
discovery and inference, Artificial Intelligence Review, 2023, pp. 1-37.
23. Teinemaa I., Albert J., Goldenberg D. Uplift modeling: from causal inference to personalization,
In Companion Proceedings of the Web Conference, 2021.
Published
2023-08-14
Section
SECTION II. INFORMATION PROCESSING ALGORITHMS