NON-STATISTICAL METHODS OF AUTOMATIC EXTRACTION OF CAUSAL RELATIONSHIPS FROM THE TEXT

  • H.B. Shtanchaev Dagestan State Technical University
Keywords: Causal relationships, causal knowledge, natural language processing, ambiguity, implicit causal relationships, computational linguistics

Abstract

Most of the first attempts to extraction of causal relationship were tied with complex and manual
linguistic patterns, syntactic rules and small datasets based on domain. This article examines the paradigm
of a non-statistical approach to the extraction of causal relationships, its basis, language constructs,
patterns, and classification of causal relationships. The aim was to study the methods of this
paradigm, to determine their disadvantages, advantages, and the possibility of their application.The article discusses various approaches given by the authors of well-known and highly cited research
papers and their impact on the success of the extraction of causal relationships. The analysis of these
scientific papers has unequivocally confirmed that the task of extracting CR is an extremely difficult task
of natural language processing. The presence of a variety of linguistic constructions of the language,
ambiguities of various kinds, as well as language features greatly affect the accuracy of CR extraction.
Almost all non-statistical methods have encountered the problem of highly specialized fields of
knowledge, where expert description is almost always required. Also, almost all non-statistical methods
are manual or semi-automatic, because assume the construction of templates for determining the CR in
the text. Even though non-static methods with sufficient accuracy (on average 70-80%) successfully
cope with the task under consideration, there is currently no universal method for extracting CR.
The proposed method should be universal with respect to languages, universal with respect to subject
areas and with the possibility of defining implicit CR.

References

1. Khoo C.S., Kornfilt J., Oddy R.N., Myaeng S.H. Automatic extraction of cause-effect information
from newspaper text without knowledge-based inferencing, Literary and Linguistic
Computing, 1998. Vol. 13, No. 4, pp. 177-186.
2. Altenberg B. Causal linking in spoken and written English, Studia linguistic, 1984. Vol. 38,
No. 1, pp. 20-69.
3. Selfridge M. Toward a natural language-based causal model acquisition system // Applied
Artificial Intelligence an International Journal. 1989. Vol. 3, №2-3. P. 191–212.
4. Kaplan R. M., Berry-Rogghe G. Knowledge-based acquisition of causal relationships in text //
Knowledge Acquisition, 1991, Vol. 3, No. 3, pp. 317-337.
5. Doyle R.J. Hypothesizing and refining causal models, DTIC Document, Tech. Rep., 1984.
6. Grishman R. Domain modeling for language analysis, Linguistic approaches to artificial intelligence,
1990, pp. 41-58.
7. Kontos J., Sidiropoulou M. On the acquisition of causal knowledge from scientific texts with
attribute grammars, International Journal of Applied Expert Systems, 1991, Vol. 4, No. 1,
pp. 31-48.
8. Garcia D. Coatis, an NLP system to locate expressions of actions connected by causality links,
in Knowledge Acquisition, Modeling and Management, Springer, 1997, pp. 347-352.
9. Khoo C.S., Chan S., Niu Y. Extracting causal knowledge from a medical database using graphical
patterns, Proceedings of the 38th Annual Meeting on Association for Computational Linguistics.
Association for Computational Linguistics, 2000, pp. 336-343.
10. Khoo C.S., Chan S., Niu Y., Ang A. A method for extracting causal knowledge from textual
databases, Singapore journal of library & information management, 1999, Vol. 28, pp. 48-63.
11. Khoo C.S. Automatic identification of causal relations in text and their use for improving precision
in information retrieval // Ph.D. dissertation. 1995.
12. Finn R. Program uncovers hidden connections in the literature, The Scientist, 1998, Vol. 12,
No. 10, pp. 12-13.
13. Girju R. and Moldovan D. Text mining for causal relations, FLAIRS Conference, 2002,
pp. 360-364.
14. Chan K. and Lam W. Extracting causation knowledge from natural language texts, International
Journal of Intelligent Systems, 2005, Vol. 20, No. 3, pp. 327-358.
15. Low B.-T., Chan K., Choi L.-L., Chin M.-Y., Lay S.-L. Semantic expectation-based causation
knowledge extraction: A study on Hongkong stock movement analysis, Advances in
Knowledge Discovery and Data Mining. Springer, 2001, pp. 114-123.
16. Chan K., Low B.-T., Lam W., Lam K.-P. Extracting causation knowledge from natural language
texts. Springer, 2002.
17. Ittoo A., Bouma G. Extracting explicit and implicit causal relations from sparse, domain-specific
texts, Natural Language Processing and Information Systems. Springer, 2011, pp. 52-63.
18. Ittoo A., Bouma G. Minimally supervised learning of domain specific causal relations using an
open-domain corpus as knowledge base, Data & Knowledge Engineering, 2013, Vol. 88,
pp. 142-163.
19. Bethard S., Corvey W.J., Klingenstein S., Martin J.H. Building a corpus of temporal-causal
structure, LREC, 2008.
20. Al Hashimy H., Saleh A., Kulathuramaiyer N. Ontology enrichment with causation relations,
Systems, Process & Control (ICSPC) 2013 IEEE Conference on. IEEE, 2013, pp. 186-192.
21. Sadek J. Automatic detection of arabic causal relations, Natural Language Processing and
Information Systems. Springer, 2013, pp. 400-403.
22. Ackerman E.J.M. Extracting a causal network of news topics, On the Move to Meaningful
Internet Systems: OTM 2012 Workshops. Springer, 2012, pp. 33-42.
23. Ishii H., Ma Q., Yoshikawa M. Causal network construction to support understanding of news, in
System Sciences (HICSS), 2010 43rd Hawaii International Conference on. IEEE, 2010, pp 1-10.
24. Incremental construction of causal network from news articles Information and Media Technologies,
2012, Vol. 7, No. 1, pp. 110-118.
25. Arauz P.L. and Faber P. Causality in the specialized domain of the environment, Semantic Relations-
II. Enhancing Resources and Applications Workshop Programme, Citeseer, 2012, pp. 10.
Published
2023-06-07
Section
SECTION III. INFORMATION PROCESSING ALGORITHMS