CONSTRUCTION OF AN OPTIMAL CONTROL TRAJECTORY IN AN INTELLIGENT SYSTEM IN THE ABSENCE OF OBSERVABLE VARIABLES
Abstract
Constructing optimal control in the complete absence of data on the system dynamics is a pressing problem. In this paper, we propose a solution to a finite-horizon linear quadratic problem (LCP) for a time-invariant system with a graph dynamics matrix. Unlike the control problem, stability and complete controllability of the system are not assumed. The construction of the control trajectory is controlled by the direction of increase in the change in the state of variables over a small number of steps, which is determined by the conditional principal eigenvector of the adjacency matrix of the graph model. The solution of classical optimal control is carried out in an autonomous mode and requires complete knowledge of the system dynamics. In the absence of complete knowledge of the system dynamics, solving optimal control problems for systems with uncertainty, including discrete linear systems, has attracted considerable interest in recent years. The main approach when complete information about the system is unavailable is the design of optimal control, in which the system parameters are initially determined, and then an algebraic equation in the dual space is solved. An important difference from the standard discrete control problem is that the control model was modified to estimate changes in the state of variables under controls transmitted through the dynamics matrix. The proposed algorithm using a graph matrix implements recurrent calculations of dynamic and adjoint equations, as well as the Powell method for solving a system of linear algebraic equations (SLAE). The authors introduced a new interpretation of the mathematical construction of the system dynamics matrix in a standard discrete control problem on a finite time interval, which can be used to design any controlled dynamic system with unobservable parameters.
References
1. de Aguiar M.A.M., Bar-Yam Y. Spectral analysis and the dynamic response of complex networks, Phys. Rev. E, 2005, 71, 016106. Available at: https://doi.org/10.1103/PhysRevE.71.016106.
2. Butterworth J., Dunne P.E. Spectral Techniques in Argumentation Framework Analysis, Computational Models of Argument, 2016, pp. 167-178. Available at: https://doi.org/10.3233/978-1-61499-686-6-167.
3. Gadiyaram V., Ghosh S., Vishveshwara S. A graph spectral-based scoring scheme for network compar-ison, J. Complex Networks, cnw016, 2016. Available at: https://doi.org/10.1093/comnet/cnw016.
4. Pei S., Wang J., Morone F., Makse H.A. Influencer identification in dynamical complex systems,
J. Complex Networks, 2019. Available at: https://doi.org/10.1093/comnet/cnz029.
5. Ritter F.E., Shadbolt N.R., Elliman D., Young R.M., Gobet F., Baxter G.D. Techniques for Modeling Human Performance in Synthetic Environments: A Supplementary Review, 2003. Available at: https://doi.org/10.21236/ADA487721.
6. Gray W.D. ed. Integrated Models of Cognitive Systems. Oxford University Press, 2007. Available at: https://doi.org/10.1093/acprof:oso/9780195189193.001.0001.
7. Langley P., Laird J.E., Rogers S. Cognitive architectures: Research issues and challenges, Cogn. Syst. Res., 2009, 10, pp. 141-160. Available at: https://doi.org/10.1016/j.cogsys.2006.07.004.
8. Kotseruba I., Tsotsos J.K. 40 years of cognitive architectures: core cognitive abilities and practical appli-cations, Artif. Intell. Rev., 2020, 53, pp. 17-94. Available at: https://doi.org/10.1007/s10462-018-9646-y.
9. Sgaier S.K., Huang V., Charles G. The Case for Causal AI, Stanford Soc. Innov. Rev., 2020, 18 (3), pp. 50-55. Available at: https://doi.org/10.48558/KT81-SN73.
10. Razavi S.E., Moradi M.A., Shamaghdari S., Menhaj M.B. Adaptive optimal control of unknown dis-crete-time linear systems with guaranteed prescribed degree of stability using reinforcement learning, Int. J. Dyn. Control, 2022, 10, pp. 870-878. Available at: https://doi.org/10.1007/s40435-021-00836-x.
11. Moulton R.H., Rudie K. Online control of discrete-event systems: A survey, Annu. Rev. Control, 2022, 54, pp. 24-48. Available at: https://doi.org/10.1016/j.arcontrol.2022.08.002.
12. Katsuhiko O. Discrete-time control systems. Prentice-Hall, Inc., USA, 1995.
13. Tselykh A., Vasilev V., Tselykh L. A Method for Modeling the Control Impact Strategy Based on the Mental Frame of References of the Decision-Maker. Presented at the 2023. Available at: https://doi.org/10.1007/978-3-031-43789-2_29.
14. Pontryagin L.S. Mathematical Theory of Optimal Processes. Routledge, 2018. Available at: https://doi.org/10.1201/9780203749319.
15. Macki J., Strauss A. Necessary Conditions for Optimal Controls — The Pontryagin Maximum Principle. Presented at the 1982. Available at: https://doi.org/10.1007/978-1-4612-5671-7_5.
16. Bertsekas D.P. Constrained Optimization and Lagrange Multiplier Methods. Athena Scientifi, Belmont, MA, 1996.
17. Tselykh A., Vasilev V., Tselykh L., Ferreira F.A.F. Influence control method on directed weighted signed graphs with deterministic causality, Ann. Oper. Res., 2022, 311, pp. 1281-1305. Available at: https://doi.org/10.1007/s10479-020-03587-8.
18. Tselykh A., Vasilev V., Tselykh L. Effect of Resonance in the Effective Control Model Based on the Spread of Influence on Directed Weighted Signed Graphs, Advances in Intelligent Systems and Comput-ting, 2020, pp. 270-280. Available at: https://doi.org/10.1007/978-3-030-50097-9_28.
19. Powell M.J.D. An efficient method for finding the minimum of a function of several variables without calculating derivatives, Comput. J., 1964, 7, pp. 155-162. Available at: https://doi.org/10.1093/ comjnl/7.2.155.
20. Carvalho J.P. On the semantics and the use of fuzzy cognitive maps and dynamic cognitive maps in social sciences, Fuzzy Sets Syst., 2013, 214, pp. 6-19. Available at: https://doi.org/10.1016/ j.fss.2011.12.009








