THE COMMON ALTERNATIVE APPROACH FOR THE EFFICIENT DEEP LEARNING NEURAL NETWORKS

  • N.S. Krivsha Southern Federal University
  • V.V. Krivsha Southern Federal University
  • S.A. Butenkov Impuls Pyatigorsk Enterprise Company
Keywords: Artificial neuron nets, deep learning, information granulation theory, space granules, highperformance computations, computational intelligence

Abstract

This paper proposes a new approach to the organization of computational structures of layers
and inter-layer connections in the construction of artificial neural networks for solving a wide
range of problems of multidimensional data processing. The main problem of building deep learning
networks is the necessity of introducing a large number of network training parameters. Available
working instances of such networks contain billions of parameters, which allows to achieve
high efficiency of such networks. The downside of such a widely used structure of networks in the
form of multilayer sieve structures is the high cost of training networks with a large number of
structurally similar convolution layers by the back-propagation method. A solution to the problem
of increasing the efficiency of such multilayer structures can be found in the use of hybrid layers
realizing data granularity operations, which were developed in our work. The new hybrid models
use matrix information elements instead of vector values of training parameters, which allow encoding
subsets of data values (information granules) instead of encoding individual data points as
in classical convolutional networks. The proposed hybrid layers are trained without a teacher and
allow parallel implementation of learning algorithms, which is fundamentally different from sequential
backpropagation algorithms as a result, the computational efficiency of similar hybrid
neural networks can be significantly increased. The theoretical approach to modeling and optimizing
the structures of deep learning networks proposed in this paper can be extended to a wide
range of computational intelligence problems.

References

1. Terekhov V.F., Efimov D.V., Tyukin I.Yu. Neyrosetevye sistemy upravleniya [Control systems
with neurocomputers]. Moscow: IPRZhR, 2002, 480 p.
2. Gorban' A.N., Dunin-Barkovskiy V.L., Kirdin A.N. i dr. Neyroinformatika [NeuroInformatics].
Novosibirsk: Nauka. Sibirskoe predpriyatie RAN, 1998, 296 p.
3. Deng L., Yu D. Deep learning: methods and applications, Foundations and Trends in Signal
Processing, 2014, 7, pp. 197-387.
4. Mosavi A., Lopez A., Varkonyi-Koczy A. Industrial Applications of Big Data: State of the Art
Survey, Advances in Intelligent Systems and Computing, 2017.
5. Butenkov S.A. Komponenty gibridnykh neyrosetevykh intellektual'nykh sistem,
ispol'zuyushchie metod informatsionnoy granulyatsii [The hybrid neuro systems components,
based on information granulation], Sb. trudov XII Vserossiyskoy nauchno-tekhnicheskoy
konferentsii “Neyroinformatika–2010”, Moskva, 25-29 yanvarya 2010 [Collection of proceedings
of the XII All-Russian Scientific and Technical Conference “Neuroinformatics-2010”,
Moscow, January 25-29, 2010], Vol. 2, pp. 54-64.
6. Butenkov S.A. Granulyatsiya i inkapsulyatsiya v sistemakh effektivnoy obrabotki mnogomernoy
informatsii [Data granulation an encapsulation in efficient multidimensional data processing
systems], Iskusstvennyy intellect [Artificial Intelligence], 2005, No. 4, pp. 106-115.
7. Butenkov S.A., Zhukov A.L. Informatsionnaya granulyatsiya na osnove izomorfizma
algebraicheskikh sistem [Information Granulation based on the Algebraic Systems Isomorphism],
Sb. trudov Mezhdunarodnoy algebraicheskoy konferentsii, posvyashchennoy 80-letiyuso dnya rozhdeniya A.I. Kostrikina, Nal'chik, 12-18 iyulya 2009 g. [Collection of proceedings
of the International Algebraic Conference dedicated to the 80th anniversary of the birth of
A.I. Kostrikina, Nalchik, July 12-18, 2009], pp. 206-209.
8. Butenkov S., Krivsha V., Al-Dhouyani S. Granular Computing in Computer Image Perception:
basic issues and Glass Box models, In Proc. IASTED Conf. “AIA 2006”, Innsbruk, Austria,
February 16-18 2006, pp. 811-816.
9. Butenkov S.A., Krivsha V.V., Krivsha N.S. Recognition and Perception of Images. Fundamentals
and Applications. Chap. 7. The use of Mathematical Apparatus of Spatial Granulation in
the Problems of Perception and Image Recognition, 2021, John Wiley & Sons, Inc., 111, River
Street, Hoboken, NJ07030, USA, pp. 220-257. ISBN 9871119750550.
10. Weng J., Ahuja N. and Huang T.S. Cresceptron: a self-organizing neural network which grows
adaptively, Proc. International Joint Conference on Neural Networks, Baltimore, Maryland,
June, 1992, Vol. I, pp. 576-581.
11. Zadeh L.A. Toward a theory of fuzzy information granulation and its centrality in human reasoning
and fuzzy logic, Fuzzy Sets and Systems, 1997, Vol. 90, pp. 111-127.
12. Butenkov S.A. Algebraicheskie modeli v zadachakh intellektual'nogo analiza mnogomernykh
dannykh [The algebraic models for the intelligent data analysis], Sb. trudov mezhdunarodnoy
konferentsii “Matematicheskaya teoriya sistem 2009”, Moskva, 26-30 yanvarya 2009 [Collection
of proceedings of the international conference “Mathematical theory of systems 2009”,
Moscow, January 26-30, 2009], pp. 93-101.
13. Butenkov S.A., Krivsha V.V., Krivsha N.S. Topologicheskie prostranstvennye otnosheniya v
modelyakh granulirovaniya mnogomernykh dannykh [Topological spatial relations for the
granulated multidimensional data models], Izvestiya YuFU. Tekhnicheskie nauki [Izvestiya
SFedU. Engineering Sciences], 2010, No. 6, pp. 200-211.
14. Kohonen T. Self-Organizing Maps. Berlin–New York: Springer-Verlag. Third extended edition,
2001.
15. Butenkov S.A. Matrichnye neyronnye seti dlya zadach obrabotki i klassifikatsii mnogomernykh
dannykh [Matrix Neuro Nets in the Multidimensional Data Processing and Classification Problems],
Mater. Vos'moy Vserossiyskoy mul'tikonferentsii po problemam upravleniya “MKPU–
2015”, Gelendzhik, 28 sentyabrya –3 oktyabrya 2015 g. [Materials of the Eighth All-Russian
Multi-Conference on Control Problems “MKPU-2015”, Gelendzhik, September 28 – October
3, 2015]. Rostov-on-Don: Izd-vo YuFU, 2015, Vol. 1, pp. 34-39.
16. Butenkov S., Zhukov A., Nagorov A., Krivsha N. Granular Computing Models and Methods
Based on the Spatial Granulation, XII Int. Symposium «Intelligent Systems», INTELS’16, 5-7
October 2016, Moscow, Russia. Elsevier Procedia Computer Science, 2017, 103, pp. 295-302.
17. Yarushkina N.G. Osnovy teorii nechetkikh i gibridnykh system [The fundamentals of fuzzy
and hybrid systems]. Moscow: Finansy i statistika, 2004, 320 p.
18. Krivsha N., Krivsha V., Butenkov S. The Analytical Approach to the Parameterized Fuzzy
Operators Design, 13th International Symposium Intelligent Systems - 2018 (INTELS'18),
Elsevier Procedia Computer Science, 2019, St. Petersburg, Russia, pp. 193-200.
19. Butenkov S.A., Krivsha V.V., Krivsha N.S. Modelirovanie struktur vychisleniya [Modeling
computation structures], Informatizatsiya i svyaz' [Informatization and communication], 2021,
No. 3, pp. 28-32.
20. Butenkov S.A. Strukturnaya organizatsiya granulirovannykh vychisleniy pri obrabotke dannykh
na rekonfiguriruemykh vychislitel'nykh sistemakh [The structural organization of the granulated
computing for the data processing by the reconfigurable computers], Izvestiya YuFU.
Tekhnicheskie nauki [Izvestiya SFedU. Engineering Sciences], 2018, No. 8 (202), pp. 250-262.
Published
2024-01-05
Section
SECTION I. INFORMATION PROCESSING ALGORITHMS