TRANSFORMATION AND ANALYSIS OF INFORMATION WHEN CREATING A DATABASE OF PARTICIPANTS OF THE GREAT PATRIOTIC WAR 1941-1945 IN THE MEMORIAL COMPLEX «ROAD OF MEMORY» IN THE MAIN RUSSIAN ARMED FORCES CATHEDRAL ON THE BASIS OF COMPUTER METHODS OF INFORMATION PR
Abstract
Preserving the historical memory of the participants of the Great Patriotic War
1941–1945 is a world-class task that should preserve the truth about the most terrible war and the
feat of our people. In modern conditions, attracting interest in history, traditions and finally
recognition of one's duty to the past generations requires modern methods. One of these methods
is the transformation of information, which allows you to present this information in such a way
that it can be used most effectively. At the same time, the main goal in the transformation of historical
data is to optimize their representations and formats and not change the information content.
The presented algorithms of transformation and analysis of information when creating a database
of participants of the Great Patriotic War were aimed at maximizing the preservation of historical
value and reliability of information. To achieve this goal, computer methods of information processing
for normalization and consolidation of personal data obtained from various sources are
considered. The analysis of the content of information in archival documents with the presentation
of statistical data on the number of documents (records) from various sources (archives, databases,
information resources, etc.) is carried out and the procedure for translating information
from archival documents into electronic form, which has been applied in practice, is described.
Based on the analysis of the information, diagrams of the content of personal information in archival
sources are constructed, the stages of systematization and bringing the generalized information
array records to a single format are presented, as well as the procedure for combining and
deleting duplicate records. For the possibility of using in other projects, an algorithm for consolidating
data obtained from various sources is described in detail, and its block diagram is constructed.
In addition, the applied fuzzy search algorithms are described, which made it possible to
minimize errors in records, as well as image comparison algorithms for searching for duplicates
from photographs. All of these algorithms have made it possible to bring together information
contained on various media, having different structures and geographical location. The created
information resource allows you to enormously reduce the resources needed to find the necessary
information, including access to which was limited or not at all. Further improvement of algorithms
for normalization and consolidation of information can serve as a basis for data migration
from outdated to promising systems, as well as for the formation of information resources from
existing heterogeneous archival funds.
References
gody: ukaz Prezidenta RF ot 09.05.2017 № 203 [Strategy of information society development
in the Russian Federation for 2017-2030 years: the decree of the President of the Russian Federation
dated 09.05.2017 No. 203]. Available at: http://www.kremlin.ru/acts/bank/41919 (accessed
05 April 2021).
2. O natsional'nykh tselyakh i strategicheskikh zadachakh razvitiya Rossiyskoy Federatsii na
period do 2024 goda: ukaz Prezidenta RF ot 07.05.2018 № 204 [On national goals and strategic
objectives of the development of the Russian Federation for the period up to 2024: Decree
of the President of the Russian Federation dated 07.05.2018 No. 204]. Available at:
http://www.kremlin.ru/acts/bank/43027 (accessed 02 May 2021).
3. Mikrin E.A. i dr. Informatsionnoe obespechenie sistem organizatsionnogo upravleniya
(teoreticheskie osnovy): v 3-kh ch. Ch. 3. Metody povysheniya kachestva informatsionnogo
obespecheniya sistem organizatsionnogo upravleniya [Information support of organizational
management systems (theoretical foundations): in 3 part. Part 3. Methods of improving the
quality of information support of organizational management systems] ed. by E.A. Mikrina,
V.V. Kul'by. Moscow: Fizmatlit, 2012, 528 p.
4. Pravila organizatsii, khraneniya, komplektovaniya, ucheta i ispol'zovaniya dokumentov
Arkhivnogo fonda Rossiyskoy Federatsii i drugikh arkhivnykh dokumentov v
gosudarstvennykh i munitsipal'nykh arkhivakh, muzeyakh i bibliotekakh, organizatsiyakh
Rossiyskoy akademii nauk [Rules of organization, storage, acquisition, accounting and use of
documents of the Archival Fund of the Russian Federation and other archival documents in
state and municipal archives, museums and libraries, organizations of the Russian Academy of
Sciences]. Moscow, 2007.
5. Rekomendatsii po komplektovaniyu, uchetu i organizatsii i khraneniyu elektronnykh
arkhivnykh dokumentov v gosudarstvennykh i munitsipal'nykh arkhivakh [Recommendations
on the acquisition, accounting and organization and storage of electronic archival documents
in state and municipal archives]. Moscow, 2014.
6. Pravila izdaniya istoricheskikh dokumentov v SSSR. Glavarkhiv SSSR, VNIIDAD, AN SSSR, In-t
istorii SSSR, Arkheogr. Komis. IML pri TSK KPSS, Gos. kom. SSSR po pechati, MGIAI [Rules
for publishing historical documents in the USSR. Glavarchiv of the USSR, VNIIDAD, Academy of
Sciences of the USSR, Institute of History of the USSR, Archeogr. Comis. IML at the Central
Committee of the CPSU, State. com. USSR Press, MGIA]. 2nd ed. Moscow, 1990, 187 p.
7. Veber Kh., Derr M. Otsifrovka kak metod obespecheniya sokhraneniya [Digitization as a
method of ensuring preservation], Nauchnye i tekhnicheskie biblioteki [Scientific and technical
libraries], 1998, No. 10.
8. Veretekhina S.V. Otsifrovka arkhivnykh dokumentov v formate PDF/A [Digitization of archival
documents in PDF format/A], Innovatsii v nauke [Innovations in Science], 2016, No. 2 (51).
Available at: https://cyberleninka.ru/article/n/otsifrovka-arhivnyh-dokumentov-v-formate-pdfa
(accessed 21 March 2021).
9. Gref G. Informatsionnye tekhnologii - otstoy: budushchee za ekonomikoy dannykh [Information
technology sucks: the future belongs to the data economy]. Available at:
http://2035.media/2017/10/24/gref-future/ (accessed 16 May 2021).
10. Goremykin V.P. Glavnoe upravlenie kadrov Ministerstva oborony Rossiyskoy Fede-ratsii: istoriya i
sovremennost' [The Main Personnel Directorate of the Ministry of Defense of the Russian Federation:
history and modernity], Voennaya mysl' [Military Thought], 2013, No. 6, pp. 3-10.
11. Gusenkov A.M. Intellektual'nyy poisk slozhnykh ob"ektov v massivakh bol'shikh dannykh
[Intelligent search for complex objects in big data arrays], Rossiyskiy nauchnyy elektronnyy
zhurnal [Russian Scientific Electronic Journal.], 2016, Vol. 19, No. 1, pp. 40-76.
12. Dyuk V. Data Mining - intellektual'nyy analiz dannykh [Data Mining - data mining]. Available
at: https://iteam.ru/publications/it/section_92/article_1448/ (accessed 14 June 2021).
13. Ermolenko V.V., Ermoshina A.S., Lanskaya D.V. Elektronnye dokumenty, otsifrovka i
arkheologiya znaniy [Documentos electrónicos, digitalización y arqueología del
conocimiento], EGI [Investigación natural-humanitaria], 2020, No. 3 (29). Available at:
https://cyberleninka.ru/article/n/elektronnye-dokumenty-otsifrovka-i-arheologiya-znaniy (accessed
21 Octubre 2021).
14. Kul'ba V.V., Mamikonov A.G., Pelikhov V.P. Metody povysheniya dostovernosti i sokhrannosti
informatsii v ASU: Obzor [Métodos para aumentar la confiabilidad y la seguridad de la
información en ACS: Revisión], Avtomatika i telemekhanika [Automatización y telemecánica],
1985, No. 2, pp. 5-33.
15. Lisitsin A.L. Programmnyy kompleks otsifrovki dokumentov na bumazhnoy osnove [Software
package for digitizing paper-based documents], Computational nanotechnology, 2015, No. 3.
Available at: https://cyberleninka.ru/article/n/programmnyy-kompleks-otsifrovki-dokumentovna-
bumazhnoy-osnove (accessed 21 October 2021).
16. Manning K., Ragkhavan P., Shyuttse Kh. Vvedenie v informatsionnyy poisk [Introduction to
information search]. Moscow: Vil'yams, 2015, 528 p.
17. Morozova T.Yu. Bazovye tekhnologii avtomatizatsii proizvodstva [Basic technologies of production
automation], Promyshlennye ASU i kontrolery [ndustrial automated control systems
and controllers], 2013, No. 6, pp. 3-8.
18. Pavel'eva E.A. Ispol'zovanie fazovoy informatsii v obrabotke izobrazheniy [The use of phase
information in image processing], Tr. mezhdunarodnoy konferentsii po komp'yuternoy grafike i
zreniyu GraphiCon2016 [Proceedings of the International Conference on Computer Graphics
and Vision GraphiCon2016], 2016, pp. 476-483.
19. Putin V.V. 75 let Velikoy Pobedy: obshchaya otvetstvennost' pered istoriey i budushchim:
ofitsial'nyy sayt Prezidenta Rossiyskoy Federatsii [75 years of the Great Victory: shared responsibility
to history and the future: official website of the President of the Russian Federation],
Novosti, Vystupleniya i stenogrammy. 2020 [News, Speeches and transcripts. 2020].
Available at: http//kremlin.ru/events/president/news/63527 (accessed 02 April 2021).
20. Khvorova I.E. Protsess otsifrovki dokumentov dlya sozdaniya elektronnogo arkhiva [The process
of digitizing documents to create an electronic archive], Istoriya i arkhivy [History and
archives], 2017, No. 2, pp. 22-29.
21. Ernst V. Arkhivatsiya: arkhiv kak khranilishche pamyati i ego instrumentalizatsiya pri
natsional-sotsializme [Archiving: archive as a storage of memory and its instrumentalization
under National Socialism], Novoe literaturnoe obozrenie [New Literary Review], 2005, No.
74. Available at: http://magazines.russ.ru/nlo/2005/74/ern9.html (accessed 22 April 2021).
22. Yumasheva Yu.Yu. Metodicheskie rekomendatsii po elektronnomu kopirovaniyu arkhivnykh
dokumentov i upravleniyu poluchennym informatsionnym massivom [Methodological recommendations
on electronic copying of archival documents and management of the received
information array]. Moscow: VNIIDAD. 2012, 217 p.
23. Yumasheva Yu.Yu. Nauchnoe izdanie istoricheskikh dokumentov v elektronnoy srede:
problemy istochnikovedeniya i arkheografii [Scientific publication of historical documents in
the electronic environment: problems of source studies and archeography], Istoricheskaya
informatika [Historical Informatics], 2017, No. 1 (19). Available at: https://cyberleninka.ru/ article/
n/nauchnoe-izdanie-istoricheskih-dokumentov-v-elektronnoy-srede-problemyistochnikovedeniya-
i-arheografii (accessed 21 May 2021).
24. ISO 12653-1:2007. Obrabotka izobrazheniy elektronnaya. Test-ob"ekt dlya cherno-belogo
skanirovaniya ofisnykh dokumentov [ISO 12653-1:2007. Image processing is electronic. Test
object for black-and-white scanning of office documents].
25. ISO 29861:2009. Prikladnye sistemy upravleniya dokumentami. Kontrol' kachestva
skanirovaniya tsvetnykh uchrezhdencheskikh dokumentov [ISO 29861:2009. Applied document
management systems. Quality control of scanning of color institutional documents].
26. Aggarwal C.C. Data streams: models and algorithms. Springer Science & Business Media,
2007, 353 p. DOI: 10.1007/978-0-387-47534-9.
27. Khan F.A., Ahmad A., Imran M., Alharbi M., Jan B. Efficient data access and performance
improvement model for virtual data warehouse, Sustainable cities and society, 2017, Vol. 35,
pp. 232-240. DOI: 10.1016/j.scs.2017.08.003.