LARGE LANGUAGE MODELS APPLICATION IN ORGANIZATION OF REPLENISHMENT OF THE KNOWLEDGE BASE ON METHODS OF INFORMATION PROCESSING IN SYSTEMS OF APPLIED PHOTOGRAMMETRY
Abstract
The article deals with the issues related to the automation of the procedure of synthesis of applied
photogrammetry systems. Such systems serve to measure and account for objects from images and are
now widely utilized in various fields of activity, such as mapping, archaeology and aerial photography.
Increasing availability and mobility of imaging devices has also contributed to the widespread application.
All this has led to active research aimed at developing methods and algorithms for applied photogrammetry
systems. Manual tracking of new methods and algorithms of photogrammetric information
processing for a wide range of application areas is quite difficult, which makes the automation of this
procedure urgent. The solution proposed in the article is based on the use of a knowledge base of information
processing methods in applied photogrammetry systems, the main elements of which are a fuzzy
ontology of the subject area and a database, which is logical, since the information about the subject area
can be structured quite easily. As a basis for the ontology, an existing solution was taken, which was supplemented
based on the results of analyzing the current state of the subject area. The resulting ontology
was further used to search and classify information processing methods in applied photogrammetry systems
and to populate the knowledge base. Due to the intensification of the development of new methods of information processing in the systems of applied photogrammetry, there is a need to modify the ontology
and to replenish the database, i.e. to replenish the knowledge base. The Internet is an important source of
information for this purpose. To automate the search for data on information processing methods and
ontology modification, it is reasonable to use large language models. To automate data mining of information
processing methods and to populate the knowledge base, it is useful to use large language models
that simplify several natural language processing tasks, which include clustering and formation of new
entities for classification. The corresponding method is described in the paper. For the method the results
of testing its performance are given. As part of problem solving, a comparative analysis of large language
models has been carried out, resulting in the RoBERTa model.
References
fotogrammetriya [Modern digital photogrammetry], Mezhdunarodnyy zhurnal prikladnykh nauk i
tekhnologiy «Integral» [International Journal of Applied Science and Technology "Integral"], 2019,
No. 4 (2), pp. 41-47.
2. Krasnopevtsev B.V. Fotogrammetriya [Photogrammetry]. Moscow: UPP "Reprografiya" MIIGAiK,
2008, 160 p.
3. Altukhov V.G. Issledovaniye tochnosti fotogrammetrii kak metoda opredeleniya ob"ema ob"yekta [Study of
the accuracy of photogrammetry as a method for determining the volume of an object], Avtomatika i
programmnaya inzheneriya [Automation and Software Engineering], 2020, No. 2 (32), pp. 69-74.
4. Bezmenov V.M. Primeneniye metodov fotogrammetrii v voprosakh tochnosti obespecheniya
prostranstvennogo i uglovogo polozheniya snimkov distantsionnogo zondirovaniya Zemli [Application
of photogrammetry methods in matters of accuracy of ensuring the spatial and angular position of
Earth remote sensing images], Sovremennyye problemy distantsionnogo zondirovaniya Zemli iz
kosmosa [Sovremennyye problemy distantsionnogo zondirovaniya Zemli iz kosmosa], 2020,
No. 17(5), pp. 45-52.
5. Mikhailova M.V., Akhmedov A.N., Shagibalov R.R. Fotogrammetriya. Osnovnye printsipy i
prakticheskoye primeneniye. Priborostroyenie, metrologiya i informatsionno-izmeritel'nye pribory, i
sistemy [Photogrammetry. basic principles and practical application. Instrumentation, metrology and
information-measuring devices, and systems], Yestestvennyye i tekhnicheskiye nauki [Natural and
Technical Sciences], 2018, No. 5.
6. Grushin S.P., Sosnovsky I.A. Fotogrammetriya v arkheologii - metodika i perspektivy [Photogrammetry
in archeology - methods and prospects], Teoriya i praktika arkheologicheskikh issledovaniy [Theory
and practice of archaeological research], 2018, No. 1 (21).
7. Katermin V.S. Fotogrammetriya: 3D-model' iz fotografiy [Photogrammetry: 3D model from photographs],
Aktual'nyye nauchnyye issledovaniya v sovremennom mire [Current scientific research in the
modern world], 2021, No. 12-11(80), pp. 89-94.
8. Chukanov A.N., Tsoi E.V., Yakovenko A.A., Maliy D.V., Goncharov S.S. Fotogrammetriya v fiksatsii i
analize lokalizovannoy deformatsii 3d obraztsov [Photogrammetry in fixation and analysis of localized
deformation of 3d samples], Sovremennyye problemy i napravleniya razvitiya metallovedeniya i
termicheskoy obrabotki metallov i splavov [Modern problems and directions in the development of
metal science and heat treatment metals and alloys], 2023, pp. 168-173.
9. Anufriev V.N. Sozdaniye trekhmernoy modeli ob"yektov metodami fotogrammetrii [Creating a threedimensional
model of objects using photogrammetry methods: abstract to the thesis], 2022.
10. Sergeev N.E., Samoilov A.N., Polovko I.Yu. Ontologicheskoe predstavlenie fotogrammetricheskikh
metodov dlya resheniya zadach opredeleniya geometricheskikh parametrov ob"ektov po predvaritel'no
obrabotannym tsifrovym izobrazheniyam [Ontological representation of photogrammetric methods for
solving problems of determining the geometric parameters of objects from pre-processed digital images],
Vestnik AGU [Bulletin of ASU], 2020, No. 3(266), pp. 34-39.
11. Bengio Y., Courville A., Vincent P. Glubokoye obucheniye [Deep learning transl. from Engl.
A.A. Slinkina.]. Moscow: DMK Press, 2018, 482 p.
12. Rogers A., Hastie M. Obrabotka yestestvennogo yazyka s pomoshch'yu TensorFlow: sozdaniye prilozheniy
dlya mashinnogo obucheniya [Natural language processing with TensorFlow: creating applications for machine
learning lane: transl. from Engl. Logunova A.V.]. St. Petersburg: Peter, 2019, 352 p.
13. Vasiliev I.G., Larionov S.A. Modeli glubokogo obucheniya dlya obrabotki yestestvennogo yazyka:
ucheb. posobiye [Deep learning models for natural language processing: textbook]. Moscow: RUDN,
2020, 119 p.
14. Devlin J., Chang M.-W., Lee K., Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers
for Language Understanding, Proceedings of the 2019 Conference of the North American Chapter of
the Association for Computational Linguistics: Human Language Technologies, 2019, pp. 4171-4186.
15. Liu Y., Ott M., Goyal N., Du J., Joshi M., Chen D., Levy O., Lewis M., Zettlemoyer L., Stoyanov V.
RoBERTa: A Robustly Optimized BERT Pretraining Approach, arXiv preprint arXiv:1907.11692,
2019.
16. Radford A., Wu J., Child R., Luan D., Amodei D., Sutskever I. Language Models are Unsupervised
Multitask Learners, OpenAI Blog, 2019, Vol. 1, No. 8.
17. Reimers N., Gurevych I. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks,
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the
9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019, pp.
3980-3990.
18. Conneau A., Kiela D., Schwenk H., Barrault L., Bordes A. Supervised Learning of Universal Sentence
Representations from Natural Language Inference Data, Proceedings of the 2017 Conference on Empirical
Methods in Natural Language Processing, 2017, pp. 670-680.
19. Pennington J., Socher R., Manning C.D. GloVe: Global Vectors for Word Representation, Proceedings
of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014,
pp. 1532-1543.
20. Mikolov T., Chen K., Corrado G., Dean J. Efficient Estimation of Word Representations in Vector
Space, arXiv preprint arXiv:1301.3781, 2013.