+7 (495) 987 43 74 ext. 3304
Join us -              
Рус   |   Eng

Authors

Korepanova Veronika S.

Degree
Cand. Sci. (Eng.), Associate Professor, Digital Economy Department, Synergy University; Leading Engineer, LLC LUKOIL-Engineering
E-mail
vskorepanova5@gmail.com
Location
Moscow, Russia
Articles

The method of preprocessing machine learning data for solving computer vision problems

In the field of machine learning, there is no single methodology for data preprocessing, since all stages of this process are unique for a specific task. However, a specific data type is used in each direction. The research hypothesis assumes that it is possible to clearly structure the sequences and phases of data preparation for text recognition tasks. The article discusses the basic principles of data preprocessing and the allocation of successive stages as a specific technique for the task of recognizing ABC characters. ETL set images were selected as the source data. Preprocessing included the stages of working with images, at each of which changes were made to the source data. The first step was cropping, which allowed to get rid of unnecessary information in the image. Next, the approach of converting the image to the original aspect ratio was considered and the method of converting from shades of gray to black and white format was determined. At the next stage, the character lines were artificially expanded for better recognition of printed alphabets. At the last stage of data preprocessing, augmentation was performed, which made it possible to better recognize ABC characters regardless of their position in space. As a result, the general structure of the data preprocessing methodology for text recognition tasks was built. Read more...

Modification of the convolutional neural network architecture for determining the category of a land plot from satellite images

Correct classification of land plots by their types, for example, such as forest, agricultural, urbanized, water bodies, and others, is relevant for remote sensing of the Earth and the development of geoinformation technologies. The accuracy and reliability of the results of such categorization are of paramount importance for the efficient use of natural resources, rational land use, and environmental monitoring. The article presents an approach to solving the problem of categorizing land plots based on satellite images by applying a modified standard model of a convolutional neural network. The main attention is paid to the modification of the network architecture in order to improve the accuracy of land plot classification. The authors propose an approach to training and optimizing the network in order to solve this problem. The stages of data preparation are discussed in detail, including preprocessing satellite images, annotating them, and creating high-quality training samples. The presented approaches to network training and optimization include the use of modern regularization techniques, adaptive learning methods, and class balancing strategies, which allows efficient processing of both large amounts of data and more limited sets of specific information. To test the approach’s operability and obtain the values of quality indicators, experiments were conducted to train and test the model on various sets of satellite image data. The results of the experiment suggest that the accuracy of categorization achieved on the basis of the created model meets the requirements of the Federal Service for State Registration, Cadastre and Cartography for studying remote Territories for the suitability of land for their rational use, and the proposed method can be used to solve practical problems. Read more...