519.7 Modeling and efficiency analysis of perceptual hash functions for segmented image search

Valishin A. A. (Bauman Moscow State Technical University), Zaprivoda A. V. (Bauman Moscow State Technical University), Tsukhlo S. S. (Bauman Moscow State Technical University)

COMPUTER VISION, PERCEPTUAL HASHING, SEMANTIC SEGMENTATION, DISCRETE COSINE TRANSFORM, RADON TRANSFORM


doi: 10.18698/2309-3684-2024-2-4667


This research paper explores the use of perceptual hash functions to improve the retrieval efficiency of aerial photography and satellite remote sensing images segmented by a convolutional neural network. This paper describes three hashing algorithms. The first algorithm is based on the use of a low-pass filter and is aimed at reducing image detail in order to highlight the most stable image features. The second algorithm uses a two-dimensional discrete cosine transform to create an image hash. The third algorithm is based on the Radon transform, which allows you to extract information about the directions of lines in the image, as well as provide maximum invariance to the rotation transformation of the input image. The article also tests these algorithms, including analysis of their invariance to transformations for rotation, scaling and shifting of the source image. Test results show that the algorithm based on the Radon transform exhibits good rotation invariance, but is sensitive to the quality of segmentation, which can lead to frequent collisions when searching for similar images. Algorithms using a two-dimensional discrete cosine transform and an algorithm using a low-pass filter turned out to be more stable and have a smaller spread of values. However, it should be noted that algorithms using a low-pass filter and 2D discrete cosine transform may not be applicable to rotated images. Based on the results of analysis and comparison of the performance of the algorithms, it is recommended to give preference to either the second or third algorithm, because each of them has its own advantages and disadvantages, and the decision to use a specific algorithm in the task of finding the most similar image must take into account the specific conditions and limitations of the problem, as well as the requirements for the quality of image comparison.


[1] Dostavalova A.M. Simulation of locally homogeneous radar images using different statistical criteria. Маthematical Modeling and Coтputational Methods, 2021, no. 4, pp. 103–120.
[2] Dostavalova A.M. Application of a finite mixture model in radar image processing. Маthematical Modeling and Coтputational Methods, 2020, no. 3, pp. 117–130.
[3] Hadmi A., Puech W., Es Said B.A., Ait A. Perceptual Image Hashing. Watermarking — Volume 2. InTech, 2012. DOI: 10.5772/37435
[4] Salomon D. Szhatie dannyh, izobrazhenij i zvuka [Compression of data, images and sound]. Moscow, Technosphere Publ., 2004, 368 p.
[5] Marchevsky I.K., Puzikova V.V. Performance analysis of iterative methods of combined linear algebraic equations solution. Маthematical Modeling and Coтputational Methods, 2014, no. 4, pp. 37–52.
[6] Loeffler C., Ligtenberg A., Moschytz G.S. Practical fast 1-D DCT algorithms with 11 multiplications. International Conference on Acoustics, Speech, and Signal Processing, 1989, vol. 2, pp. 988–991. DOI: 10.1109/ICASSP.1989.266596
[7] Nguyen D.Q., Weng L., Preneel B. Radon Transform-Based Secure Image Hashing. CMS 2011: Communications and Multimedia Security, 2011, p. 186–193. DOI: 10.1007/978-3-642-24712-5_1
[8] Standaert F.-X., Lefebvre E., Rouvroy G., Macq B., Quisquater J.-J., Legat J.-D., Practical evaluation of a radial soft hash algorithm. International Conference on Information Technology: Coding and Computing (ITCC'05), 2005, vol. 2, pp. 89–94. DOI: 10.1109/ITCC.2005.229
[9] Xia J., Yokoya N., Adriano B., Broni-Bediako C. OpenEarthMap: A Benchmark Dataset for Global High-Resolution Land Cover Mapping. arXiv.org, 2022. DOI: https://arxiv.org/abs/2210.10732
[10] Ronneberger O., Fischer P., Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. arXiv.org, 2015. URL: https://arxiv.org/abs/2210.10732


Валишин А.А., Запривода А.В., Цухло С.С. Моделирование и сравнительный анализ эффективности перцептивных хеш-функций для поиска сегментированных изображений. Математическое моделирование и численные методы, 2024, № 2, с. 46-67.



Download article

Количество скачиваний: 223