UDC 001.4+001.8+001.9

The similarity index of mathematical and other scientific publications with equations and formulas and the problem of self-plagiarism identification

Polyanin A. D. (Bauman Moscow State Technical University/Ishlinsky Institute for Problems in Mechanics/MEPhI), Shingareva I. K. (University of Sonora)


doi: 10.18698/2309-3684-2021-2-96116

The problems of estimating the similarity index of heterogeneous scientific publications containing equations and formulas are discussed for the first time. It is shown that the presence of equations and formulas (as well as figures, drawings, and tables) significantly complicates the analysis of such texts. It is also shown that determining the similarity index of publications by matching individual mathematical symbols and fragments of equations and formulas is ineffective and can lead to erroneous and even completely absurd conclusions. The capabilities of Antiplagiat and iThenticate, the most popular analytical systems currently used by scientific journals to detect plagiarism and self-plagiarism, are examined. Results of processing specific examples and test problems containing equations and formulas with the iThenticate system are presented. It is established that, when analyzing heterogeneous texts, this system is often unable to distinguish self-plagiarism from pseudo-self-plagiarism, that is, apparent self-plagiarism that is in fact false. A complex model situation is considered in which identifying self-plagiarism requires highly qualified, narrowly specialized experts. Various ways to improve analytical systems for comparing heterogeneous texts are proposed. The article will be useful to researchers and university teachers in physics, mathematics, and engineering, to programmers working on image recognition and digital image processing, and to a wide range of readers interested in plagiarism and self-plagiarism.
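The claim about symbol-level matching can be illustrated with a toy calculation. The sketch below is not the algorithm used by Antiplagiat or iThenticate (their internals are not described here); the tokenizer, the function names, and the two sample equations are assumptions chosen only for illustration. It computes a naive similarity index as the share of n-token shingles of one text that also occur in another.

```python
import re

def tokens(text):
    # Toy tokenizer (an assumption): runs of letters, runs of digits,
    # and every other non-space character as a separate token.
    return re.findall(r"[A-Za-z]+|\d+|\S", text)

def shingles(toks, n):
    # Set of all contiguous n-token sequences.
    return {tuple(toks[i:i + n]) for i in range(len(toks) - n + 1)}

def similarity_index(a, b, n):
    """Percentage of distinct n-token shingles of `a` that also occur in `b`."""
    sa, sb = shingles(tokens(a), n), shingles(tokens(b), n)
    return 100.0 * len(sa & sb) / len(sa) if sa else 0.0

# Two different equations written in plain-text notation (hypothetical samples).
first = "u_t = a u_xx + f(u)"
second = "u_tt = a u_xx + g(u)"

print(similarity_index(first, second, 1))  # matching individual symbols
print(similarity_index(first, second, 4))  # matching longer fragments
```

With single-symbol matching the two equations look almost identical, although they describe different problems; the score drops sharply once longer fragments are required. Mathematical notation reuses a small alphabet of symbols, so overlap at that level says little about borrowed content.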

Polyanin A.D., Shingareva I.K. The similarity index of mathematical and other scientific publications with equations and formulas and the problem of self-plagiarism identification. Mathematical Modeling and Computational Methods, 2021, no. 2, pp. 96–116.

The authors thank A.V. Aksenov, A.L. Levitin, and A.N. Filippov for their attention to the work and useful discussions.
