论文标题

科学出版物与方程式和公式的相似性指数,自位态性的识别以及对Ithenticate系统的测试

The similarity index of scientific publications with equations and formulas, identification of self-plagiarism, and testing of the iThenticate system

论文作者

Polyanin, Andrei D., Shingareva, Inna K.

论文摘要

首次讨论了估计数学和其他科学出版物的相似性指数的问题。结果表明,方程和公式的存在(以及图,图纸和表格)是一个复杂的因素,使对此类文本的研究显着复杂。结果表明,根据考虑单个数学符号以及方程式和公式的一部分,确定出版物的相似性索引的方法是无效的,并且可能导致错误甚至完全荒谬的结论。目前在科学期刊中使用的最受欢迎的软件系统的可能性进行了研究,以检测窃和自位式主义。提出了由特定示例和包含方程式(PDES和ODE),精确解决方案和某些公式的特定示例和特殊测试问题的处理系统的处理结果。已经确定,该软件系统在分析不均匀的文本时通常无法将自位态主义与伪自然主义(虚假的自我位plagiarism)区分开。考虑了模型复杂的情况,其中自位态主义的识别需要狭窄概况的高素质专家的参与。提出了改善软件系统工作的各种方法,以比较不均匀文本。本文将对数学,物理和工程科学领域的研究人员和大学教师,处理图像识别和数字图像处理的研究主题的程序员以及对窃和自我普莱格主义问题感兴趣的广泛读者。

The problems of estimating the similarity index of mathematical and other scientific publications containing equations and formulas are discussed for the first time. It is shown that the presence of equations and formulas (as well as figures, drawings, and tables) is a complicating factor that significantly complicates the study of such texts. It is shown that the method for determining the similarity index of publications, based on taking into account individual mathematical symbols and parts of equations and formulas, is ineffective and can lead to erroneous and even completely absurd conclusions. The possibilities of the most popular software system iThenticate, currently used in scientific journals, are investigated for detecting plagiarism and self-plagiarism. The results of processing by the iThenticate system of specific examples and special test problems containing equations (PDEs and ODEs), exact solutions, and some formulas are presented. It has been established that this software system when analyzing inhomogeneous texts, is often unable to distinguish self-plagiarism from pseudo-self-plagiarism (false self-plagiarism). A model complex situation is considered, in which the identification of self-plagiarism requires the involvement of highly qualified specialists of a narrow profile. Various ways to improve the work of software systems for comparing inhomogeneous texts are proposed. This article will be useful to researchers and university teachers in mathematics, physics, and engineering sciences, programmers dealing with problems in image recognition and research topics of digital image processing, as well as a wide range of readers who are interested in issues of plagiarism and self-plagiarism.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源