Program clones detection as natural language texts fragments based on constructive-synthesizing modeling

dc.contributor.authorShynkarenko, Viktor I.en
dc.contributor.authorKuropiatnyk, Olena S.en
dc.contributor.authorDemidovich, Inna M.en
dc.date.accessioned2025-10-27T11:34:33Z
dc.date.issued2025
dc.descriptionThis article is published under a Creative Commons Attribution 4.0 International (CC BY 4.0) licence. V. Shynkarenko: ORCID 0000-0001-8738-7225; O. Kuropiatnyk: ORCID 0000-0003-2286-884X; I. Demidovich: ORCID 0000-0002-3644-184Xen
dc.description.abstractENG: The developed and tested method for comparing the structure of natural language texts is adapted to the analysis of program texts. The method is based on the use of stochastic grammars, including rules that describe the algorithmic structure of programs. The certain structures appearance probability is calculated as the product of the different program elements probabilities. Constructive-production modeling tools were used to form the rules. An experiment was conducted to verify the possibility of using this method to detect clones in the programs source text in C++ and C#. Different types of tasks and their software implementations were studied: both those that are equivalent in control flow but different in calculations, and vice versa. As a result of the experiments, it was found that programs that solve different tasks but have almost identical algorithms have high values of similarity indicators. If the algorithms are similar, but solve different tasks, the indicators are slightly lower. Similarity indicators from low to medium, obtained in cases where different tasks are solved with different algorithms that is due to the use of a single programming language syntax.en
dc.identifier.citationShynkarenko, V., Kuropiatnyk, O., & Demidovich, I. (2025). Program clones detection as natural language texts fragments based on constructive-synthesizing modeling. In N. Khairova, V. Vysotska, N. Grabar, & T. Hamon (Eds.), Proceedings of the Computational Linguistics Workshop (CLW-CoLInS 2025) at the 9th International Conference on Computational Linguistics and Intelligent Systems (CoLInS 2025), Kharkiv, Ukraine, May 15–16, 2025 (Vol. 3976, pp. 11-22). CEUR-WS.org. https://ceur-ws.org/Vol-3976/en
dc.identifier.issn1613-0073
dc.identifier.urihttps://ceur-ws.org/Vol-3976/
dc.identifier.urihttps://crust.ust.edu.ua/handle/123456789/21152
dc.language.isoen
dc.publisherCEUR Workshop Proceedingsen
dc.rightsAttribution 4.0 International (CC BY 4.0)en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectsoftwareen
dc.subjectconstructive-synthesizing modelingen
dc.subjectnatural languageen
dc.subjectformal languageen
dc.subjectprogram cloneen
dc.subjectinformation technologyen
dc.subjectКІТuk_UA
dc.titleProgram clones detection as natural language texts fragments based on constructive-synthesizing modelingen
dc.typeArticleen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
paper2.pdf
Size:
2.71 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: