
Discovering sequential patterns in source codes is an important issue in software engineering since it can provide useful knowledge to help in a variety of tasks such as code completion, code refactoring, developer profiling, and code complexity measurement. This paper proposes a new framework, called Source Code Miner (SCodeMiner), which discovers frequent sequential rules within a software project. The proposed framework firstly transforms a Java code into a sequence data and then applies a sequential pattern mining (SPM) algorithm. This study is also original in that it compares four SPM algorithms in terms of computational time, including sequential pattern discovery using equivalence classes (SPADE), prefix-projected sequential pattern mining (PrefixSpan), bi-directional extension (BIDE+), and last position induction (LAPIN). The experiments that carried out on an open-source software project showed that the proposed SCodeMiner framework is an effective mining tool in identifying coding patterns.
source code analysis, Technology, yazılım mühendisliği, sequential pattern mining, Science (General), T, Science, Q, kaynak kod analizi, kaynak kod örüntüleri, Engineering (General). Civil engineering (General), Q1-390, sıralı örüntü madenciliği, TA1-2040, source code patterns, software engineering
source code analysis, Technology, yazılım mühendisliği, sequential pattern mining, Science (General), T, Science, Q, kaynak kod analizi, kaynak kod örüntüleri, Engineering (General). Civil engineering (General), Q1-390, sıralı örüntü madenciliği, TA1-2040, source code patterns, software engineering
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
