Standard

Mining Patterns in Source Code using Tree Mining Algorithms. / Pham, Hoang Son; Nijssen, Siegfried; Mens, Kim; Di Nucci, Dario; Molderez, Tim Christiaan; De Roover, Coen; Fabry, Johan; Zaytsev, Vadim.

Proceedings of the 22nd International Conference on Discovery Science (DS2019). DS2019: 22nd International Conference on Discovery Science, 2019.

Research output: Chapter in Book/Report/Conference proceedingConference paper

Harvard

Pham, HS, Nijssen, S, Mens, K, Di Nucci, D, Molderez, TC, De Roover, C, Fabry, J & Zaytsev, V 2019, Mining Patterns in Source Code using Tree Mining Algorithms. in Proceedings of the 22nd International Conference on Discovery Science (DS2019). DS2019: 22nd International Conference on Discovery Science, 22nd International Conference on Discovery Science, Split, Croatia, 28/10/20.

APA

Pham, H. S., Nijssen, S., Mens, K., Di Nucci, D., Molderez, T. C., De Roover, C., ... Zaytsev, V. (2019). Mining Patterns in Source Code using Tree Mining Algorithms. In Proceedings of the 22nd International Conference on Discovery Science (DS2019) DS2019: 22nd International Conference on Discovery Science.

Vancouver

Pham HS, Nijssen S, Mens K, Di Nucci D, Molderez TC, De Roover C et al. Mining Patterns in Source Code using Tree Mining Algorithms. In Proceedings of the 22nd International Conference on Discovery Science (DS2019). DS2019: 22nd International Conference on Discovery Science. 2019

Author

Pham, Hoang Son ; Nijssen, Siegfried ; Mens, Kim ; Di Nucci, Dario ; Molderez, Tim Christiaan ; De Roover, Coen ; Fabry, Johan ; Zaytsev, Vadim. / Mining Patterns in Source Code using Tree Mining Algorithms. Proceedings of the 22nd International Conference on Discovery Science (DS2019). DS2019: 22nd International Conference on Discovery Science, 2019.

BibTeX

@inproceedings{bcf8bad3882046cf8af63f38b01a1e26,
title = "Mining Patterns in Source Code using Tree Mining Algorithms",
abstract = "Discovering regularities in source code is of great interest to software engineers, both in academia and in industry, as regularities can provide useful information to help in a variety of tasks such as code comprehension, code refactoring, and fault localisation. However, traditional pattern mining algorithms often find too many patterns of little use and hence are not suitable for discovering useful regularities. In this paper we propose FREQTALS, a new algorithm for mining patterns in source code based on the FREQT tree mining algorithm. First, we introduce several constraints that effectively enable us to find more useful patterns; then, we show how to efficiently include them in FREQT. To illustrate the usefulness of the constraints we carried out a case study in collaboration with software engineers, where we identified a number of interesting patterns in a repository of Java code.",
keywords = "Pattern Mining, Frequent Tree Mining, Source Code Regularities",
author = "Pham, {Hoang Son} and Siegfried Nijssen and Kim Mens and {Di Nucci}, Dario and Molderez, {Tim Christiaan} and {De Roover}, Coen and Johan Fabry and Vadim Zaytsev",
year = "2019",
month = "10",
day = "28",
language = "English",
isbn = "978-3-030-33778-0",
booktitle = "Proceedings of the 22nd International Conference on Discovery Science (DS2019)",
publisher = "DS2019: 22nd International Conference on Discovery Science",

}

RIS

TY - GEN

T1 - Mining Patterns in Source Code using Tree Mining Algorithms

AU - Pham, Hoang Son

AU - Nijssen, Siegfried

AU - Mens, Kim

AU - Di Nucci, Dario

AU - Molderez, Tim Christiaan

AU - De Roover, Coen

AU - Fabry, Johan

AU - Zaytsev, Vadim

PY - 2019/10/28

Y1 - 2019/10/28

N2 - Discovering regularities in source code is of great interest to software engineers, both in academia and in industry, as regularities can provide useful information to help in a variety of tasks such as code comprehension, code refactoring, and fault localisation. However, traditional pattern mining algorithms often find too many patterns of little use and hence are not suitable for discovering useful regularities. In this paper we propose FREQTALS, a new algorithm for mining patterns in source code based on the FREQT tree mining algorithm. First, we introduce several constraints that effectively enable us to find more useful patterns; then, we show how to efficiently include them in FREQT. To illustrate the usefulness of the constraints we carried out a case study in collaboration with software engineers, where we identified a number of interesting patterns in a repository of Java code.

AB - Discovering regularities in source code is of great interest to software engineers, both in academia and in industry, as regularities can provide useful information to help in a variety of tasks such as code comprehension, code refactoring, and fault localisation. However, traditional pattern mining algorithms often find too many patterns of little use and hence are not suitable for discovering useful regularities. In this paper we propose FREQTALS, a new algorithm for mining patterns in source code based on the FREQT tree mining algorithm. First, we introduce several constraints that effectively enable us to find more useful patterns; then, we show how to efficiently include them in FREQT. To illustrate the usefulness of the constraints we carried out a case study in collaboration with software engineers, where we identified a number of interesting patterns in a repository of Java code.

KW - Pattern Mining

KW - Frequent Tree Mining

KW - Source Code Regularities

M3 - Conference paper

SN - 978-3-030-33778-0

BT - Proceedings of the 22nd International Conference on Discovery Science (DS2019)

PB - DS2019: 22nd International Conference on Discovery Science

ER -

ID: 47774193