Similarity measure exercise for classification trees based on the classification path

Authors

  • Norsida Hasana Faculty of Science and Mathematics Universiti Pendidikan Sultan Idris 35900 Tanjong Malim, Perak, Malaysia.
  • M. B. Adam Institute for Mathematical Research (INSPEM) Universiti Putra Malaysia 43400 UPM Serdang, Selangor, Malaysia.
  • N. Mustapha Institute for Mathematical Research (INSPEM) Universiti Putra Malaysia 43400 UPM Serdang, Selangor, Malaysia.
  • M. R. Abu Bakar Institute for Mathematical Research (INSPEM) Universiti Putra Malaysia 43400 UPM Serdang, Selangor, Malaysia.

Abstract

Classification tree models are known for their simplicity and efficiency when dealing with domains contain large number of variables and cases. However, a small perturbation in the data, can lead to a very different tree. We introduce a method for measuring similarity between binary classification trees based on the similarity between the classification paths. The trees to be compared are represented in the form of matrices whose entries are in the interval [0,1]. Overlap similarity measure is used to measure the similarity between each pair of path in two trees, and the best matching paths between trees are used to calculate
the similarity measure. This method has advantage to measure trees that possess the same structure and leaf nodes but different internal node.

Keywords:

Binary classification tree, Classification paths, Overlap similarity measure

Downloads

Published

2012-12-31

How to Cite

Norsida Hasana, M. B. Adam, N. Mustapha, & M. R. Abu Bakar. (2012). Similarity measure exercise for classification trees based on the classification path. Applied Mathematics and Computational Intelligence (AMCI), 1(1), 33–41. Retrieved from https://ejournal.unimap.edu.my/index.php/amci/article/view/52