
doi: 10.3390/app142210497
handle: 10115/75878 , 10261/386755
Decision trees are a widely used machine learning technique due to their ease of interpretation and construction. This method allows domain experts to learn from raw data, but they cannot include their prior knowledge in the analysis due to its automatic nature, which implies minimal human intervention in its computation. Conversely, interactive visualization methods have proven to be effective in gaining insights from data, as they incorporate the researcher’s criteria into the analysis process. In an effort to combine both methodologies, we have developed a tool to manually build decision trees according to subsequent visualizations of data mapping after applying linear discriminant analysis in combination with Star Coordinates in order to analyze the importance of each feature in the separation. The nodes’ information contains data about the features that can be used to split and their cut-off values, in order to select them in a guided manner. In this way, it is possible to produce simpler and more expertly driven decision trees than those obtained by automatic methods. The resulting decision trees reduces the tree size compared to those generated by automatic machine learning algorithms, obtaining a similar accuracy and therefore improving their understanding. The tool developed and presented here to manually create decision trees in a guided manner based on the subsequent visualizations of the data mapping facilitates the use of this method in real-world applications. The usefulness of this tool is demonstrated through a case study with a complex dataset used for motion recognition, where domain experts built their own decision trees by applying their prior knowledge and the visualizations provided by the tool in node construction. The resulting trees are more comprehensible and explainable, offering valuable insights into the data and confirming the relevance of upper body features and hand movements for motion recognition.
Technology, Linear discriminant analysis, decision trees, linear discriminant analysis, QH301-705.5, T, Physics, QC1-999, Decision trees, Engineering (General). Civil engineering (General), multivariate visualization, Chemistry, Motion recognition, Visual data mining, visual data mining, motion recognition, Multivariate visualization, TA1-2040, Biology (General), QD1-999
Technology, Linear discriminant analysis, decision trees, linear discriminant analysis, QH301-705.5, T, Physics, QC1-999, Decision trees, Engineering (General). Civil engineering (General), multivariate visualization, Chemistry, Motion recognition, Visual data mining, visual data mining, motion recognition, Multivariate visualization, TA1-2040, Biology (General), QD1-999
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
