TSSF: Enhanced Phylogenetic Inference through Optimized Feature Selection and Computational Efficiency Analysis
DOI:
https://doi.org/10.24996/ijs.2026.67.3.34Keywords:
Cladistic Analysis, computational selection of features, Pattern systems in script analysis, Phylogenetic studies, Scriptinformatics, Arabic and Aramaic scriptsAbstract
In this paper, we introduce an improved feature selection algorithm TSSF designed to enhance computational efficiency in phylogenetic tree construction by evaluating feature subsets and identifying those that contribute positively or negatively to tree topology. Building on the symbolic aspects of pattern systems, which encompass symbols (data), syntax (relations between data points), and layout rules, our approach systematically assesses the effectiveness of feature subsets. This is illustrated using essential phylogenetic metrics, including the Consistency Index, Retention Index, and Rescaled Consistency Index. In these metrics, higher values suggest high-quality feature subsets, whereas lower scores on the Homoplasy Index indicate minimized noise within the feature set. A comprehensive analysis reveals that the algorithm efficiently handles non-linear complexities, enabling it to distinguish between good and bad feature sets while maintaining computational efficiency. The results underscore the importance of informed feature selection in improving computational performance and the accurate reconstruction of evolutionary history. Given its flexibility, our algorithm serves as a robust tool for analyzing the evolution of varied pattern systems, making a significant contribution to the emerging field of Scriptinformatics.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Iraqi Journal of Science

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.



