Title Mišriųjų duomenų klasterizavimas taikant informacijos entropiją /
Translation of Title Application of information entropy for mixed data clustering.
Authors Venckus, Mindaugas
Full Text Download
Pages 58
Keywords [eng] clustering ; mixed data ; information entropy ; cluster validation
Abstract [eng] The main topic of this paper is mixed data clustering analysis and implementation. Clustering algorithm was implemented based on J. Liang, et al., Determining the number of clusters using information entropy for mixed data, Pattern Recognition (2012), doi:10.1016/j.patcog.2011.12.017 paper. Main contributions are: mixed datatype visualization using multidimensional scaling method, non random initial centers selection, validation of clustering results using Dunn and Davies- Bouldin indices. Modified algorithm was tested with synthetical and real data sets. Originally real data sets were with known groups, but before clustering labels from real data sets were removed. In all cases the number of true clusters were detected correctly.
Dissertation Institution Kauno technologijos universitetas.
Type Master thesis
Language Lithuanian
Publication date 2016