TY - GEN
T1 - Clustering under composite generative models
AU - Wang, Tiexing
AU - Bucci, Donald J.
AU - Liang, Yingbin
AU - Chen, Biao
AU - Varshney, Pramod K.
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/5/21
Y1 - 2018/5/21
N2 - This paper studies clustering of data samples generated from composite distributions using the Kolmogorov-Smirnov (KS) based K-means algorithm. All data sequences are assumed to be generated from unknown continuous distributions. The maximum intra-cluster KS distance of each distribution cluster is assumed to be smaller than the minimum inter-cluster KS distance of different clusters. The analysis of convergence and upper bounds on the error probability are provided for both cases with known and unknown number of clusters. Furthermore, it is shown that the probability of error decays exponentially as the number of samples in each data sequence goes to infinity, and the error exponent is only a function of the difference of the inter-cluster and intra-cluster KS distances. The analysis is validated by simulation results.
AB - This paper studies clustering of data samples generated from composite distributions using the Kolmogorov-Smirnov (KS) based K-means algorithm. All data sequences are assumed to be generated from unknown continuous distributions. The maximum intra-cluster KS distance of each distribution cluster is assumed to be smaller than the minimum inter-cluster KS distance of different clusters. The analysis of convergence and upper bounds on the error probability are provided for both cases with known and unknown number of clusters. Furthermore, it is shown that the probability of error decays exponentially as the number of samples in each data sequence goes to infinity, and the error exponent is only a function of the difference of the inter-cluster and intra-cluster KS distances. The analysis is validated by simulation results.
KW - K-means algorithm
KW - Kolmogorov-Smirnov distance
KW - composite distributions
KW - probability of error
UR - http://www.scopus.com/inward/record.url?scp=85048555402&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85048555402&partnerID=8YFLogxK
U2 - 10.1109/CISS.2018.8362256
DO - 10.1109/CISS.2018.8362256
M3 - Conference contribution
AN - SCOPUS:85048555402
T3 - 2018 52nd Annual Conference on Information Sciences and Systems, CISS 2018
SP - 1
EP - 6
BT - 2018 52nd Annual Conference on Information Sciences and Systems, CISS 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 52nd Annual Conference on Information Sciences and Systems, CISS 2018
Y2 - 21 March 2018 through 23 March 2018
ER -