Creating and sharing knowledge for telecommunications

Clustering via binary embedding

Bicego, M. ; Figueiredo, M. A. T.

Pattern Recognition Vol. 83, Nº -, pp. 52 - 63, November, 2018.

ISSN (print): 0031-3203
ISSN (online):

Journal Impact Factor: 3,279 (in 2008)

Digital Object Identifier: 10.1016/j.patcog.2018.05.011

In this paper, we present a novel clustering scheme based on binary embeddings, which provides compact and informative binary representations of high-dimensional objects. The binary representations are obtained with a collection of one-class classifiers learned from (pseudo) randomly selected points in the dataset. To cluster the binary representations, we consider two approaches: a mixture of Bernoulli distributions and a recent biclustering approach called CRAFT. The empirical evaluation in comparison with both classic and recent clustering methods, based on 12 different datasets, provides encouraging results. The main feature of the proposed method is that it is agnostic to the shape of the clusters.