TY - GEN
T1 - Towards distributed ensemble clustering for networked sensing systems
T2 - 17th ACM International Symposium on Mobile Ad Hoc Networking and Computing, MobiHoc 2016
AU - Ding, Hu
AU - Su, Lu
AU - Xu, Jinhui
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/7/5
Y1 - 2016/7/5
N2 - Given a set of different clustering solutions to a unified dataset, ensemble clustering is to aggregate them to yield a more accurate and robust solution. In recent years, ensemble clustering has been extensively studied and successfully applied to many areas. In this paper, we study a new variant of ensemble clustering, distributed ensemble clustering, motivated by the proliferation of networked sensing systems where communication is enabled between only connected nodes. Our goal is to aggregate the clustering solutions produced by the sensor nodes that observe the same set of objects. Different from traditional ensemble clustering problems, distributed ensemble clustering aims to achieve not only accurate clustering results, but also low communication cost among the nodes. To this end, we build a novel geometric optimization model that can be efficiently solved with theoretical quality guarantee. The proposed approach, bearing nice geometric properties, can be easily adapted to distributed settings without any sacrifice of clustering quality, and facilitates a dimension reduction procedure which can significantly reduce the communication complexity. We validate our approach on two benchmark datasets. Experimental results suggest that our approach can efficiently solve the distributed ensemble clustering problem, and outperform the baselines on both clustering accuracy and communication cost.
AB - Given a set of different clustering solutions to a unified dataset, ensemble clustering is to aggregate them to yield a more accurate and robust solution. In recent years, ensemble clustering has been extensively studied and successfully applied to many areas. In this paper, we study a new variant of ensemble clustering, distributed ensemble clustering, motivated by the proliferation of networked sensing systems where communication is enabled between only connected nodes. Our goal is to aggregate the clustering solutions produced by the sensor nodes that observe the same set of objects. Different from traditional ensemble clustering problems, distributed ensemble clustering aims to achieve not only accurate clustering results, but also low communication cost among the nodes. To this end, we build a novel geometric optimization model that can be efficiently solved with theoretical quality guarantee. The proposed approach, bearing nice geometric properties, can be easily adapted to distributed settings without any sacrifice of clustering quality, and facilitates a dimension reduction procedure which can significantly reduce the communication complexity. We validate our approach on two benchmark datasets. Experimental results suggest that our approach can efficiently solve the distributed ensemble clustering problem, and outperform the baselines on both clustering accuracy and communication cost.
KW - Communication efficient
KW - Computational geometry
KW - Ensemble clustering
KW - Machine learning
KW - Sensor networks
UR - https://www.scopus.com/pages/publications/84979240809
U2 - 10.1145/2942358.2942391
DO - 10.1145/2942358.2942391
M3 - Conference contribution
AN - SCOPUS:84979240809
T3 - Proceedings of the International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc)
SP - 1
EP - 10
BT - MobiHoc 2016 - Proceedings of the 17th ACM International Symposium on Mobile Ad Hoc Networking and Computing
PB - Association for Computing Machinery
Y2 - 5 July 2016 through 8 July 2016
ER -