Abstract
Truth Discovery is an important learning problem arising in data analytics related fields. It concerns about finding the most trustworthy information from a dataset acquired from a number of unreliable sources. The problem has been extensively studied and a number of techniques have already been proposed. However, all of them are of heuristic nature and do not have any quality guarantee. In this paper, we formulate the problem as a high dimensional geometric optimization problem, called Entropy based Geometric Variance. Relying on a number of novel geometric techniques, we further discover new insights to this problem. We show, for the first time, that the truth discovery problem can be solved with guaranteed quality of solution. Particularly, it is possible to achieve a (1+ϵ)-approximation within nearly linear time under some reasonable assumptions. We expect that our algorithm will be useful for other data related applications.
| Original language | English |
|---|---|
| Pages (from-to) | 78-94 |
| Number of pages | 17 |
| Journal | Journal of Computer and System Sciences |
| Volume | 109 |
| DOIs | |
| State | Published - May 2020 |
Keywords
- Approximation algorithm
- Entropy
- High dimension
- Truth discovery
Fingerprint
Dive into the research topics of 'Learning the truth vector in high dimensions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver