Skip to main navigation Skip to search Skip to main content

Learning the truth vector in high dimensions

  • University of Science and Technology of China

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Truth Discovery is an important learning problem arising in data analytics related fields. It concerns about finding the most trustworthy information from a dataset acquired from a number of unreliable sources. The problem has been extensively studied and a number of techniques have already been proposed. However, all of them are of heuristic nature and do not have any quality guarantee. In this paper, we formulate the problem as a high dimensional geometric optimization problem, called Entropy based Geometric Variance. Relying on a number of novel geometric techniques, we further discover new insights to this problem. We show, for the first time, that the truth discovery problem can be solved with guaranteed quality of solution. Particularly, it is possible to achieve a (1+ϵ)-approximation within nearly linear time under some reasonable assumptions. We expect that our algorithm will be useful for other data related applications.

Original languageEnglish
Pages (from-to)78-94
Number of pages17
JournalJournal of Computer and System Sciences
Volume109
DOIs
StatePublished - May 2020

Keywords

  • Approximation algorithm
  • Entropy
  • High dimension
  • Truth discovery

Fingerprint

Dive into the research topics of 'Learning the truth vector in high dimensions'. Together they form a unique fingerprint.

Cite this