Skip to main navigation Skip to search Skip to main content

Entity resolution using cloud computing

  • CUBRC

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Roles and capabilities of analysts are changing as the volume of data grows. Open-source content is abundant and users are becoming increasingly dependent on automated capabilities to sift and correlate information. Entity resolution is one such capability. It is an algorithm that links entities using an arbitrary number of criteria (e.g., identifiers, attributes) from multiple sources. This paper demonstrates a prototype capability, which identifies enriched attributes of individuals stored across multiple sources. Here, the system first completes its processing on a cloud-computing cluster. Then, in a data explorer role, the analyst evaluates whether automated results are correct and whether attribute enrichment improves knowledge discovery.

Original languageEnglish
Title of host publicationNext-Generation Analyst III
EditorsTimothy P. Hanratty, James Llinas, Barbara D. Broome, David L. Hall
PublisherSPIE
ISBN (Electronic)9781628416152
DOIs
StatePublished - 2015
EventNext-Generation Analyst III - Baltimore, United States
Duration: Apr 20 2015Apr 21 2015

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume9499
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

ConferenceNext-Generation Analyst III
Country/TerritoryUnited States
CityBaltimore
Period04/20/1504/21/15

Keywords

  • Cloud Computing
  • Entity Resolution
  • Hadoop
  • HBase
  • Knowledge Discovery
  • MapReduce
  • NLP
  • Ontology
  • RDF
  • User Interface

Fingerprint

Dive into the research topics of 'Entity resolution using cloud computing'. Together they form a unique fingerprint.

Cite this