Skip to main navigation Skip to search Skip to main content

Designing a realistic evaluation of an end-to-end interactive question answering system

  • Nina Wacholder
  • , Sharon Small
  • , Bing Bai
  • , Diane Kelly
  • , Robert Rittman
  • , Sean Ryan
  • , Robert Salkin
  • , Peng Song
  • , Ying Sun
  • , Liu Ting
  • , Paul Kantor
  • , Tomek Strzalkowski
  • Rutgers - The State University of New Jersey, New Brunswick
  • SUNY Albany
  • University of North Carolina at Chapel Hill

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

We report on the development of material for an evaluation exercise designed to assess the overall design and usability of HITIQA, an interactive question-answering system for preparing broad ranging reports on complex issues. The two basic objectives of the evaluation were (1) To perform a realistic assessment of the usefulness and usability of HITIQA as an end-to-end system, from the information seeker's initial questions to completion of a draft report; and (2) To develop metrics to compare the answers obtained by different analysts and evaluate the quality of the support that HITIQA provides. We used qualitative and quantitative tools to obtain data about analyst's comfort with the HITIQA system, especially its novel features such as the ability to answer complex questions and the interactive dialogue. Because of the impracticality of measuring the quality of HITIQA output with the standard metrics of precision and recall, we developed a new task-cross-evaluation-to indirectly measure the quality of the answers obtained using HITIQA; in this black-box assessment, analysts rate the quality of their own and their colleagues' reports.

Original languageEnglish
Title of host publicationProceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004
EditorsMaria Francisca Xavier, Rute Costa, Fatima Ferreira, Maria Teresa Lino, Raquel Silva
PublisherEuropean Language Resources Association (ELRA)
Pages989-992
Number of pages4
ISBN (Electronic)2951740816, 9782951740815
StatePublished - 2004
Event4th International Conference on Language Resources and Evaluation, LREC 2004 - Lisbon, Portugal
Duration: May 26 2004May 28 2004

Publication series

NameProceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004

Conference

Conference4th International Conference on Language Resources and Evaluation, LREC 2004
Country/TerritoryPortugal
CityLisbon
Period05/26/0405/28/04

Fingerprint

Dive into the research topics of 'Designing a realistic evaluation of an end-to-end interactive question answering system'. Together they form a unique fingerprint.

Cite this