Skip to main navigation Skip to search Skip to main content

Identification of effective predictive variables for document qualities

  • Kwong Bor Ng
  • , Rong Tang
  • , Sharon Small
  • , Tomek Strzalkowski
  • , Paul Kantor
  • , Robert Rittman
  • , Peng Song
  • , Ying Sun
  • , Nina Wacholder
  • City University of New York
  • SUNY Albany
  • Rutgers - The State University of New Jersey, New Brunswick

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

We analyzed textual properties of documents to identify predictive variables for various document qualities by means of statistical and linguistic methods. We have created a collection of 1000 documents, each document has been judged in terms of nine document qualities (accuracy, reliability, objectivity, depth, author/producer credibility, readability, verbosity and conciseness, grammatical correctness, one-sided or multiview.) Employing statistical analyses, we considered a kind of linear combination, asking (1) if it was possible to combine textual features linearly to predict document qualities; (2) what textual features had good predictive power; (3) what textual features were minimally required for prediction with a detection rate much better than the false alarm rate. We present several promising results, indicating that with a few number of textual features, we can predict various document qualities much better than chance.

Original languageEnglish
Pages (from-to)221-229
Number of pages9
JournalProceedings of the ASIST Annual Meeting
Volume40
DOIs
StatePublished - Oct 2003

Fingerprint

Dive into the research topics of 'Identification of effective predictive variables for document qualities'. Together they form a unique fingerprint.

Cite this