Abstract
We analyzed textual properties of documents to identify predictive variables for various document qualities by means of statistical and linguistic methods. We have created a collection of 1000 documents, each document has been judged in terms of nine document qualities (accuracy, reliability, objectivity, depth, author/producer credibility, readability, verbosity and conciseness, grammatical correctness, one-sided or multiview.) Employing statistical analyses, we considered a kind of linear combination, asking (1) if it was possible to combine textual features linearly to predict document qualities; (2) what textual features had good predictive power; (3) what textual features were minimally required for prediction with a detection rate much better than the false alarm rate. We present several promising results, indicating that with a few number of textual features, we can predict various document qualities much better than chance.
| Original language | English |
|---|---|
| Pages (from-to) | 221-229 |
| Number of pages | 9 |
| Journal | Proceedings of the ASIST Annual Meeting |
| Volume | 40 |
| DOIs | |
| State | Published - Oct 2003 |
Fingerprint
Dive into the research topics of 'Identification of effective predictive variables for document qualities'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver