TY - GEN
T1 - A dataset for quality assessment of camera captured document images
AU - Kumar, Jayant
AU - Ye, Peng
AU - Doermann, David
PY - 2014
Y1 - 2014
N2 - With the proliferation of cameras on mobile devices there is an increased desire to image document pages as an alternative to scanning. However, the quality of captured document images is often lower than its scanned equivalent due to hardware limitations and stability issues. In this context, automatic assessment of the quality of captured images is useful for many applications. Although there has been a lot of work on developing computational methods and creating standard datasets for natural scene image quality assessment, until recently quality estimation of camera captured document images has not been given much attention. One traditional quality indicator for document images is the Optical Character Recognition (OCR) accuracy. In this work, we present a dataset of camera captured document images containing varying levels of focal-blur introduced manually during capture. For each image we obtained the character level OCR accuracy. Our dataset can be used to evaluate methods for predicting OCR quality of captured documents as well as enhancements. In order to make the dataset publicly and freely available, originals from two existing datasets - University of Washington dataset and Tobacco Database were selected. We present a case study with three recent methods for predicting the OCR quality of images on our dataset.
AB - With the proliferation of cameras on mobile devices there is an increased desire to image document pages as an alternative to scanning. However, the quality of captured document images is often lower than its scanned equivalent due to hardware limitations and stability issues. In this context, automatic assessment of the quality of captured images is useful for many applications. Although there has been a lot of work on developing computational methods and creating standard datasets for natural scene image quality assessment, until recently quality estimation of camera captured document images has not been given much attention. One traditional quality indicator for document images is the Optical Character Recognition (OCR) accuracy. In this work, we present a dataset of camera captured document images containing varying levels of focal-blur introduced manually during capture. For each image we obtained the character level OCR accuracy. Our dataset can be used to evaluate methods for predicting OCR quality of captured documents as well as enhancements. In order to make the dataset publicly and freely available, originals from two existing datasets - University of Washington dataset and Tobacco Database were selected. We present a case study with three recent methods for predicting the OCR quality of images on our dataset.
KW - Document image quality
KW - Image quality dataset
KW - Optical character recognition
KW - Sharpness
UR - https://www.scopus.com/pages/publications/84958528293
U2 - 10.1007/978-3-319-05167-3_9
DO - 10.1007/978-3-319-05167-3_9
M3 - Conference contribution
AN - SCOPUS:84958528293
SN - 9783319051666
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 113
EP - 125
BT - Camera-Based Document Analysis and Recognition - 5th International Workshop, CBDAR 2013, Revised Selected Papers
PB - Springer Verlag
T2 - 5th International Workshop on Camera-Based Document Analysis and Recognition, CBDAR 2013
Y2 - 23 August 2013 through 23 August 2013
ER -