EMPIRICAL EVALUATION OF STATE-OF-THE-ART OBJECT DETECTION METHODS FOR DOCUMENT IMAGE UNDERSTANDING

Nguyen D. Vo, Khanh Nguyen, Tam V. Nguyen, Khang Nguyen



DOI: 10.15625/vap.2017.00022

Abstract


The majority of online documents such as research papers, articles, and magazines is publicly available in the image form due to the copyright issue. Document image understanding is the task of deriving a high level presentation of the contents of a document image, which involves several phases, mainly including page segmentation (or block segmentation), blocks classification (or blocks labeling) and several operations for processing text, tables, graphics, figures, formulas, etc. Our objective focuses on the first two phases of document image understanding, namely, locating the logical objects in document pages. This process is valuable for a variety of document image analysis applications. To this end, we evaluate different state-of-the-art object detection methods based on computer vision for the task. Through our extensive experiments, we report findings/comments from the off-the-shelf object detectors and streamline several potential directions for the future work.

Keywords


Page Object Detection, Document Image Understanding

Full Text:

PDF


Copyright (c) 2018 PROCEEDING of Publishing House for Science and Technology



PROCEEDING

PUBLISHING HOUSE FOR SCIENCE AND TECHNOLOGY

Website: http://vap.ac.vn

Contact: nxb@vap.ac.vn