EMPIRICAL EVALUATION OF STATE-OF-THE-ART OBJECT DETECTION METHODS FOR DOCUMENT IMAGE UNDERSTANDING

Nguyen D. Vo; Khanh Nguyen; Tam V. Nguyen; Khang Nguyen

doi:10.15625/vap.2017.00022

EMPIRICAL EVALUATION OF STATE-OF-THE-ART OBJECT DETECTION METHODS FOR DOCUMENT IMAGE UNDERSTANDING

Nguyen D. Vo, Khanh Nguyen, Tam V. Nguyen, Khang Nguyen

DOI: 10.15625/vap.2017.00022

Abstract

The majority of online documents such as research papers, articles, and magazines is publicly available in the image form due to the copyright issue. Document image understanding is the task of deriving a high level presentation of the contents of a document image, which involves several phases, mainly including page segmentation (or block segmentation), blocks classification (or blocks labeling) and several operations for processing text, tables, graphics, figures, formulas, etc. Our objective focuses on the first two phases of document image understanding, namely, locating the logical objects in document pages. This process is valuable for a variety of document image analysis applications. To this end, we evaluate different state-of-the-art object detection methods based on computer vision for the task. Through our extensive experiments, we report findings/comments from the off-the-shelf object detectors and streamline several potential directions for the future work.

Keywords

Page Object Detection, Document Image Understanding

Full Text:

PDF

PROCEEDING

PUBLISHING HOUSE FOR SCIENCE AND TECHNOLOGY

Website: http://vap.ac.vn

Contact: nxb@vap.ac.vn

Username
Password
Remember me