Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

  • Arindam Das
  • Saikat Roy
  • Ujjwal Bhattacharya
  • Swapan K. Parui

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In this article, a region-based Deep Convolutional Neural Network framework is presented for document structure learning. The contribution of this work involves efficient training of region-based classifiers and effective ensembling for document image classification. A primary level of 'inter-domain' transfer learning is used by exporting weights from a VGG16 architecture pre-trained on the ImageNet dataset to train a document classifier on whole document images. Exploiting the nature of region-based influence modelling, a secondary level of 'intra-domain' transfer learning is used for rapid training of deep learning models for image segments. Finally, stacked-generalization-based ensembling is utilized to combine the predictions of the base deep neural network models. The proposed method achieves state-of-the-art accuracy of 92.21% on the popular RVL-CDIP document image dataset, exceeding the benchmarks set by existing algorithms.
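
The final ensembling step described in the abstract can be illustrated with a small sketch of stacked generalization. This is not the authors' implementation. The base models are simulated with random noise rather than being trained CNNs, the meta-learner is a plain softmax regression (the abstract does not say which meta-classifier is used), and the names `fake_base_output`, `stack_features`, `train_meta_learner`, and `run_demo` are hypothetical. Three classes and five base models are arbitrary choices made to keep the example short.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fake_base_output(label, n_classes, rng, noise=1.0):
    """Stand-in for one base model: a noisy probability vector biased to the true label."""
    scores = [rng.gauss(0.0, noise) for _ in range(n_classes)]
    scores[label] += 2.0
    return softmax(scores)


def stack_features(base_probs):
    """Concatenate the per-model probability vectors into one meta-feature vector."""
    return [p for probs in base_probs for p in probs]


def predict_proba(weights, biases, x):
    """Class probabilities of the softmax-regression meta-learner."""
    scores = [sum(w * v for w, v in zip(row, x)) + b
              for row, b in zip(weights, biases)]
    return softmax(scores)


def predict(weights, biases, x):
    """Index of the most probable class."""
    probs = predict_proba(weights, biases, x)
    return max(range(len(probs)), key=probs.__getitem__)


def train_meta_learner(features, labels, n_classes, epochs=200, lr=0.5):
    """Fit softmax regression on stacked features by batch gradient descent."""
    n_samples, n_features = len(features), len(features[0])
    weights = [[0.0] * n_features for _ in range(n_classes)]
    biases = [0.0] * n_classes
    for _ in range(epochs):
        grad_w = [[0.0] * n_features for _ in range(n_classes)]
        grad_b = [0.0] * n_classes
        for x, y in zip(features, labels):
            probs = predict_proba(weights, biases, x)
            for c in range(n_classes):
                err = probs[c] - (1.0 if c == y else 0.0)  # cross-entropy gradient
                grad_b[c] += err
                for j in range(n_features):
                    grad_w[c][j] += err * x[j]
        for c in range(n_classes):
            biases[c] -= lr * grad_b[c] / n_samples
            for j in range(n_features):
                weights[c][j] -= lr * grad_w[c][j] / n_samples
    return weights, biases


def run_demo(seed=0, n_classes=3, n_models=5, n_train=120, n_test=60):
    """Train the meta-learner on simulated base outputs; return held-out accuracy."""
    rng = random.Random(seed)

    def make_split(n):
        labels = [rng.randrange(n_classes) for _ in range(n)]
        feats = [stack_features([fake_base_output(y, n_classes, rng)
                                 for _ in range(n_models)])
                 for y in labels]
        return feats, labels

    # In real stacking, the meta-learner's training features should come from
    # base-model predictions on data those base models were not trained on.
    train_x, train_y = make_split(n_train)
    test_x, test_y = make_split(n_test)
    weights, biases = train_meta_learner(train_x, train_y, n_classes)
    hits = sum(predict(weights, biases, x) == y for x, y in zip(test_x, test_y))
    return hits / n_test
```

Calling `run_demo()` returns the meta-learner's accuracy on the simulated held-out split. In a real pipeline, each call to `fake_base_output` would be replaced by the class-probability output of a trained whole-image or region model.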

Original language: English
Title of host publication: 2018 24th International Conference on Pattern Recognition, ICPR 2018
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 3180-3185
Number of pages: 6
ISBN (Electronic): 9781538637883
DOIs
Publication status: Published - 26 Nov 2018
Externally published: Yes
Event: 24th International Conference on Pattern Recognition, ICPR 2018 - Beijing, China
Duration: 20 Aug 2018 to 24 Aug 2018

Publication series

Name: Proceedings - International Conference on Pattern Recognition
Volume: 2018-August
ISSN (Print): 1051-4651

Conference

Conference: 24th International Conference on Pattern Recognition, ICPR 2018
Country/Territory: China
City: Beijing
Period: 20/08/18 to 24/08/18

Keywords

  • deep convolutional neural network
  • deep learning
  • document recognition
  • document structure learning
  • intra-domain
  • neural network
  • transfer learning
