Tabular Corner Detection in Historical Irish Records

Enda O'Shea

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The process of extracting relevant data from historical handwritten documents can be time-consuming and challenging. In Ireland, from 1864 to 1922, government records regarding births, deaths, and marriages were documented by local registrars using printed tabular structures. Leveraging this systematic approach, we employ a neural network capable of segmenting scanned versions of these record documents. We sought to isolate the corner points with the goal of extracting the vital tabular elements and transforming them into consistently structured standalone images. By achieving uniformity in the segmented images, we enable more accurate row and column segmentation, enhancing our ability to isolate and classify individual cell contents effectively. This process must accommodate varying image qualities, different tabular orientations and sizes resulting from diverse scanning procedures, as well as faded and damaged ink lines that naturally occur over time.

Original languageEnglish
Title of host publicationDocEng 2023 - Proceedings of the 2023 ACM Symposium on Document Engineering
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9798400700279
DOIs
Publication statusPublished - 22 Aug 2023
Event2023 ACM Symposium on Document Engineering, DocEng 2023 - Limerick, Ireland
Duration: 22 Aug 202325 Aug 2023

Publication series

NameDocEng 2023 - Proceedings of the 2023 ACM Symposium on Document Engineering

Conference

Conference2023 ACM Symposium on Document Engineering, DocEng 2023
Country/TerritoryIreland
CityLimerick
Period22/08/2325/08/23

Keywords

  • corner detection
  • historical documents
  • Image segmentation

Fingerprint

Dive into the research topics of 'Tabular Corner Detection in Historical Irish Records'. Together they form a unique fingerprint.

Cite this