A Multiresolution Approach for Page Segmentation

LOMBARDI, LUCA
1998-01-01

Abstract

In this work we propose a new page segmentation method that recognizes text and graphics using a multiresolution representation of the page image. The approach analyzes a set of feature maps available at different resolution levels. The final output is a description of the physical structure of the page: the page image is broken down into blocks that represent page components such as text, line drawings, and pictures. The result, which requires only a small amount of memory beyond that needed for the image itself, can serve as the first step of a more detailed analysis such as optical character recognition.
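
The abstract does not say which feature maps are used or how the resolution levels are built, so the following is only an illustrative sketch of the general idea, not the author's method. It assumes a binary page image (1 = ink, 0 = background), uses ink density as a single hypothetical feature, and halves the resolution at each level by averaging 2x2 blocks. The names `downsample` and `build_pyramid` are invented for this example.

```python
from typing import List

FeatureMap = List[List[float]]


def downsample(density: FeatureMap) -> FeatureMap:
    """Halve the resolution by averaging each 2x2 block (odd edges are cropped)."""
    rows, cols = len(density) // 2, len(density[0]) // 2
    return [
        [
            (density[2 * r][2 * c] + density[2 * r][2 * c + 1]
             + density[2 * r + 1][2 * c] + density[2 * r + 1][2 * c + 1]) / 4.0
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def build_pyramid(image: List[List[int]], levels: int) -> List[FeatureMap]:
    """Return ink-density maps, from full resolution (level 0) to the coarsest level.

    Assumption: ink density is the only feature; the paper's actual maps are not
    described in the abstract.
    """
    pyramid = [[[float(p) for p in row] for row in image]]
    for _ in range(levels):
        # Stop early once a level is too small to be halved again.
        if len(pyramid[-1]) < 2 or len(pyramid[-1][0]) < 2:
            break
        pyramid.append(downsample(pyramid[-1]))
    return pyramid


# Tiny 4x4 page: a dense block in the top-left corner and one stray ink pixel.
page = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
]
pyramid = build_pyramid(page, levels=2)
```

In a sketch like this, the coarse levels are small compared with the image itself, and a dense region stands out at low resolution before any finer level is inspected. How the actual method classifies blocks as text, line drawings, or pictures is not stated in the abstract.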

Use this identifier to cite or link to this document: https://hdl.handle.net/11571/108918