ABSTRACT
This paper presents the architecture and operation of a Historical
Newspaper Page Image Topic Navigation System designed to
facilitate the access and use of social and historical research to
the historical newspaper collection. The system consists of four
modules which are: Text Subimage Segmentation, Text Extraction
and Preprocessing, Topic Network Extraction, and Document Viewing
and Retrieval Interface. The algorithmic and technological approaches
of each module are described and the initial test results
are presented.
O Computer on the Beach é um evento técnico-científico que visa reunir profissionais, pesquisadores e acadêmicos da área de Computação, a fim de discutir as tendências de pesquisa e mercado da computação em suas mais diversas áreas.