djvutxt

TXT* chunks extractor
Synopsis
        djvutxt [-page <page_num>] <djvu_file_in> [<txt_file_out>]
    

Description --- File "djvutxt.cpp" illustrates how to use DjVuDocument, DjVuImage, DjVuTXT to retrieve textual information stored inside a TXT* chunk of a DjVu document.

TXT* chunks should have been created with the help of an OCR engine, and are used to allow indexing and searching of the DjVu document. The chunks contain the ASCII text itself, and layout information allowing the DjVu plugins to highlight found text.

This utility can be used to extract text from TXT* chunks and output it to a file or standard output.

Arguments:

-page option can be used to select a particular page from the <djvu_file_in> for processing.

Author:
Andrei Erofeev <eaf@geocities.com> -- initial implementation
Version:
$Id: djvutxt.2.html,v 1.6 2000/11/02 02:17:04 bcr Exp $

Alphabetic index Hierarchy of classes


DjVu is a trademark of AT&T.
All other products mentioned are registered trademarks or trademarks of their respective companies.