A C D G I L M O P S T U V 

A

AbstractTesseract4OcrEngine - Class in com.itextpdf.pdfocr.tesseract4
The implementation of IOcrEngine.
AbstractTesseract4OcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
 
applyRotation(ImageData) - Method in class com.itextpdf.pdfocr.tesseract4.LeptonicaImageRotationHandler
 

C

CANNOT_BINARIZE_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_CONVERT_IMAGE_TO_GRAYSCALE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
Deprecated.
since 1.0.1. Will be removed in 2.0.0
CANNOT_CONVERT_IMAGE_TO_PIX - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
Deprecated.
since 1.0.1. Will be removed in 2.0.0
CANNOT_CREATE_BUFFERED_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_DELETE_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_FIND_PATH_TO_TESSERACT_EXECUTABLE - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
CANNOT_GET_TEMPORARY_DIRECTORY - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_OCR_INPUT_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_PARSE_NODE_BBOX - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_PROCESS_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_READ_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_READ_IMAGE_METADATA - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_READ_INPUT_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_READ_PROVIDED_IMAGE - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
CANNOT_RETRIEVE_PAGES_FROM_IMAGE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_USE_USER_WORDS - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CANNOT_WRITE_TO_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
com.itextpdf.pdfocr.tesseract4 - package com.itextpdf.pdfocr.tesseract4
 
com.itextpdf.pdfocr.tesseract4.events - package com.itextpdf.pdfocr.tesseract4.events
 
COMMAND_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
CREATED_TEMPORARY_FILE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
createTxtFile(List<File>, File) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Performs OCR using the provided IOcrEngine for the given list of input images and saves the output to a text file at the provided path.

D

doImageOcr(File) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided input image file and returns retrieved data in the format described below.
doImageOcr(File, OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Reads data from the provided input image file and returns retrieved data as string.
doTesseractOcr(File, File, OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Performs tesseract OCR for the first (or for the only) image page.

G

getDefaultLanguage() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets the default language for OCR.
getDefaultUserWordsSuffix() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets default user words suffix.
getEventType() - Method in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
 
getImagePreprocessingOptions() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
getLanguagesAsString() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Gets the list of languages joined with the "+" symbol into a string in the format required by tesseract.
getMinimalConfidenceLevel() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets minimal confidence level for HOCR line to be considered as properly recognized.
getOriginId() - Method in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
 
getPageSegMode() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets Page Segmentation Mode.
getPathToExecutable() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Gets path to tesseract executable.
getPathToTessData() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Gets path to directory with tess data.
getTesseract4OcrEngineProperties() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Gets properties for AbstractTesseract4OcrEngine.
getTesseractInstance() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
Gets tesseract instance.
getTextPositioning() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Defines the way text is retrieved from tesseract output using TextPositioning.
getThreadLocalMetaInfo() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
getTileHeight() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
getTileWidth() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
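
The getLanguagesAsString() entry above describes joining language codes with "+" for tesseract. A minimal sketch of that joining step, assuming plain string codes such as "eng" and "deu"; the class and method names here are hypothetical and are not part of the pdfOCR API:

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not the library's implementation.
final class LanguageStringSketch {

    // Joins language codes with "+", the separator tesseract expects
    // when several languages are requested at once (for example "eng+deu").
    static String joinLanguages(List<String> languages) {
        return String.join("+", languages);
    }

    public static void main(String[] args) {
        System.out.println(joinLanguages(Arrays.asList("eng", "deu")));
    }
}
```

A single-element list yields the code unchanged, so the same helper covers the one-language case.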

I

identifyOsType() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Identifies the type of the current OS and returns it (win, linux).
ImagePreprocessingOptions - Class in com.itextpdf.pdfocr.tesseract4
Additional options applied on image preprocessing step.
ImagePreprocessingOptions() - Constructor for class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
 
ImagePreprocessingOptions(ImagePreprocessingOptions) - Constructor for class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
 
INCORRECT_INPUT_IMAGE_FORMAT - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
INCORRECT_LANGUAGE - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
initializeTesseract(OutputFormat) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
Initializes the tesseract instance if it has not been initialized yet or has been disposed, and sets all the required properties.
isPreprocessingImages() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Checks whether image preprocessing is needed.
isSmoothTiling() - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
isUseTxtToImproveHocrParsing() - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
isWindows() - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Checks whether the current OS is Windows.
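
The identifyOsType() and isWindows() entries above describe an OS check. One possible way to make such a check in plain Java is sketched below. It assumes the decision is based on the `os.name` system property and, as a simplification, treats every non-Windows name as "linux"; the class and its methods are hypothetical and do not reproduce the library's code:

```java
import java.util.Locale;

// Hypothetical illustration only; not the library's implementation.
final class OsTypeSketch {

    // Maps an OS name (as reported by the "os.name" system property)
    // to "win" or "linux". Anything that is not Windows falls back to
    // "linux", which is an assumption made for this sketch.
    static String identifyOsType(String osName) {
        String normalized = osName == null ? "" : osName.toLowerCase(Locale.ROOT);
        return normalized.startsWith("windows") ? "win" : "linux";
    }

    // Convenience check built on top of identifyOsType.
    static boolean isWindows(String osName) {
        return "win".equals(identifyOsType(osName));
    }

    public static void main(String[] args) {
        String osName = System.getProperty("os.name");
        System.out.println(osName + " -> " + identifyOsType(osName));
    }
}
```

Matching on the "windows" prefix rather than on the substring "win" avoids misclassifying names such as "Darwin".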

L

LANGUAGE_IS_NOT_IN_THE_LIST - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
LeptonicaImageRotationHandler - Class in com.itextpdf.pdfocr.tesseract4
Leptonica based implementation of IImageRotationHandler.
LeptonicaImageRotationHandler() - Constructor for class com.itextpdf.pdfocr.tesseract4.LeptonicaImageRotationHandler
 

M

MAJOR_VERSION - Static variable in class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
The major version number.
MINOR_VERSION - Static variable in class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
The minor version number.

O

OutputFormat - Enum in com.itextpdf.pdfocr.tesseract4
Enumeration of the available output formats.

P

PAGE_NUMBER_IS_INCORRECT - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
parseHocrFile(List<File>, TextPositioning) - Static method in class com.itextpdf.pdfocr.tesseract4.TesseractHelper
PATH_TO_TESS_DATA_DIRECTORY_CONTAINS_NON_ASCII_CHARACTERS - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
PATH_TO_TESS_DATA_DIRECTORY_IS_INVALID - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
PATH_TO_TESS_DATA_IS_NOT_SET - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
PdfOcrTesseract4Event - Class in com.itextpdf.pdfocr.tesseract4.events
Class for OCR events.
PdfOcrTesseract4ProductInfo - Class in com.itextpdf.pdfocr.tesseract4
Product info about this iText add-on.
PdfOcrTesseract4ProductInfo() - Constructor for class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
 
PRODUCT_NAME - Static variable in class com.itextpdf.pdfocr.tesseract4.PdfOcrTesseract4ProductInfo
The product name.

S

setImagePreprocessingOptions(ImagePreprocessingOptions) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
setMinimalConfidenceLevel(int) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets minimal confidence level for HOCR line to be considered as properly recognized.
setPageSegMode(Integer) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets Page Segmentation Mode.
setPathToExecutable(String) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Sets path to tesseract executable.
setPathToTessData(File) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets path to directory with tess data.
setPreprocessingImages(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Sets whether image preprocessing is needed.
setSmoothTiling(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
setTesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Sets properties for AbstractTesseract4OcrEngine.
setTextPositioning(TextPositioning) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Defines the way text is retrieved from tesseract output using TextPositioning.
setThreadLocalMetaInfo(IMetaInfo) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
setTileHeight(int) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
setTileWidth(int) - Method in class com.itextpdf.pdfocr.tesseract4.ImagePreprocessingOptions
setUseTxtToImproveHocrParsing(boolean) - Method in class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
START_OCR_FOR_IMAGES - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 

T

TESSERACT4_IMAGE_OCR - Static variable in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
 
TESSERACT4_IMAGE_TO_PDF - Static variable in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
 
TESSERACT4_IMAGE_TO_PDFA - Static variable in class com.itextpdf.pdfocr.tesseract4.events.PdfOcrTesseract4Event
 
Tesseract4ExecutableOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
The implementation of AbstractTesseract4OcrEngine that runs tesseract OCR through the tesseract executable.
Tesseract4ExecutableOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Creates a new Tesseract4ExecutableOcrEngine instance.
Tesseract4ExecutableOcrEngine(String, Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4ExecutableOcrEngine
Creates a new Tesseract4ExecutableOcrEngine instance.
Tesseract4LibOcrEngine - Class in com.itextpdf.pdfocr.tesseract4
The implementation of AbstractTesseract4OcrEngine that runs tesseract OCR through the tesseract library.
Tesseract4LibOcrEngine(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4LibOcrEngine
Creates a new Tesseract4LibOcrEngine instance.
Tesseract4LogMessageConstant - Class in com.itextpdf.pdfocr.tesseract4
 
Tesseract4OcrEngineProperties - Class in com.itextpdf.pdfocr.tesseract4
Properties that will be used by the IOcrEngine.
Tesseract4OcrEngineProperties() - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Creates a new Tesseract4OcrEngineProperties instance.
Tesseract4OcrEngineProperties(Tesseract4OcrEngineProperties) - Constructor for class com.itextpdf.pdfocr.tesseract4.Tesseract4OcrEngineProperties
Creates a new Tesseract4OcrEngineProperties instance based on another Tesseract4OcrEngineProperties instance (copy constructor).
Tesseract4OcrException - Exception in com.itextpdf.pdfocr.tesseract4
 
Tesseract4OcrException(String, Throwable) - Constructor for exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
Creates a new Tesseract4OcrException.
Tesseract4OcrException(String) - Constructor for exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
Creates a new Tesseract4OcrException.
TESSERACT_FAILED - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 
TESSERACT_FAILED - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
TESSERACT_LIB_NOT_INSTALLED - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
TESSERACT_LIB_NOT_INSTALLED_WIN - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
TESSERACT_NOT_FOUND - Static variable in exception com.itextpdf.pdfocr.tesseract4.Tesseract4OcrException
 
TesseractHelper - Class in com.itextpdf.pdfocr.tesseract4
Helper class.
TextPositioning - Enum in com.itextpdf.pdfocr.tesseract4
Enumeration of the possible types of text positioning.

U

UNSUPPORTED_EXIF_ORIENTATION_VALUE - Static variable in class com.itextpdf.pdfocr.tesseract4.Tesseract4LogMessageConstant
 

V

validateLanguages(List<String>) - Method in class com.itextpdf.pdfocr.tesseract4.AbstractTesseract4OcrEngine
Validates list of provided languages and checks if they all exist in given tess data directory.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.itextpdf.pdfocr.tesseract4.OutputFormat
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.itextpdf.pdfocr.tesseract4.TextPositioning
Returns an array containing the constants of this enum type, in the order they are declared.
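
The validateLanguages(List<String>) entry above describes checking that every requested language exists in the tess data directory. A minimal sketch of such a check, assuming each language is stored as a `<language>.traineddata` file (the usual tesseract naming); the class and method names are hypothetical and are not part of the pdfOCR API:

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not the library's implementation.
final class LanguageFilesSketch {

    // Returns the languages that have no "<language>.traineddata" file
    // in the given directory. An empty result means all were found.
    static List<String> findMissingLanguages(List<String> languages, File tessDataDir) {
        List<String> missing = new ArrayList<>();
        for (String language : languages) {
            File trainedData = new File(tessDataDir, language + ".traineddata");
            if (!trainedData.isFile()) {
                missing.add(language);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // The default directory name "tessdata" is a placeholder.
        File tessDataDir = new File(args.length > 0 ? args[0] : "tessdata");
        List<String> missing = findMissingLanguages(Arrays.asList("eng", "deu"), tessDataDir);
        System.out.println("Missing languages: " + missing);
    }
}
```

Returning the list of missing languages, rather than failing on the first one, lets a caller report every problem in a single message.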

Copyright © 1998–2021 iText Group NV. All rights reserved.