DocumentRecognitionSettings

DocumentRecognitionSettings class

Settings for the pdf recognition.
Contains elements that allow customizing the recognition process.

The DocumentRecognitionSettings type exposes the following members:

Name	Description
DocumentRecognitionSettings(start_page, pages_number)	Initializes a new instance of the DocumentRecognitionSettings class
DocumentRecognitionSettings(start_page, pages_number, language, detect_areas, auto_skew, threshold)	Initializes a new instance of the DocumentRecognitionSettings class

Name	Description
ignored_symbols	Sets blacklist for recognition symbols.
ignored_characters	Sets blacklist for recognition symbols.
allowed_symbols	Set the allowed characters with alphabet property.
lines_filtration	Allows to recognize text in the tables (regions surrounded lines).
preprocessing_filters	Allows to prepare the image for OCR by adjusting pre-processing methods.
auto_contrast	Allows using an additional contrast correction algorithm for the image before recognition.
allowed_characters	Allowed characters set. Determines the type of characters allowed for recognition result.
detect_areas_mode	Allows to select the optimal mode for document type areas: document, photo, plain text, column, image.
auto_denoising	Enables the use of an additional neural network to improve the image - reduce noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, foreign elements.
upscale_small_font	Allows you to use additional algorithms specifically for small font recognition. Useful for images with small size characters.
start_page	Set the first page for recognition.
pages_number	Set the number of pages for recognition multipage pdf file.