IronOcrEngineParameters

AcceptedOcrCharacters

Specifies the characters the OCR engine will look for. Restricting this set can greatly improve speed and accuracy. By default, the engine identifies all UTF-8 characters, including accented characters borrowed from international languages. For example, use "0123456789" for numbers only, or "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz.:;?!@$%&+-,-()\"'0123456789" for English characters without accents. Default: null

Syntax: public string AcceptedOcrCharacters { get; set; }
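As a minimal sketch of restricting the character set, the snippet below sets AcceptedOcrCharacters to digits only. The IronOcrEngineParameters class declared here is a hypothetical stand-in that mirrors just the documented property so the example compiles on its own, and the OcrPresets helper is purely illustrative and not part of the library.

```csharp
using System;

// Hypothetical stand-in: mirrors only the property documented above so the
// sketch compiles without the real library. Use the real class in practice.
public class IronOcrEngineParameters
{
    // null (the documented default) means the character set is not restricted.
    public string AcceptedOcrCharacters { get; set; }
}

// Illustrative helper, not part of the library.
public static class OcrPresets
{
    public const string Digits = "0123456789";

    // Parameters for fields that contain only numbers, e.g. a quantity box.
    public static IronOcrEngineParameters DigitsOnly()
    {
        return new IronOcrEngineParameters { AcceptedOcrCharacters = Digits };
    }
}

public static class Program
{
    public static void Main()
    {
        var unrestricted = new IronOcrEngineParameters();
        var digitsOnly = OcrPresets.DigitsOnly();

        // The unrestricted instance keeps the default of null.
        Console.WriteLine(unrestricted.AcceptedOcrCharacters == null);
        Console.WriteLine(digitsOnly.AcceptedOcrCharacters);
    }
}
```

Keeping such presets in one place makes it easier to reuse the same character set wherever the same kind of field is read.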

CleanBackgroundNoise

AdvancedOcr will clean background noise such as dust, speckles, and signs of paper away from printed text. When detecting text on the UI, CleanBackgroundNoise can help with screen segments that have varying backgrounds, e.g. tables with alternating row colors. Note: This produces unusable results for white text on dark backgrounds. To use it in that case, enable UseInvertModification in PreprocessingParameters. Default: false

Syntax: public bool CleanBackgroundNoise { get; set; }

ColorDepth

Bits of color per channel per pixel. A lower ColorDepth can reduce scanning time for documents that contain many colors or images. A value of 0 leaves the color depth of the source image unmodified. Default: 0

Syntax: public int ColorDepth { get; set; }

ColorSpace

OCR documents can be scanned in GrayScale to reduce processing time. This may not work well for low-contrast text on non-white backgrounds. Default: GrayScale

Syntax: public OcrColorSpace ColorSpace { get; set; }

DetectWhiteTextOnDarkBackgrounds

Automatically detects and adjusts white text on darker backgrounds. Default: true

Syntax: public bool DetectWhiteTextOnDarkBackgrounds { get; set; }

EnhanceContrast

Enhances image contrast adaptively, which generally increases text readability for both humans and machines. Default: false

Syntax: public bool EnhanceContrast { get; set; }

EnhanceResolution

Re-renders low-quality images at a higher DPI. IronOcr works faster and more accurately with high-resolution images. Useful for images scanned at under 250 DPI. Because the standard DPI for VNC is 95, this is a very important parameter for detecting text on the UI. Default: true

Syntax: public bool EnhanceResolution { get; set; }

InputImageType

Gives the OCR engine a hint about the type of image being scanned, which can reduce OCR time and increase accuracy. Can be AutoDetect, Document, or Snippet. Default: Snippet

Syntax: public InputTypes InputImageType { get; set; }

ReadBarCodes

Enables Barcodes to be extracted from OCR documents, alongside the text content. Default: false

Syntax: public bool ReadBarCodes { get; set; }

RotateAndStraighten

AdvancedOcr will deskew and rotate images so that they are straight, upright, unbent, and free of perspective distortion. RotateAndStraighten is a powerful tool that lets imperfect scans be read accurately, but it is CPU/GPU intensive. Default: false

Syntax: public bool RotateAndStraighten { get; set; }

Strategy

Algorithm to be used for OCR text analysis. Can be Fast or Advanced. Advanced takes considerably longer and produces many more false results, but it may recognize text that the Fast strategy does not detect as text. Default: Fast

Syntax: public OcrStrategy Strategy { get; set; }
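Given that trade-off, one possible pattern is to try Fast first and retry with Advanced only when nothing was recognized. The sketch below illustrates that idea under stated assumptions: the OcrStrategy enum and IronOcrEngineParameters class are hypothetical stand-ins that mirror only the documented members, and runOcr is a placeholder delegate for whatever call actually performs the OCR, not a library API.

```csharp
using System;

// Hypothetical stand-ins mirroring only the documented members, so the
// sketch compiles without the real library.
public enum OcrStrategy { Fast, Advanced }

public class IronOcrEngineParameters
{
    public OcrStrategy Strategy { get; set; } = OcrStrategy.Fast; // documented default
}

// Illustrative helper, not part of the library.
public static class StrategyFallback
{
    // runOcr is a placeholder for the call that performs OCR with the given
    // parameters and returns the recognized text.
    public static string ReadText(Func<IronOcrEngineParameters, string> runOcr)
    {
        var parameters = new IronOcrEngineParameters();

        // First attempt with the cheaper Fast strategy.
        string text = runOcr(parameters);
        if (!string.IsNullOrWhiteSpace(text))
        {
            return text;
        }

        // Nothing recognized: retry once with the slower Advanced strategy.
        parameters.Strategy = OcrStrategy.Advanced;
        return runOcr(parameters);
    }
}

public static class Program
{
    public static void Main()
    {
        // Fake OCR function for demonstration: only Advanced "finds" text.
        Func<IronOcrEngineParameters, string> fakeOcr =
            p => p.Strategy == OcrStrategy.Advanced ? "OK" : "";

        Console.WriteLine(StrategyFallback.ReadText(fakeOcr));
    }
}
```

Because Advanced produces more false results, text obtained from the second attempt may deserve an extra plausibility check before it is used.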

UseAutoMode

AutoOcr uses metadata and computer vision to predict intelligent values for each setting. Ignores all other IronOcrParameters except ReadBarCodes. Generally takes longer than the default settings for AdvancedOcr and doesn't produce better results. Default: false

Syntax: public bool UseAutoMode { get; set; }
