set_unicharset_properties - Man Page
set properties about the unichars
Synopsis
set_unicharset_properties --U input_unicharsetfile --script_dir /path/to/langdata --O output_unicharsetfile
Description
set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it with properties about the unichars it contains and writes the result back to another unicharset file.
Options
- --script_dir /path/to/langdata
(Input) Specify the location of directory for universal script unicharsets and font xheights (type:string default:)
- --U unicharsetfile
(Input) Specify the location of the unicharset to load as input.
- --O unicharsetfile
(Output) Specify the location of the unicharset to be written with updated properties.
History
set_unicharset_properties(1) was first made available for tesseract version 3.03.
Resources
Main web site: https://github.com/tesseract-ocr Information on training: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html
See Also
Copying
Copyright (C) 2012 Google, Inc. Licensed under the Apache License, Version 2.0
Author
The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-2018).