tslmendian - Man Page
change the byte-order of sunpinyin's threaded back-off language model
Synopsis
tslmendian [option]
Description
slmendian converts the binary language model files used by SunPinyin from big-endian to small-endian or vice versa.
Options
- -e endian
Specify the output-lm-file's endian-ness. It can be le or ge. If not given, tslmendian uses the host's endian-ness.
- -v
Prints out the endian-ness of input-lm-file.
- -i input-lm-file
Identify the input file of convert. Generally, this file is generated by slmthread.
- -o out-lm-file
Identify the output file of convert.
Notes
The converted output file is equivalent to the input. But if you compare the output of tslminfo of both files, the ARPA file generated by tslminfo may different. This is due to the different precision in different machines.
And due to uninitialized padding data in data structure, the checksum of the original data file may different from the converted-back file, even though they are identical from SunPinyin's point of view.
Author
Originally written by Kov. Chai <tchaikov.gmail.com>. Currently maintained by Kov.Chai <tchaikov@gmail.com>.
See Also
slmthread(1). tslminfo.