tslmendian - Man Page

change the byte-order of sunpinyin's threaded back-off language model

Synopsis

tslmendian [option]

Description

slmendian converts the binary language model files used by SunPinyin from big-endian to small-endian or vice versa.

Options

-e endian

Specify the output-lm-file's endian-ness. It can be le or ge. If not given, tslmendian uses the host's endian-ness.

-v

Prints out the endian-ness of input-lm-file.

-i input-lm-file

Identify the input file of convert. Generally, this file is generated by slmthread.

-o out-lm-file

Identify the output file of convert.

Notes

The converted output file is equivalent to the input. But if you compare the output of tslminfo of both files, the ARPA file generated by tslminfo may different. This is due to the different precision in different machines.

And due to uninitialized padding data in data structure, the checksum of the original data file may different from the converted-back file, even though they are identical from SunPinyin's point of view.

Author

Originally written by Kov. Chai <tchaikov.gmail.com>. Currently maintained by Kov.Chai <tchaikov@gmail.com>.

See Also

slmthread(1). tslminfo.

Info

2024-07-20 perl v5.40.0 User Contributed Perl Documentation