encguess - Man Page
guess character encodings of files
Version
$Id: encguess,v 0.4 2023/11/10 01:10:50 dankogai Exp $
Synopsis
encguess [switches] filename...
Switches
- -h
- show this message and exit. 
- -s
- specify a list of "suspect encoding types" to test, separated by either - :or- ,
- -S
- output a list of all acceptable encoding types that can be used with the -s param 
- -u
- suppress display of unidentified types 
Examples
- Guess encoding of a file named - test.txt, using only the default suspect types.- encguess test.txt 
- Guess the encoding type of a file named - test.txt, using the suspect types- euc-jp,shiftjis,7bit-jis.- encguess -s euc-jp,shiftjis,7bit-jis test.txt encguess -s euc-jp:shiftjis:7bit-jis test.txt 
- Guess the encoding type of several files, do not display results for unidentified files. - encguess -us euc-jp,shiftjis,7bit-jis test*.txt 
Description
The encoding identification is done by checking one encoding type at a time until all but the right type are eliminated. The set of encoding types to try is defined by the -s parameter and defaults to ascii, utf8 and UTF-16/32 with BOM. This can be overridden by passing one or more encoding types via the -s parameter. If you need to pass in multiple suspect encoding types, use a quoted string with the a space separating each value.
See Also
Encode::Guess, Encode::Detect
License and Copyright
Copyright 2015 Michael LaGrasta and Dan Kogai.
This program is free software; you can redistribute it and/or modify it under the terms of the Artistic License (2.0). You may obtain a copy of the full license at: