By chance I came across uniconv 2.1d3, a handy command-line tool for encoding conversion.
It can convert between a wide range of encodings. I'm posting it here for now, since it may come in useful later.

Usage notes
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
               <input-encoding> <input-file> <output-encoding> <output-file>
               <property | transform>*
Version 2.1d3, 11/18/98
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
Type "uniconv -help" for more information.
Type "uniconv -describe <encoding>" for more information about an encoding.
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
           CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
           EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
           ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
           Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
           Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
           UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
           Unicode11:BOM:Java, Unicode11:BOM:little-endian,
           Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
           Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
           Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
           Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
           Unicode20:UCS2, UTF7, UTF8
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
            OtherLetter, AnyLetter, NonSpacingMark, CombiningMark,
            DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
            ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol,
            OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
            ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
            Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari,
            Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
            Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
            Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
            UndefinedWidth, Fullwidth, Halfwidth
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
            ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark,
            Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
            ToLineSeparator, ToCanonical, ToTraditionalChinese,
            ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana,
            KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
            ToLatinNumber, FromSGMLEntity
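
To make the argument order in the usage synopsis concrete, here is a minimal Python sketch that assembles a uniconv command line. The option names and argument order come straight from the help text above; everything else is an assumption for illustration: the helper names `build_uniconv_command` and `run_uniconv` are hypothetical, the file names are placeholders, and I have not verified how the tool actually treats the trailing property/transform arguments.

```python
import shutil
import subprocess
from typing import List, Optional, Sequence


def build_uniconv_command(
    input_encoding: str,
    input_file: str,
    output_encoding: str,
    output_file: str,
    transforms: Sequence[str] = (),
    subst: Optional[str] = None,
    debug: bool = False,
    direct_ascii: bool = False,
    executable: str = "uniconv",
) -> List[str]:
    """Assemble an argument list in the order shown by the usage synopsis.

    Only flags that appear in the help output are emitted. The helper does
    no validation of encoding or transform names.
    """
    cmd = [executable]
    if debug:
        cmd.append("-debug")
    if direct_ascii:
        cmd.append("-directASCII")
    if subst is not None:
        cmd += ["-subst", subst]
    # Four positional arguments, then zero or more properties/transforms.
    cmd += [input_encoding, input_file, output_encoding, output_file]
    cmd += list(transforms)
    return cmd


def run_uniconv(cmd: List[str]) -> int:
    """Run the command if the executable is on PATH and return its exit code."""
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError(f"{cmd[0]} not found on PATH")
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Hypothetical file names; GB2312, UTF8 and ToTraditionalChinese are
    # names listed in the help output.
    cmd = build_uniconv_command(
        "GB2312", "input.txt", "UTF8", "output.txt",
        transforms=["ToTraditionalChinese"],
    )
    print(" ".join(cmd))
```

Building the command as a list rather than a single string avoids shell quoting problems when a file name or the `-subst` string contains spaces. Running `uniconv -describe <encoding>` first, as the help text suggests, is probably the safest way to check what a given encoding name means before converting real files.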