I stumbled across a handy command-line encoding converter: uniconv 2.1d3.
It can convert between many different encodings. I'm posting it here for now, since it may come in handy later.

Usage:
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
               <input-encoding> <input-file> <output-encoding> <output-file>
               <property | transform>*
Version 2.1d3, 11/18/98
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
Type "uniconv -help" for more information.
Type "uniconv -describe <encoding>" for more information about an encoding.
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
    CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
    EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
    ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
    Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
    Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
    UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
    Unicode11:BOM:Java, Unicode11:BOM:little-endian,
    Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
    Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
    Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
    Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
    Unicode20:UCS2, UTF7, UTF8
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
    OtherLetter, AnyLetter, NonSpacingMark, CombiningMark,
    DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
    ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol,
    OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
    ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
    Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari,
    Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
    Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
    Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
    UndefinedWidth, Fullwidth, Halfwidth
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
    ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark,
    Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
    ToLineSeparator, ToCanonical, ToTraditionalChinese,
    ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana,
    KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
    ToLatinNumber, FromSGMLEntity
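
For anyone who wants to script this, here is a minimal Python sketch that assembles an argument list in the order given by the usage line above. It only builds the arguments and runs nothing. The helper name `build_uniconv_args`, the file names, and the small subsets of encoding and transform names are my own choices for illustration (the names themselves are copied from the lists above). I have not verified how uniconv treats each argument, and property arguments are left out because the help text does not show how they combine with transforms.

```python
import shlex

# Subsets of the names printed in the help output above, not the full lists.
KNOWN_ENCODINGS = {
    "ASCII", "Big5", "GB2312", "HZ", "Shift-JIS", "EUC-J", "EUC-KR",
    "Latin1", "UCS2", "UTF7", "UTF8",
}
KNOWN_TRANSFORMS = {
    "ToCRLF", "ToLF", "ToFullwidth", "ToHalfwidth",
    "ToSimplifiedChinese", "ToTraditionalChinese",
}


def build_uniconv_args(input_encoding, input_file, output_encoding,
                       output_file, transforms=(), subst=None):
    """Return an argv list laid out like the usage line (hypothetical helper).

    Nothing is executed here; the list is only assembled and checked
    against the name subsets above.
    """
    for name in (input_encoding, output_encoding):
        if name not in KNOWN_ENCODINGS:
            raise ValueError(f"encoding not in the listed subset: {name}")
    for name in transforms:
        if name not in KNOWN_TRANSFORMS:
            raise ValueError(f"transform not in the listed subset: {name}")

    args = ["uniconv"]
    if subst is not None:
        # -subst <substitute-string> appears before the encodings in the usage line.
        args += ["-subst", subst]
    args += [input_encoding, input_file, output_encoding, output_file]
    args += list(transforms)  # zero or more transforms come last
    return args


if __name__ == "__main__":
    argv = build_uniconv_args("GB2312", "in.txt", "UTF8", "out.txt",
                              transforms=["ToCRLF"])
    print(shlex.join(argv))  # prints: uniconv GB2312 in.txt UTF8 out.txt ToCRLF
```

To actually run the conversion you could hand the list to `subprocess.run`, assuming a `uniconv` binary is on your PATH.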