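I happened to come across a handy command-line encoding converter, uniconv 2.1d3.
It can convert between a wide range of encodings. I am posting it here in case it proves useful later.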
Usage
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
    <input-encoding> <input-file> <output-encoding> <output-file>
    <property | transform>*
Version 2.1d3, 11/18/98
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
Type "uniconv -help" for more information.
Type "uniconv -describe <encoding>" for more information about an encoding.
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
    CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
    EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
    ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
    Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
    Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
    UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
    Unicode11:BOM:Java, Unicode11:BOM:little-endian,
    Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
    Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
    Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
    Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
    Unicode20:UCS2, UTF7, UTF8
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
    OtherLetter, AnyLetter, NonSpacingMark, CombiningMark,
    DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
    ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol,
    OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
    ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
    Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari,
    Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
    Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
    Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
    UndefinedWidth, Fullwidth, Halfwidth
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
    ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark,
    Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
    ToLineSeparator, ToCanonical, ToTraditionalChinese,
    ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana,
    KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
    ToLatinNumber, FromSGMLEntity