一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。" Q3 s0 \$ ^' K! H; B
可以进行各种编码的转换,先放在这里,也许以后有用。
& g5 l \9 Z+ }; [0 T# i0 L- C! M
5 {* n, A, c$ @4 D& W8 c. S" M+ J, x8 e, n$ W
用法说明; C$ A4 Z- a. K* b
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
, w) v1 P6 p0 I% t: _ <input-encoding> <input-file> <output-encoding> <output-file>( ?/ Z4 V( y5 u+ T6 `% W
<property | transform>*. Z0 `% N. z$ N$ _
Version 2.1d3, 11/18/98
# {2 A4 p* p/ F% p8 d" sCopyright (c) Basis Technology Corp. 1995-1998. All rights reserved./ F8 L. b3 ]; g4 }, {
Type "uniconv -help" for more information.- \) Q7 G, f8 Z8 E
Type "uniconv -describe <encoding>" for more information about an encoding.9 [5 J+ C1 |# `7 p$ h9 S) Z0 a- u
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect, % g( [ Q$ f* ^
CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932, 1 m1 L4 W! O0 m1 ^
EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ, 4 T4 C+ X' F3 }, J. {0 V8 l/ I
ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
8 ^7 ~/ i+ n: D Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
# ^% u' L5 i' v& |4 A0 y Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
. F$ Q% J8 P; \7 i# O UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian, - ]9 b( m" `0 s
Unicode11:BOM:Java, Unicode11:BOM:little-endian,
& O A9 k# J# y X6 i. F Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
, P- a2 U, c2 v _ Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2, ( t4 ~% @0 n0 H1 K1 y
Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java, ( Q2 k% F" W: [5 _/ k
Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
7 \2 O* v5 ^& b: j8 y* u Unicode20:UCS2, UTF7, UTF8
+ n* R' p9 B+ h2 I" yProperties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter, ( J* `$ h8 N, P0 W5 v* s* Z
OtherLetter, AnyLetter, NonSpacingMark, CombiningMark, ( g" B" f- Z$ g! J
DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation, ( U* k2 D N1 u* l1 A/ [8 Y6 I% f
ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol, 2 I4 r% U7 A# G5 j7 a- T: C, G9 G
OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator, . K9 Y& x7 C U) s! Z3 J
ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript, ! E# L; C2 w4 P' }0 I( v* E
Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari, 2 c! T9 ?+ t: g1 W: `! Q# Y
Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
" ^! T, Z7 k! A- b" e# i( e) k1 z Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
: C% v! I' l: S7 [9 W# j Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul, ( W i5 I4 s3 k
UndefinedWidth, Fullwidth, Halfwidth
2 i) h0 n: g5 D- RTransforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana, 4 _! P% H/ a( q1 ]) t
ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark,
+ b1 ]' I6 ], I. P* b$ ? Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator, ) \$ f2 D, s4 O' Z1 y
ToLineSeparator, ToCanonical, ToTraditionalChinese,
9 E0 A5 R n: m+ J( w" @ ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana, $ ?* Y( K% N+ I/ @0 O% @
KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji, * L2 n) x" e# p1 R9 V1 y5 V' B
ToLatinNumber, FromSGMLEntity |