一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。- z: |2 E) H1 t( V, ]5 P
可以进行各种编码的转换,先放在这里,也许以后有用。" X$ }7 j$ _5 }4 K9 Y6 e
! w# y% y4 o3 F6 C* V" g1 Q; f! C1 V7 R
5 w7 M) Z/ v' m3 O, D0 ^. C3 i用法说明
. [! ?$ k7 V3 uusage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]* k% I) V3 N' k
<input-encoding> <input-file> <output-encoding> <output-file>& u5 l! }8 {1 q% l8 Q+ r4 U
<property | transform>*: D/ K) g! S" Y }0 \
Version 2.1d3, 11/18/98* ?& k$ J, _" n0 R; I' S* N: w
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
! ?. B3 f/ s; T. `6 SType "uniconv -help" for more information.
" q$ p6 l$ ], l2 _! eType "uniconv -describe <encoding>" for more information about an encoding.
! i9 B' Y" n. C2 MEncodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
3 ]" i( s7 K1 E. k. F CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932, 0 B1 ]! r1 e I# @% q( `
EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
8 w* \# q! Y+ O& Q ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
) o$ l% d2 S1 d2 Y! m. \5 p Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
- F$ _8 `6 j' e, ~ Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
2 q7 E3 c7 k/ v: {0 z UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
! U6 [ ^! h* E3 r3 o0 Y7 b1 K- w Unicode11:BOM:Java, Unicode11:BOM:little-endian,
8 Y: b y- B5 {2 R. e/ O3 d' F Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
+ L2 h& N3 q* Y+ a Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
5 `5 Z3 O+ f/ I) x9 u S1 Q1 ? W Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
& C2 p5 a: w1 _ Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
3 ~) o5 Y" L( ~+ J6 z$ z4 R Unicode20:UCS2, UTF7, UTF8$ e8 w8 C: Z$ ?) _, G
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
# O/ S$ c! J5 A( {& `! g r! E1 Z OtherLetter, AnyLetter, NonSpacingMark, CombiningMark, , b8 A1 C& @7 C, m r/ Z
DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
7 e! Y; g6 ]+ v- ^# l& I. e" ~ ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol, 8 F+ A1 A0 c/ k! u( }/ n- x
OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
, P9 \. M$ Q R7 Q5 J ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
2 j- z! O' I3 U5 }# \. L: z Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari, % X4 U, N; O( J0 A! B$ k$ ?6 ]
Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
: o7 H. R0 E! r Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana, $ g# l b, x2 ]& r: o1 b
Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul, : r" D+ I/ q6 b3 U+ H# d
UndefinedWidth, Fullwidth, Halfwidth
, @8 z0 t" A/ C) L+ Z, ITransforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana, ! J9 q' B2 Y( m9 o
ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark, 0 [! w6 r' j& m
Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator, & J9 F6 r3 ?" P2 @/ Y6 ^* m3 a
ToLineSeparator, ToCanonical, ToTraditionalChinese,
/ ~- j7 c3 G9 I% |# P7 [ ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana, - F" }( Z; z4 ?/ |
KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
: [/ a5 [' ^) f) R ToLatinNumber, FromSGMLEntity |