一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。
1 {( \* z# H: J! V& S" t可以进行各种编码的转换,先放在这里,也许以后有用。& ]. x$ L9 _& k! j( w% L
- I1 y1 T' x- t0 ^7 x, h( d3 N2 ~5 W0 S) N7 u+ |( `: H
用法说明$ X$ r, v4 ]; W& z% ^! t
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
' `* Z" j3 ~# z* m <input-encoding> <input-file> <output-encoding> <output-file>
( M# Z: U) d& }2 C) [1 r3 Q# j <property | transform>* u- q3 K$ W7 n& h, M3 |" m
Version 2.1d3, 11/18/98: O8 U0 g. m: r' Z2 M. W
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
2 a7 P/ x0 u9 N v1 @' s2 ?4 Q$ mType "uniconv -help" for more information.- s! T' f1 g+ \9 ~
Type "uniconv -describe <encoding>" for more information about an encoding.- P4 j2 f* t3 y- I) o2 W2 F2 h
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect, ! M0 ]! ?4 t5 L" O; b
CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
- t( r- ]5 N! H9 i4 Q EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
4 f; X, X' Q+ S) M7 Z; z+ k% y ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect, - ~4 a% f' J4 E# L% E
Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2, & O6 u% [: ~3 d" s1 j
Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
* |, @; C8 R4 ~! P& w, N! a UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
3 N8 f0 v2 ], \0 G- h+ z Unicode11:BOM:Java, Unicode11:BOM:little-endian,
+ m9 x3 \! g( s0 q3 j* b Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
3 D. |& g2 `3 Z9 o( } Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
/ n( n& h* r& r: @( W Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java, 1 S! E5 L' @% K8 R0 [: J
Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
5 E4 u5 L% x! P0 n- W Unicode20:UCS2, UTF7, UTF8+ j) l0 u+ U& Q/ u- Z8 Z1 q) i; W% ~
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter, + y# F8 ? s5 R7 c" n
OtherLetter, AnyLetter, NonSpacingMark, CombiningMark, . r1 e) l; c% ~( v8 h- I7 X: _( I- Y; d5 ?
DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation, 2 Y4 R7 h- S- O0 U
ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol, * P) g# m6 {; }6 w# S3 ^- j
OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator, " N& \; A/ B7 @6 @) M! A
ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
; J0 h$ J! E2 ? Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari,
+ E4 ^. r$ ~( l& O. Z, Q2 ? Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
( } }' E( J& n6 `; A5 S Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
" @2 V" s& F- @ Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
) o: R& O7 }4 L+ w1 f UndefinedWidth, Fullwidth, Halfwidth
$ i' o7 d4 i0 }1 uTransforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
5 v8 u& c- A# ]1 Q( H: Q/ C ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark,
4 Q# X, T( j2 a& D5 o3 g1 {1 g Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
, V( p% N1 X5 ]0 ~1 c; X& q ToLineSeparator, ToCanonical, ToTraditionalChinese,
c3 `% |. O7 C% S5 U ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana, , b. u }6 p/ r. L+ `$ v! h" x
KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
- k- E5 s3 ^7 Y ToLatinNumber, FromSGMLEntity |