I stumbled across a handy command-line encoding converter: uniconv 2.1d3.
It converts between a wide range of encodings. I'm posting it here in case it proves useful later.

Usage
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
               <input-encoding> <input-file> <output-encoding> <output-file>
               <property | transform>*
Version 2.1d3, 11/18/98
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
Type "uniconv -help" for more information.
Type "uniconv -describe <encoding>" for more information about an encoding.
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
           CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
           EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
           ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
           Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
           Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
           UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
           Unicode11:BOM:Java, Unicode11:BOM:little-endian,
           Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
           Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
           Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
           Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
           Unicode20:UCS2, UTF7, UTF8
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
            OtherLetter, AnyLetter, NonSpacingMark, CombiningMark,
            DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
            ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol,
            OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
            ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
            Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari,
            Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
            Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
            Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
            UndefinedWidth, Fullwidth, Halfwidth
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
            ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark,
            Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
            ToLineSeparator, ToCanonical, ToTraditionalChinese,
            ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana,
            KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
            ToLatinNumber, FromSGMLEntity
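
For anyone who wants to script this, here is a rough Python sketch that only assembles an argument list in the order given by the usage synopsis above. The helper name `build_uniconv_command` and the file names are my own placeholders, not part of uniconv. I am also assuming the options must precede the positional arguments as the synopsis shows, which I have not verified against the tool. The sketch does not run the program.

```python
from typing import List, Optional, Sequence


def build_uniconv_command(
    input_encoding: str,
    input_file: str,
    output_encoding: str,
    output_file: str,
    operations: Sequence[str] = (),
    subst: Optional[str] = None,
    debug: bool = False,
    direct_ascii: bool = False,
) -> List[str]:
    """Build an argv list that mirrors the usage synopsis (hypothetical helper)."""
    cmd = ["uniconv"]
    # Optional flags come first, in the order the synopsis lists them.
    if debug:
        cmd.append("-debug")
    if direct_ascii:
        cmd.append("-directASCII")
    if subst is not None:
        cmd.extend(["-subst", subst])
    # The four required positional arguments.
    cmd.extend([input_encoding, input_file, output_encoding, output_file])
    # Zero or more trailing property/transform names, per "<property | transform>*".
    cmd.extend(operations)
    return cmd


if __name__ == "__main__":
    # Encoding and transform names are taken from the help listing;
    # in.txt and out.txt are made-up file names.
    argv = build_uniconv_command(
        "GB2312", "in.txt", "Big5", "out.txt", ["ToTraditionalChinese"]
    )
    print(" ".join(argv))
```

The resulting list could be handed to `subprocess.run(argv, check=True)` on a machine where uniconv is installed. Passing a list rather than a single string avoids shell quoting problems with the `-subst` string.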