一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。+ n) A l! R) K; y+ W
可以进行各种编码的转换,先放在这里,也许以后有用。3 a4 C8 L4 L! _* e9 x
' x% H3 \) K& t+ D$ k8 |
; M! V, c e9 {" O L1 j" g9 B. b
用法说明
5 C# v2 O& U$ x( V( T( X/ Dusage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
- c6 ^, l6 b, Z/ F0 _* ~ <input-encoding> <input-file> <output-encoding> <output-file>
2 H/ Y9 J7 a8 r' ^ <property | transform>*! \2 q8 l3 b: z8 m% i7 X4 \
Version 2.1d3, 11/18/98
! G3 ]9 Y# l1 }( C. \& c- WCopyright (c) Basis Technology Corp. 1995-1998. All rights reserved.$ z2 c, h; t8 e( E" P
Type "uniconv -help" for more information.
/ E. S1 S. h( ^1 N2 \& J p* E' z6 sType "uniconv -describe <encoding>" for more information about an encoding.0 A* u0 Q- ^& R$ y
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect, 1 E4 @# ~5 ]/ H- R3 e7 q/ x
CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
( b' G: @: B5 p9 f0 s# c- D* h# j EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ, $ Z8 Q6 p6 o8 b E
ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
' u# j$ y X" E/ A( Q Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
+ y% u: F) {0 F# C, k( `* x( I Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai, 0 G6 x6 k. b' P% q$ o
UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
7 n- D) p) M# u& h8 X Unicode11:BOM:Java, Unicode11:BOM:little-endian, 9 M0 J- ^* I8 B% R# E" \) L3 B
Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8, $ }0 A3 a! U6 f P6 z, C4 F' W
Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
3 l8 L/ w0 v" O" I ~3 Z/ n1 q9 U" O Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java, 8 m5 E) }! P1 h3 |
Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
; l5 Z" G# k, J$ Z6 O/ K5 _( q% M6 [ Unicode20:UCS2, UTF7, UTF8+ D# L. {' o! C5 P
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
1 G& |# k* }" T' f: y OtherLetter, AnyLetter, NonSpacingMark, CombiningMark, 1 e9 [3 T1 Z8 ]& f2 @7 E% K
DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
! l" P+ o+ I- V$ Y( K+ C ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol,
. h- k' [. k* b( b1 J OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
2 y1 N0 H6 s5 ?, B* s ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
( b5 I! ]& I( V. q% }1 o) y Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari,
( d* [5 q! N! q& x$ P Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, ) `- O7 y4 K8 ^8 ]* M& ]
Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana, 6 i; j' p& V: f8 H. X
Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
5 g# R6 i0 T# r% f- L) O UndefinedWidth, Fullwidth, Halfwidth2 {' p3 F* T/ L+ s( p: \0 X8 r
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana, ! e1 w$ G7 n- W9 y2 M# j
ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark, ) I% h4 g/ s8 D I% A% {3 d0 w( i
Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
. p/ ~+ ?" T3 n" _$ ? ToLineSeparator, ToCanonical, ToTraditionalChinese, 9 a5 k% ]( [. t& a/ w8 L0 C; @8 ~' o
ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana, ( S8 W* [# {! o
KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
9 E1 u0 |# j. t7 _, @6 l ToLatinNumber, FromSGMLEntity |