一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。) C% B' _5 s% b8 j/ N) u" T
可以进行各种编码的转换,先放在这里,也许以后有用。
# ~5 q: E$ ^- X' o8 ?/ s6 k: h2 \( q6 ?- ?/ c
, L$ {$ o- `/ J( D+ q. j& w用法说明7 O4 q# p7 d* l0 ?
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
) T) T o# r S, M$ |+ L <input-encoding> <input-file> <output-encoding> <output-file>& r0 t. l z6 H# X% x/ _+ {$ p6 |' o
<property | transform>*
7 a2 Y1 X x7 X& N8 a4 |Version 2.1d3, 11/18/983 Y- C& t, m! M7 A5 j
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
; p Q6 Z% y" `" ~; u- uType "uniconv -help" for more information.
$ O% {; ~5 ~. {Type "uniconv -describe <encoding>" for more information about an encoding.
7 c( l. B4 T R# p) A* sEncodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
9 H6 M0 |9 s5 g2 O5 X- h CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932, + B0 J4 \. @& Q! t; p
EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ, # |3 Z" J+ L! B
ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect, 2 \$ L! H y3 s% p
Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
6 I) x1 Q& F M* C0 E Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai,
# l* L# Q. n1 `2 ?2 Z% N UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian, 7 I" A: ? L1 r. F+ H a! O
Unicode11:BOM:Java, Unicode11:BOM:little-endian,
0 {1 j; s6 y5 L Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
. V! s9 _* K- w8 ~' X Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
4 n1 U- Z5 t3 N' z) R- D Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
( r. w6 u- {' S* _# P# _ Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
) ?) ]8 B* h2 }+ |/ Y1 I" i4 R" F/ z Unicode20:UCS2, UTF7, UTF8% g( P* }7 g. J6 ]! ?
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter, 2 p; L8 X5 r+ K& V& h
OtherLetter, AnyLetter, NonSpacingMark, CombiningMark,
, L* F8 D3 {$ f1 E$ t DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation, y E" }4 w9 u
ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol, 0 P6 B% n) I1 }) Z! Y! h
OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
- t% C( }$ I* \* @3 g1 e9 ? ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript, / I1 Y" W0 T- y% O' o5 ^
Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari, / F* V' L x4 a6 N1 u6 w
Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
; A6 s' k. ?$ x) h6 K" n: v Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
0 `1 @- o; [ ~/ n' X" d+ u: b Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul, 5 B# f V: R2 b
UndefinedWidth, Fullwidth, Halfwidth9 @4 L8 E- A9 ~/ V+ ~' ]
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana, - J) L# W* Z) `: [) Y2 G
ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark, 2 [8 T' E$ `; v: p7 j, W
Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator, ) X7 c7 h4 |. {9 ~" ?
ToLineSeparator, ToCanonical, ToTraditionalChinese,
/ W5 \/ w% m5 p# S- ] ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana, % E: ?' d$ w. K( Y" j5 {8 `
KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
7 y. V" a- M, t$ Z( @ ToLatinNumber, FromSGMLEntity |