一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。& c( v- d/ D4 ?6 Z" U% x) m# R
可以进行各种编码的转换,先放在这里,也许以后有用。
7 B% {1 |9 u: B( `, i5 ]% a1 f# i
5 x6 b+ x# ]0 q' O) |9 S9 B) w2 R8 w1 T0 I4 }5 |
用法说明* N$ V7 V8 L! E* n1 ]& `
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]' J% m- |' s {2 P! r3 I; H4 P5 y/ Y
<input-encoding> <input-file> <output-encoding> <output-file>
1 x7 C% g" C, @5 s <property | transform>*" Z' Q$ L; S1 z! | `
Version 2.1d3, 11/18/98
$ m( L0 Y$ q: l# i- OCopyright (c) Basis Technology Corp. 1995-1998. All rights reserved.! I! `; G& _; k% w+ W5 T S# T: \
Type "uniconv -help" for more information.
8 w8 n4 w! a$ e. A! ]5 S5 P3 rType "uniconv -describe <encoding>" for more information about an encoding.: I9 L5 e3 N& D) U5 c: y
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
# X& H) R6 q5 ~: E CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
9 B7 B2 ?; J# D) K( o9 R EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
1 d7 n% r2 R) O* c6 q7 Y4 j ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect,
% ]5 f0 E- y8 @' t( Z/ P Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2, , V( W, `$ Y5 J* r% B( Z9 x( y" q
Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai, 1 K: c/ o4 E% a& M. W& B
UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian,
/ A1 }( \5 q& J7 c6 t/ L Unicode11:BOM:Java, Unicode11:BOM:little-endian,
; H9 Z3 O+ J( h) i0 w Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
( q2 X( z; M- j* D9 R! g2 f Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2,
' c3 L8 C" A7 @: M/ p! ` Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
8 @. N. J+ Q5 _1 W( B b. Z# F Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian, : ?3 J7 g- L5 [. d4 t [
Unicode20:UCS2, UTF7, UTF80 H- I; x4 j- b
Properties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter, ) I4 s( R8 V/ ~) d, y5 n) r
OtherLetter, AnyLetter, NonSpacingMark, CombiningMark, " \) S9 p4 w# L2 L" l: z
DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation, & N9 x8 c6 V( u5 B) ` i; N8 p. E
ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol, * P9 @& X$ p, ^; K" X% e) I2 ^; P
OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
! Q g4 x: I9 I4 b9 ^9 m" c ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
0 \- j; n f2 Q; [& A& M. i Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari, 1 u& n* h/ v+ F( i& P
Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
9 S" A+ R* D2 Z& a; |: d# E0 \ Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana, - l3 }$ r8 H# ?0 u6 w, C
Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul,
V$ ?6 S3 U0 e UndefinedWidth, Fullwidth, Halfwidth
5 c! _+ n9 s0 I2 ?Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
) x8 @/ Z* h1 y, q ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark, . M# z; c0 g0 i: ^8 }( |
Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
4 f" P, h4 U2 \0 n' x4 n# S8 l ToLineSeparator, ToCanonical, ToTraditionalChinese, % Y8 v; C, ?3 C2 P
ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana,
3 U' l6 Z, \9 f; R4 V KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji, ) t2 x; `6 x+ z
ToLatinNumber, FromSGMLEntity |