一个偶然的机会找到一个命令行的编码转换利器 uniconv 2.1d3。4 [' V+ f( D' N: \6 y. T, D9 N
可以进行各种编码的转换,先放在这里,也许以后有用。( }, n! N; x, K7 I2 w% {
- B- @8 X3 F8 z' L
$ |7 N; n6 F4 e% [. a; z用法说明2 F4 M( d5 r0 I
usage: uniconv [-debug] [-directASCII] [-subst <substitute-string>]
: v1 s; H& P8 q$ E( [5 Y* _ <input-encoding> <input-file> <output-encoding> <output-file>7 l! }( w5 W7 v" C- S! d8 q
<property | transform>*. y0 e! C @' e0 N y3 w4 H# |
Version 2.1d3, 11/18/98$ _+ U8 v) \2 Z% H) |0 M
Copyright (c) Basis Technology Corp. 1995-1998. All rights reserved.
& _) m$ B# \! ~9 h. ], `) k( u8 TType "uniconv -help" for more information.
+ D4 @7 s2 q- V8 v7 D4 JType "uniconv -describe <encoding>" for more information about an encoding.9 C# U4 Y0 }/ W% u5 b
Encodings: Arabic, ASCII, big-endian, Big5, BMP, ChineseAutoDetect,
! ?6 v `, L7 i8 j& @5 F" K' ?8 w CNS-11643, CNS-11643-1986, cp1251, cp1252, cp437, cp850, cp932,
% G6 v9 N6 M- V- D+ I4 x: J: k EBCDIC, EBCDIK, EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ,
$ s8 z$ H5 C( Q, i q) a3 {5 L7 \ ISO-2022-JP, ISO-2022-KR, ISOLatinCyrillic, JapaneseAutoDetect, 0 L2 M( g- Z! n1 \* O
Java, JIS_X0201, JIS_X_0208, KoreanAutoDetect, Latin1, Latin2,
" M9 w8 W9 {$ Y9 U Latin3, Latin4, Latin5, Latin6, little-endian, Shift-JIS, Thai, 3 G, N$ J& D2 Y1 J
UCS2, Unicode11:big-endian, Unicode11:BOM:big-endian, $ E, |4 f- c' \% }
Unicode11:BOM:Java, Unicode11:BOM:little-endian, $ E, l1 X% x& y7 E ]3 c
Unicode11:BOM:UCS2, Unicode11:BOM:UTF7, Unicode11:BOM:UTF8,
8 d6 T5 b: a& g& h X/ N Unicode11:Java, Unicode11:little-endian, Unicode11:UCS2, # C g6 {7 B' M( R
Unicode11:UTF7, Unicode11:UTF8, Unicode20:BOM:Java,
# q! d, W1 z/ q' G* }7 j* ^, K& x( d/ d Unicode20:BOM:UTF7, Unicode20:BOM:UTF8, Unicode20:little-endian,
5 B# B$ S. w) _" t Unicode20:UCS2, UTF7, UTF8
- Q. e M9 y' a* h) _* MProperties: UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
- p1 L4 y6 o2 c' x5 q; I: X! y0 l OtherLetter, AnyLetter, NonSpacingMark, CombiningMark,
( s- z" w0 u4 F/ M0 M- z DecimalNumber, OtherNumber, DashPunctuation, OpenPunctuation,
Q9 D' |" B C3 t) c6 Q) i ClosePunctuation, OtherPunctuation, MathSymbol, CurrencySymbol,
7 o) w7 a( B+ i OtherSymbol, SpaceSeparator, LineSeparator, ParagraphSeparator,
* C0 h; c7 C$ l0 M6 Y: R; e ControlCharacter, OtherCharacter, UndefinedScript, GeneralScript,
7 b! s' `8 i- d# q Latin, Greek, Cyrillic, Armenian, Hebrew, Arabic, Devanagari, * f* v8 D1 K& D( N; i: }
Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, 4 Q2 d5 ]# {) e1 ~" M
Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
2 _4 t( @. @8 L8 c; D. p Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul, 8 B# ]! |, q& F9 m
UndefinedWidth, Fullwidth, Halfwidth" P# l5 u; |( A& ~" m& ~
Transforms: ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
4 f" F# [- h/ C- X1 R( M ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark, 1 D" \1 ^) i3 L3 A
Select, Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator,
4 J4 ^5 J4 [# E ToLineSeparator, ToCanonical, ToTraditionalChinese, 2 o z7 \! h/ @5 `- V
ToSimplifiedChinese, RomajiToHiragana, RomajiToKatakana,
( w2 d7 Z+ C: i, H; d B- L9 t KanaToRomaji, KanaToKunreiRomaji, KanaToHebonRomaji,
1 K, j" Z/ Q0 }% f, ~) A6 o* o ToLatinNumber, FromSGMLEntity |