3 nkf \- Network Kanji code conversion Filter v2.0.5
14 is a yet another kanji code converter among networks, hosts and terminals.
15 It converts input kanji code to designated kanji code
16 such as 7-bit JIS, MS-kanji (shifted-JIS), utf-8 or EUC.
18 One of the most unique faculty of
20 is the guess of the input kanji encodings.
21 It currently recognizes 7-bit JIS, MS-kanji (shifted-JIS),utf-8 and EUC.
22 So users needn't give the input kanji code specification.
24 By default, X0201 kana is converted into X0208 kana. For
25 X0201 kana, SO/SI, SSO and
26 ESC-(-I methods are supported. For automatic code detection, nkf assumes
27 no X0201 kana in MS-Kanji. To accept X0201 in MS-Kanji, use \-X, \-x or
30 Options are shown below:
33 output 7-bit JIS code.
37 output MS-kanji (shifted-JIS) code.
40 output EUC (AT&T) code.
43 output UTF-8 (Unicode 8bit form).
46 Assume MS-Kanji and X0201 kana input. It also accepts JIS.
47 AT&T EUC is recognized as X0201 kana. Without \-x flag,
48 X0201 kana is converted into X0208.
51 Assume JIS input. It also accepts Japanese EUC.
52 This is the default. This flag does not exclude MS-Kanji.
55 Assume AT&T EUC input. It also accepts JIS.
59 Assume broken JIS-Kanji input, which lost ESC. Useful when your site is
60 using old B-News Nihongo patch. \-B1 allows any char after ESC-( or
61 ESC-$. \-B2 forces ASCII after NL.
67 MIME ISO-2022-JP/ISO8859-1 decode. (default) To see ISO8859-1 (Latin-1)
68 \-l is necessary. \-mN is for loose encoding. It allows line break in the
69 middle of the base64 encoding.
72 Decode MIME base64 encoded stream. Remove header or other part before
76 Decode MIME quoted stream. '_' in quoted stream is converted to space.
82 MIME encode. Header style. All ASCII code and control characters are
86 MIME encode. Base64 stream. Kanji conversion is performed before encoding,
87 so this cannot be used as a picture encoder. \MQ perfome quoted encoding.
90 Input and output code is ISO8859-1 (Latin-1) and ISO-2022-JP.
91 \-s, \-e and \-x are not compatible with this option.
96 length in a line. Default is 60. \-f40-0 forces 0 margin folding.
99 Allow X0201 kana in MS-Kanji.
100 X0201 is converted into X0208 Kana by default.
101 This is default in MSDOS.
104 Try to preseve X0208 kana.
105 Assume X0201 kana in MS-Kanji. And
106 do not convert X0201 kana to X0208.
107 In JIS output, ESC-(-I is used. In EUC output, SSO is used.
110 Convert X0208 alphabet to ASCII. \-Z1 converts X0208 kankaku to one
111 ASCII space. \-Z2 converts X0208 kankaku to two ASCII spaces.
114 Replacing >,<,",& into '>', '<', '"', '&' as in HTML.
117 Replacing non iso-2022-jp char into a geta character
118 (substitute character in Japanese).
131 Output result to file. The first string in arguments becomes output file name.
132 Please be careful. If there are no file arguments, nkf.out is chosen.
133 \--overwrite does rewriting. Original listed files are replaced by filtered
139 as sequence to designate JIS-kanji
146 as sequence to designate single-byte roman characters
151 {de/en}crypt ROT13/47
157 Text mode output (MS-DOS)
165 .B -L[wmu] new line mode
170 default is no conversion (output as it is).
174 New line preserving line folding.
177 hiragana/katakana translation
180 \-h3 \--hirakana-katakana
186 --fj,--unix,--mac,--msdos, --windows
187 convert for these system
189 --jis,--euc,--sjis,--mime,--base64
190 convert for named code
191 --jis-input,--euc-input,--sjis-input,--mime-input,--base64-input
194 -- ignore rest of -option
203 Itaru Ichikawa <ichikawa@flab.fujitsu.co.jp>,
204 (was ichikawa@fujitsu.JUNET)
206 a_kuroe@hoffman.cc.sophia.ac.jp (Akihiko Kuroe),
207 kono@ie.u-ryukyu.ac.jp (Shinji KONO),
208 furukawa@tcp-ip.or.jp ( Rei FURUKAWA )`
211 cannot handle some input that contains mixed kanji codes.
212 Automatic code detection
213 becomes very weak with \-x, \-X and \-S.
214 MIME encoding is very loose.
220 Thanks for those people.
254 www.ie.u-ryukyu.ac.jp/~kono/nkf/