5 nkf - Network Kanji Filter
13 nkf __[-butjnesliohrTVvwWJESZxXFfmMBOcdILg]__ __[___file ...___]__
21 __Nkf__ is a yet another kanji code converter among networks, hosts and terminals.
22 It converts input kanji code to designated kanji code
23 such as ISO-2022-JP, Shift_JIS, EUC-JP, UTF-8, UTF-16 or UTF-32.
25 One of the most unique faculty of __nkf__ is the guess of the input kanji encodings.
26 It currently recognizes ISO-2022-JP, Shift_JIS, EUC-JP, UTF-8, UTF-16 and UTF-32.
27 So users needn't set the input kanji code explicitly.
29 By default, X0201 kana is converted into X0208 kana.
30 For X0201 kana, SO/SI, SSO and ESC-(-I methods are supported.
31 For automatic code detection, nkf assumes no X0201 kana in Shift_JIS.
32 To accept X0201 in Shift_JIS, use __-X__, __-x__ or __-S__.
36 - __-J -S -E -W -W16 -W32 -j -s -e -w -w16 -w32__
38 Specify input and output encodings. Upper case is input.
43 ISO-2022-JP (JIS code).
47 Shift_JIS and JIS X 0201 kana.
48 EUC-JP is recognized as X0201 kana. Without __-x__ flag,
49 JIS X 0201 Katakana (a.k.a.halfwidth kana) is converted into JIS X 0208.
50 If you use Windows, see Windows-31J (CP932).
63 B or L gives whether Big Endian or Little Endian.
64 0 gives whther put BOM or not.
69 B or L gives whether Big Endian or Little Endian.
70 0 gives whther put BOM or not.
74 Output is buffered (DEFAULT), Output is unbuffered.
82 Specify the escape sequence for JIS X 0208.
86 Use ESC ( @. (JIS X 0208-1978)
90 Use ESC ( B. (JIS X 0208-1983/1990 DEFAULT)
94 Specify the escape sequence for US-ASCII/JIS X 0201 Roman. (DEFAULT B)
100 - __-h[123] --hiragana --katakana --katakana-hiragana__
103 Katakana to Hiragana conversion.
107 Hiragana to Katakana conversion.
109 - __-h3 --katakana-hiragana__
111 Katakana to Hiragana and Hiragana to Katakana conversion.
115 Text mode output (MS-DOS)
117 - __-f[_m_ [- _n_]]__
119 Folding on _m_ length with _n_ margin in a line.
120 Without this option, fold length is 60 and fold margin is 10.
124 New line preserving line folding.
128 Convert X0208 alphabet (Fullwidth Alphabets) to ASCII.
132 Convert X0208 alphabet to ASCII.
136 Convert X0208 kankaku to single ASCII space.
140 Convert X0208 kankaku to double ASCII spaces.
144 Replacing fullwidth >, <, ", & into '>', '<', '"', '&' as in HTML.
148 With __-X__ or without this option, X0201 is converted into X0208 Kana.
149 With __-x__, try to preserve X0208 kana and do not convert X0201 kana to X0208.
150 In JIS output, ESC-(-I is used. In EUC output, SS2 is used.
154 Assume broken JIS-Kanji input, which lost ESC.
155 Useful when your site is using old B-News Nihongo patch.
159 allows any chars after ESC-( or ESC-$.
163 force ASCII after NL.
167 Replacing non iso-2022-jp char into a geta character
168 (substitute character in Japanese).
172 MIME ISO-2022-JP/ISO8859-1 decode. (DEFAULT)
173 To see ISO8859-1 (Latin-1) -l is necessary.
177 Decode MIME base64 encoded stream. Remove header or other part before
182 Decode MIME quoted stream. '_' in quoted stream is converted to space.
187 It allows line break in the middle of the base64 encoding.
195 MIME encode. Header style. All ASCII code and control characters are intact.
199 MIME encode Base64 stream.
200 Kanji conversion is performed before encoding, so this cannot be used as a picture encoder.
204 Perform quoted encoding.
208 Input and output code is ISO8859-1 (Latin-1) and ISO-2022-JP.
209 __-s__, __-e__ and __-x__ are not compatible with this option.
227 Without this option, nkf doesn't convert line breaks.
229 - __--fj --unix --mac --msdos --windows__
231 Convert for these systems.
233 - __--jis --euc --sjis --mime --base64__
235 Convert to named code.
237 - __--jis-input --euc-input --sjis-input --mime-input --base64-input__
241 - __--ic=_input codeset_ --oc=_output codeset___
243 Set the input or output codeset.
244 NKF supports following codesets and those codeset names are case insensitive.
248 a.k.a. RFC1468, 7bit JIS, JUNET
252 a.k.a. AT&T JIS, Japanese EUC, UJIS
258 Microsoft Version of EUC-JP.
262 a.k.a. SJIS, MS_Kanji
280 - UTF8-MAC (input only)
290 UTF-16 Big Endian without BOM
294 UTF-16 Big Endian with BOM
298 UTF-16 Little Endian without BOM
302 UTF-16 Little Endian with BOM
310 UTF-32 Big Endian without BOM
314 UTF-32 Big Endian with BOM
318 UTF-32 Little Endian without BOM
322 UTF-32 Little Endian with BOM
324 - __--fb-{skip, html, xml, perl, java, subchar}__
326 Specify the way that nkf handles unassigned characters.
327 Without this option, --fb-skip is assumed.
329 - __--prefix=_escape character__target character_..__
331 When nkf converts to Shift_JIS,
332 nkf adds a specified escape character to specified 2nd byte of Shift_JIS characters.
333 1st byte of argument is the escape character and following bytes are target characters.
337 Handle the characters extended in CP932 as unassigned characters.
339 - __--no-best-fit-chars__
341 When Unicode to Encoded byte conversion,
342 don't convert characters which is not round trip safe.
343 When Unicode to Unicode conversion,
344 with this and -x option, nkf can be used as UTF converter.
345 (In other words, without this and -x option, nkf doesn't save some characters)
347 When nkf converts strings that related to path, you should use this opion.
351 Decode hex encoded characters.
355 Unescape percent escaped characters.
357 - __--numchar-input__
359 Decode character reference, such as "&#....;".
363 - __--in-place[=___SUFFIX___]__ __--overwrite[=___SUFFIX___]__
365 Overwrite __original__ listed files by filtered result.
367 __Note__ --overwrite preserves timestamps of original files.
371 Print guessed encoding and newline. (2 is default, 1 is only encoding)
385 Ignore rest of -option.
389 Copyright (c) 1987, Fujitsu LTD. (Itaru ICHIKAWA).
391 Copyright (c) 1996-2012, The nkf Project.