Loading ...
Sorry, an error occurred while loading the content.

1031Multi-byte printing patch (long)

Expand Messages
  • Mike Williams
    Dec 15, 2003
    • 0 Attachment

      Here is a new patch to provide multi-byte printing from VIM. This
      version supports printing of Japanese, Traditional and Simplified
      Chinese, and Korean, including half-width characters.

      Important Note: The patch does not provide any of the necessary CMap
      or CID font files for actual printing of CJK text. You will need to
      have these already installed in your printer.

      The patch is against 6.2 p154. There is also a new set of runtime
      files for the print runtime sub-directory including a new version of
      prolog.ps. I have increased the version number of prolog.ps which
      means you cannot use it with an unpatched VIM or the patched VIM with
      the old prolog.ps - you will get error messages.

      How to do multi-byte printing?

      Setting up multi-byte printing is a bit complicated, you need to
      specify the character set to print, the character encoding to use,
      and which fonts to use.

      1) Specifying the fonts to use.
      There is a new option, printmbfont. This is a comma-separated list
      that specifies the fonts to use for various styles. The fields are:
      r:name font to use for normal text
      b:name font to use for bold text
      i:name font to use for italic text
      o:name font to use for bold italic text

      There are no default font names. The r: field has to be specified,
      the rest are optional. If a field is not specified then VIM will use
      another one as follows:
      if b: is blank, then use r:
      if i: is blank, then use r:
      if o: is black, then use b: (which if blank will use r:)

      I would expect only r: and b: to be specified, so an example of
      setting the option would be:

      :set printmbfont=r:WadaMin-Regular,b:WadaMin-Bold

      A common issue is that some multi-byte fonts do not contain
      characters for codes in the ASCII range. If this is the case with
      your font, then there is a field, c:, to tell VIM to use Courier for
      characters in the ASCII range. The field should be set to yes to use
      Courier - by default Courier is not used.

      Normally, each country's ASCII code range differs in a couple of
      characters from, er, true ASCII. If you are using Courier for
      characters in the ASCII code range, then you can specify true ASCII
      character set printing with another field, a:, which takes yes or no
      as its value.

      So, if you want to use Courier for characters in the ASCII range, and
      use the American ASCII character set along with the font example
      above, the option would then be set as follows:

      :set printmbfont=c:yes,a:yes:,r:WadaMin-Regular,b:WadaMin-Bold

      2) Specifying the character encoding to use for printing.
      This works the same way as current printing - VIM first looks for a
      value for printencoding, and if that is empty then use the value of
      encoding. If printencoding is set then any necessary conversion is
      done when printing. The recognised multi-byte encodings are the ones
      that VIM knows about. Others may be supported - I have still to
      check what iconv will handle.

      3) Specifying the character set.
      Each country has multiple possible character sets, and this is where
      the fun starts. Not all character sets can be used with all
      character encodings, for example you cannot use a Unicode encoding
      with the Simplified Chinese GBT character set. If you try to specify
      this then VIM will report an error.

      You specify the character set to use with a new VIM option -
      printmbcharset. This takes one of a fixed set of values for each
      country as follows:

      Chinese (Simplified)
      Value Description
      MAC ; Apple Mac Simplified Chinese
      GBT-90_MAC ; GB/T 12345-90 Apple Mac Simplified Chinese
      GBK ; GBK (GB 13000.1-93)
      ISO10646 ; ISO 10646-1:1993

      Chinese (Traditional)
      Value Description
      CNS_1993 ; CNS 11643-1993, Planes 1 & 2
      ETEN ; Big5 with ETen extensions
      ISO10646 ; ISO 10646-1:1993

      Value Description
      MSWINDOWS ; Win3.1/95J (JIS X 1997 + NEC + IBM extensions)
      KANJITALK6 ; Apple Mac KanjiTalk V6.x
      KANJITALK7 ; Apple Mac KanjiTalk V7.x

      Value Description
      MAC ; Apple Macintosh Korean
      MSWINDOWS ; KS X 1992 with MS extensions
      ISO10646 ; ISO 10646-1:1993

      Only certain encodings and character sets can be used together when
      printing. This means if you use utf8 encoding to edit in VIM you
      will need iconv installed to convert your text to an encoding that
      can be combined with the character set you want. The following
      tables show which cahractersets can be used with which encoding for
      each country.

      Simplified Chinese
      euc-cn gbk ucs-2 utf-8
      GB_2312-80 x
      GBT_12345-90 x
      MAC x
      GBT-90_MAC x
      GBK x
      ISO10646 x x

      Traditional Chinese
      euc-tw big5 ucs-2 utf-8
      CNS_1993 x
      BIG5 x
      ETEN x
      ISO10646 x x

      euc-jp sjis ucs-2 utf-8
      JIS_C_1978 x x
      JIS_X_1983 x x
      JIS_X_1990 x x x
      KANJITALK6 x
      KANJITALK7 x

      euc-kr cp949 ucs-2 utf-8
      KS_X_1992 x
      MAC x
      ISO10646 x x

      So, to set up printing in Japanese from a VIM encoding of utf8

      set printencoding=euc-jp
      set printmbfont=c:yes,r:WadaMin-Regular,b:WadaMin-Bold
      set printmbcharset=JIS_X_1983

      If you could try things out and report any problems you have it would
      be appreciated. Once known issue is with the page header if you use
      %= to have left and right aligned components, due to printing half-
      width characters.

      If you discover a bug in the generated output if you could send me
      details including the original text file along with your print
      settings, I would appreciate it.

      I would also be interested in any comments on the the new options
      used to configure printing, on the naming of the various country
      character sets, or any other suggestions you may have.


      Brain - the apparatus with which we think that we think.
    • Show all 3 messages in this topic