Additional settings / "Encoding" tab ([Setup] menu)

Encoding

receive
Kanji character that is received from host.
Please refer to available character encoding page.
transmit
Kanji character that is sent from client.
Please refer to available character encoding page.

Unicode

Ambiguous Characters width

Select 1 Cell or 2 Cell.
Specify character width assumed by Tera Term connecting destination.
Refer to East_Asian_Width and width (cells).
Refer to Drawing resized font to fit cell width for Tera Term drawing character width. If you change Encoding-receive by pull-down, width is changed to typical characte witdh of that code automatically. Nevertheless, that width is not absolutely recommend, you can choose the width what you want for font or other reasons.

Override Emoji Characters width

When checked, Overrides characters width from East_Asian_Width.
Refer to About Emoji width (cells).

DEC Special Graphics

Select display method for DEC Special Graphics
Mapping Unicode to DEC Special Grahpics
Unicode is converted to DEC Special Graphic, displays with "Tera Special" font.
Character width of DEC Special Graphic is 1 cell (half-width).
Select character type to be converted. See Mapping of Unicode to DEC special character
Line drawing characters(U+2500-U+257F)
Punctuation, Block Elements, Shade
Middle dots(U+00B7,U+2024,U+2219)
Mapping DEC Special Grahpics to Unicode
DEC Special Graphic is converted to Unicode, displays with VT Window font. The character width displayed in Unicode differs for each character. It changes depending on Ambiguous Characters width setting.

Japanese JIS

Reveive

Half-width kana
When "HANKAKU" katakana code from host is described for 7bit (Shift control with SO/SI is using), please select this entry. This is only enabled when Kanji (receive) is JIS.

Transmit

Half-width kana
When "HANKAKU" katakana code from host is described for 7bit (Shift control with SO/SI is using), please select this entry. This is only enabled when Kanji (transmit) is JIS.
Kanji-in
The escape sequence is used when Kanji (transmit) is JIS. (Sequence regarding Kanji character is specified to G0. ^[$@ or ^[$B). The escape sequence from host can use any sequence.
Kanji-out
The escape sequence is used when Kanji (transmit) is JIS. (Sequence regarding ASCII and JIS Roman code is specified to G0. ^[(B or ^[(J ). The escape sequence from host can use any sequence.

NOTICE: ^[(H can not be selected on initial setting.


Unicode setting

Character width (cells)

Character width for single-byte character code such as Latin-1 is 1 cell.

Character width for double-byte character code such as Shift_JIS is 1 cell for 1-byte characters and 2 cells for 2-byte characters.

In Unicode, character width of a single character changes case-by-case.

Example, "§" (section sign, section mark)
| code               | character code(code point) | cell   |
|--------------------|----------------------------|--------|
| ISO8859-1(Latin-1) | 0xA7                       | 1      |
| Shift_JIS(CP932)   | 0x8198                     | 2      |
| KS5601(CP949)      | 0xA1D7                     | 2      |
| Big5(CP950)        | 0xA1B1                     | 2      |
| BG2312(CP936)      | 0xA1EC                     | 2      |
| Unicode            | 0xA7 (U+00A7)              | 1 or 2 |

In a multibyte character code environment (CJK), character width should be 2 cell, and in other environments it should be 1 cell for natural use. Type of character whose width changes are called Ambiguous. Refer to East_Asian_Width and width (cells) for detail.

About displayed characters

test text in Tera Term repository can be displayed and checked. Please note the following:

East_Asian_Width and width (cells)

Each characters are assigned East_Asian_Width property.

5 types exsit, each types are defined character width.

In addition, there are two types of interpretation of character width

  1. In an East Asian context
  2. in a non-East Asian context

The following table

cells(2=full/1=half)
|              | East Asian | non-East Asian |
| property     | context    | context        |
| F(Fullwidth) | 2          | 2              |
| H(Halfwidth) | 1          | 1              |
| W(Wide)      | 2          | 2              |
| Na(Narrow)   | 1          | 1              |
| A(Ambiguous) | 2          | 1              |
| N(Neutral)   | 1          | 1              |

In CJK environment, it is more natural to set the Ambiguous character width to 2Cell.
In addition, most Japanese fonts are designed in 2Cell.

Attributes are determined based on the following data
http://www.unicode.org/Public/UCD/latest/ucd/EastAsianWidth.txt

About Emoji width (cells)

Emoji property is other propery from East Asian Width property.

In the CJK environment, as with the East_Asian_Width property, Characters that are 2 byte in DBCS, 2 cell is more natural.

In non-CJK environments, many characters handling 1 cell is natural, because 2 cell characters did not exist in traditional character codes. Emoji with code points U+1F000 or higher that did not exist before Unicode, so they may be handled as 2 cell characters.

The following information is used to determine Emoji.
https://www.unicode.org/Public/UCD/latest/ucd/emoji/emoji-data.txt
However, code points less than U+0080 are not treated as Emoji

Neutral contains Emoji, that Emoji is unnatural in Japan when rendered in 1cell. Emoji's width can be changed to make them appear more natural.

Example

"☺", U+263A
WHITE SMILING FACE
"❤", U+2764
HEAVY BLACK HEART