Additional settings / "Encoding" tab ([Setup] menu)

Encoding

receive: Character encoding (Kanji code) of text received from the host.
Please refer to the available character encoding page.
transmit (Use different code): Character encoding (Kanji code) of text sent from the client.
Please refer to the available character encoding page.

Unicode

Ambiguous Characters width

Select 1 Cell or 2 Cell.
Specifies the character width assumed by the host that Tera Term connects to.
Refer to East_Asian_Width and width (cells).
For the width at which Tera Term itself draws characters, refer to Drawing resized font to fit cell width. When you change Encoding-receive from the pull-down menu, the width is automatically changed to the typical character width for that encoding. This width is only a default, not a strict recommendation; you can choose a different width to suit your font or for other reasons.

Override Emoji Characters width

When checked, the Emoji width set here overrides the character width derived from East_Asian_Width.
Refer to About Emoji width (cells).

Emoji at U+1F000 and above are always 2 cells (full-width).
Emoji below U+1F000:

- 2 Cell selected: 2 cells (full-width)
- 1 Cell selected: 1 cell (half-width)
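
The override rule above can be sketched as a small function. This is only an illustration, not Tera Term's implementation: the name `emoji_cells`, its parameters, and the tiny `SAMPLE_EMOJI` set (standing in for the full Emoji list) are assumptions made for this example. The exclusion of code points below U+0080 follows the About Emoji width (cells) section.

```python
# Hypothetical stand-in for the full Emoji list; real data is much larger.
SAMPLE_EMOJI = {0x0023, 0x263A, 0x2764, 0x1F600}


def emoji_cells(code_point, base_cells, override, selected_cells,
                emoji=SAMPLE_EMOJI):
    """Return the cell width of a code point under the Emoji override.

    base_cells     -- width already derived from East_Asian_Width
    override       -- True when "Override Emoji Characters width" is checked
    selected_cells -- 1 or 2, the width chosen for Emoji below U+1F000
    """
    # Without the override, or for non-Emoji, the base width is kept.
    if not override or code_point not in emoji:
        return base_cells
    # Code points below U+0080 are not treated as Emoji.
    if code_point < 0x80:
        return base_cells
    # Emoji at U+1F000 and above are always full-width.
    if code_point >= 0x1F000:
        return 2
    # Emoji below U+1F000 follow the selected width.
    return selected_cells


if __name__ == "__main__":
    for cp in (0x0023, 0x263A, 0x2764, 0x1F600):
        print(f"U+{cp:04X}: {emoji_cells(cp, 1, True, 2)} cell(s)")
```

Passing the base width in as a parameter keeps the sketch independent of how the East_Asian_Width lookup is done.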

DEC Special Graphics

Select the display method for DEC Special Graphics.

Mapping Unicode to DEC Special Graphics

Unicode characters are converted to DEC Special Graphics and displayed with the "Tera Special" font.
The character width of DEC Special Graphics is 1 cell (half-width).
Select the character types to be converted. See Mapping of Unicode to DEC special character.

Box-drawing character (U+2500-U+257F)
Punctuation, Block Elements, Shade
Middle dots(U+00B7,U+2024,U+2219)

Mapping DEC Special Graphics to Unicode

DEC Special Graphics are converted to Unicode and displayed with the VT Window font. The displayed character width differs for each character and depends on the Ambiguous Characters width setting.
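
As a rough illustration of this direction of mapping, the sketch below translates a few DEC Special Graphics codes into Unicode box-drawing characters. The table is a small subset of the assignments commonly used for the VT100 line-drawing set; it is an assumption for this example and is not taken from Tera Term's own mapping table. The names `DEC_TO_UNICODE` and `dec_to_unicode` are hypothetical.

```python
# Assumed subset of the common VT100 DEC Special Graphics assignments.
DEC_TO_UNICODE = {
    "j": "\u2518",  # lower-right corner
    "k": "\u2510",  # upper-right corner
    "l": "\u250c",  # upper-left corner
    "m": "\u2514",  # lower-left corner
    "n": "\u253c",  # crossing lines
    "q": "\u2500",  # horizontal line
    "x": "\u2502",  # vertical line
    "~": "\u00b7",  # middle dot
}


def dec_to_unicode(text):
    """Replace each mapped DEC Special Graphics code; keep others as-is."""
    return "".join(DEC_TO_UNICODE.get(ch, ch) for ch in text)


if __name__ == "__main__":
    # Draw a small box from its DEC Special Graphics codes.
    for row in ("lqqqk", "x   x", "mqqqj"):
        print(dec_to_unicode(row))
```

Because several of these Unicode characters are Ambiguous, the width of the result depends on the Ambiguous Characters width setting.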

Do not mapping

DEC Special Graphics are displayed with the "Tera Special" font. No Unicode replacement is performed.

Japanese JIS

Receive

Half-width kana: Select this entry when the host sends "HANKAKU" katakana as 7-bit codes (using SO/SI shift control). This is only enabled when Kanji (receive) is JIS.

Transmit

Half-width kana transmit: Select this entry to send "HANKAKU" katakana to the host as 7-bit codes (using SO/SI shift control). This is only enabled when Kanji (transmit) is JIS.
Kanji-in: The escape sequence sent when Kanji (transmit) is JIS. It designates the Kanji character set to G0 (^[$@ or ^[$B). Escape sequences received from the host may use any sequence, regardless of this setting.
Kanji-out: The escape sequence sent when Kanji (transmit) is JIS. It designates ASCII or JIS Roman to G0 (^[(B or ^[(J). Escape sequences received from the host may use any sequence, regardless of this setting.
NOTICE: ^[(H cannot be selected with the initial settings.
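
To see what Kanji-in and Kanji-out look like on the wire, the following sketch encodes two Kanji with Python's standard `iso2022_jp` codec and splits off the surrounding escape sequences. It only illustrates JIS (ISO-2022-JP) framing in general, not Tera Term's transmit code; in the notation above, `^[` is the ESC byte (0x1B).

```python
# "\u65e5\u672c" is the two-character word for "Japan".
text = "\u65e5\u672c"

# The standard codec switches to Kanji before the text and back to ASCII after.
encoded = text.encode("iso2022_jp")

kanji_in = encoded[:3]    # ESC $ B, written ^[$B above
payload = encoded[3:-3]   # two 7-bit bytes per Kanji
kanji_out = encoded[-3:]  # ESC ( B, written ^[(B above)

if __name__ == "__main__":
    print("Kanji-in :", kanji_in)
    print("payload  :", payload)
    print("Kanji-out:", kanji_out)
```

The codec always chooses one Kanji-in and one Kanji-out sequence; the dialog setting exists because some hosts expect the alternative sequences.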

Unicode setting

Character width (cells)

Character width for single-byte character code such as Latin-1 is 1 cell.

Character width for double-byte character code such as Shift_JIS is 1 cell for 1-byte characters and 2 cells for 2-byte characters.

In Unicode, the width of a given character is not fixed; it varies case by case.

Example: "§" (section sign, section mark)

| code               | character code(code point) | cell   |
|--------------------|----------------------------|--------|
| ISO8859-1(Latin-1) | 0xA7                       | 1      |
| Shift_JIS(CP932)   | 0x8198                     | 2      |
| KS5601(CP949)      | 0xA1D7                     | 2      |
| Big5(CP950)        | 0xA1B1                     | 2      |
| GB2312(CP936)      | 0xA1EC                     | 2      |
| Unicode            | 0xA7 (U+00A7)              | 1 or 2 |

In a multibyte character code (CJK) environment, a width of 2 cells is natural for such a character, while in other environments 1 cell is natural. Characters whose width changes in this way are called Ambiguous. Refer to East_Asian_Width and width (cells) for details.
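
The byte values for "§" in the legacy encodings can be reproduced with Python's standard codecs, as sketched below. The codec names are Python's own aliases for these code pages, and the helper `encoded_hex` is a hypothetical name used only in this example. A 1-byte result corresponds to 1 cell and a 2-byte result to 2 cells.

```python
SECTION_SIGN = "\u00a7"

# Label from the example table -> Python codec name.
ENCODINGS = {
    "ISO8859-1 (Latin-1)": "latin-1",
    "Shift_JIS (CP932)": "cp932",
    "KS5601 (CP949)": "cp949",
    "Big5 (CP950)": "cp950",
    "GB2312 (CP936)": "cp936",
}


def encoded_hex(ch, codec):
    """Return the encoded bytes of ch as an uppercase hex string."""
    return "0x" + ch.encode(codec).hex().upper()


if __name__ == "__main__":
    for label, codec in ENCODINGS.items():
        n_bytes = len(SECTION_SIGN.encode(codec))
        print(f"{label:20} {encoded_hex(SECTION_SIGN, codec):8} {n_bytes} byte(s)")
```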

About displayed characters

Test text files in the Tera Term repository can be displayed to check how characters are rendered.

Unicode (Kanji) width
"wget https://github.com/TeraTermProject/teraterm/raw/main/tests/unicodebuf-east_asian_width.txt -O -"
Unicode Emoji width
"wget https://raw.githubusercontent.com/TeraTermProject/teraterm/main/tests/unicodebuf-text-emoji.txt -O -"

Please note the following:

The font is displayed either unchanged or resized.
See Font setting.
The character width handled by Tera Term and the width intended by the host may differ.
Examples of programs and libraries:
- Text editor(vim,emacs,nano etc)
- viewer(more,less,lv etc)
- readline library linked to bash, etc.
- curses library linked to aptitude, etc.
The character to be displayed may not be present in the font file.
- If the glyph for the character to be displayed does not exist,
  the OS draws a black dot. Depending on the OS version, another character may be used.
- Some OS versions automatically draw the character using another font.
  This feature is called font linking.

East_Asian_Width and width (cells)

Each character is assigned an East_Asian_Width property.

There are six property values, and a character width is defined for each.

In addition, character width is interpreted in one of two ways:

- In an East Asian context
- In a non-East Asian context

The widths are shown in the following table.

Cells (2 = full-width, 1 = half-width)

| property     | East Asian context | non-East Asian context |
|--------------|--------------------|------------------------|
| F(Fullwidth) | 2                  | 2                      |
| H(Halfwidth) | 1                  | 1                      |
| W(Wide)      | 2                  | 2                      |
| Na(Narrow)   | 1                  | 1                      |
| A(Ambiguous) | 2                  | 1                      |
| N(Neutral)   | 1                  | 1                      |
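
The table can be expressed directly in code. The sketch below uses `unicodedata.east_asian_width` from the Python standard library to look up the property and then applies the table; the function name `cells` and its `east_asian_context` parameter are assumptions for this example, and the result depends on the Unicode data version bundled with the Python interpreter rather than on Tera Term's data.

```python
import unicodedata


def cells(ch, east_asian_context):
    """Return the cell width of ch according to the table above."""
    prop = unicodedata.east_asian_width(ch)
    if prop in ("F", "W"):
        return 2  # Fullwidth and Wide are always 2 cells
    if prop == "A":
        # Ambiguous: the only property whose width depends on context
        return 2 if east_asian_context else 1
    return 1      # H, Na and N are always 1 cell


if __name__ == "__main__":
    for ch in ("\u00a7", "\u3042", "a", "\uff71", "\uff21"):
        prop = unicodedata.east_asian_width(ch)
        print(f"U+{ord(ch):04X} {prop:2} "
              f"east-asian={cells(ch, True)} other={cells(ch, False)}")
```

This sketch covers only the six property values in the table; it does not handle combining characters or control codes.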

In a CJK environment, it is more natural to set the Ambiguous character width to 2 Cell.
In addition, most Japanese fonts are designed with these characters 2 cells wide.

The property of each character is determined from the following data:
http://www.unicode.org/Public/UCD/latest/ucd/EastAsianWidth.txt

About Emoji width (cells)

The Emoji property is separate from the East_Asian_Width property.

In a CJK environment, as with the East_Asian_Width property, it is more natural to display characters that are 2 bytes in a DBCS as 2 cells.

In non-CJK environments, it is natural to handle many characters as 1 cell, because 2-cell characters did not exist in traditional character codes. Emoji with code points U+1F000 or higher did not exist before Unicode, so they may be handled as 2-cell characters.

The following information is used to determine Emoji.
https://www.unicode.org/Public/UCD/latest/ucd/emoji/emoji-data.txt
However, code points less than U+0080 are not treated as Emoji.

Some Emoji have the Neutral property, and in Japan such Emoji look unnatural when rendered in 1 cell. The Emoji width can be changed to make them appear more natural.

Example

"☺", U+263A: WHITE SMILING FACE
"❤", U+2764: HEAVY BLACK HEART