Skip to content

Character encoding

Character encoding is complicated, and the execution is not always precise. If you encounter a page not displaying as expected, you can transcode the text to show a different set of characters.

Transcode character encodings can be cycled by clicking the encoding results under the Information header of each tab.

A working example of transcoding

The yellow marked text can be clicked to transcode the text

A working example of transcoding

Here is a text document with the correct character encoding

Nonworking example of transcoding

Here is the same document with the incorrect transcoding

The default behavior lets RetroTxt try to determine the base character encoding of the text and, when needed, apply any transcoding.

UTF-8 is the most common, contemporary code set to encoded Unicode text (including Emojis), and it is nearly always in use with HTML5. ISO-8859-1 was the original code set used by Linux, the Commodore Amiga, and online during the 1990s. It is near identical to ISO-8859-15 but lacks a few European-centric characters, such as the sign.

> CP1252

Force the active tab to parse the source text using CP-1252 encoding.

> ISO8859-15

Force the active tab to display the text using ISO-8859-15, the recommended encoding for Linux and the web in the 2000s.

> USASCII

Force the active tab not to transcode and can help to make documents encoded in UTF-8 or ISO-8859-1 to display.

CP1252 >

Force the active tab to display the text using CP-1252, a familiar legacy Microsoft Windows encoding, instead of the default, USA MS-DOS, CP-437.

ISO8859-5 >

Force the active tab to parse the source text using ISO 8895-5 encoding.