Text Encoding Converter
Convert text between different character encodings. Handle special characters, emojis, and various scripts with support for BOM markers and error handling.
Input Text
Options
Converted Text
Converted text will appear here...
About Text Encodings
- UTF-8: Variable-width encoding that can represent any Unicode character. Most common on the web.
- UTF-16: Variable-width encoding that uses 16 bits per code unit. Common in Windows and Java.
- ISO-8859-1: 8-bit encoding for Western European languages. Limited to 256 characters.
- Windows-1252: Microsoft's extension of ISO-8859-1 with additional characters.
- ASCII: 7-bit encoding limited to basic Latin characters and control codes.
BOM (Byte Order Mark) is a special marker used at the beginning of UTF-16 text to indicate whether it's little-endian (LE) or big-endian (BE).