answersLogoWhite

0

What is mojibake?

Updated: 11/10/2022
User Avatar

Kamuna

Lvl 1
15y ago

Best Answer

A mojibake is the happenstance of incorrect, unreadable characters shown when computer software fails to render a text correctly according to its associated character encoding.

User Avatar

Wiki User

15y ago
This answer is:
User Avatar

Add your answer:

Earn +20 pts
Q: What is mojibake?
Write your answer...
Submit
Still have questions?
magnify glass
imp
Related questions

What are the difference between character symbol '1' and string 1 inside the computer How they will be presented in the computer memory Can this affect the application performance?

It really depends.As a numerical value, it can be stored as 0x01 as a 8-bit (1-byte) value, or in any other permutation for data values for numbers (e.g. 8/16/32-bit [un]signed integer*, 32-bit single/double precision floating point values**)***As text data, it will be 0x31 (ASCII value 49) in ASCII and UTF-8 (8-bit Unicode), while in other text encodings may be different (16-bit Unicode in little and big endian, MS Shift-JIS, EUC, GB, Big5). Also to mention if the character "1" is represented as a single char value, it would only contain the data for that single character. It "might" be bigger if it's stored into a string, as in certain programming languages a stringis a vector (resizable array) of individual char values.*For signed and unsigned integers, this refers to whether these [whole] numbers have a positive or negative value. For unsigned integers, the values go from 0 to whichever is the maximum.**Floating point values as opposed to integers are just numbers with decimal points (e.g. everyone knows the approximate value of pi is 3.1415.... and so on). As far as single or double-precision, that just refers to how many more numbers the computer can stuff after the decimal point.***Text encoding is important in computers as every country and language has different requirements to identify each character as a specific value. Using improper encodings will cause a phenomenon called mojibake (Japanese, refers to "text corruption" or "garbled text"). MS Shift-JIS, EUC, the GB series, and Big5 are all encodings specific to East Asian text encoding codepages (particularly Japanese, Korean, and Simplified/Traditional Chinese) where as ASCII/ANSI and a few unmentioned are for Latin scripts (English, Spanish, French, Italian, and Portuguese, and possibly a few other European/Scandinavian languages). UTF-8 and UTF-16LE and UTF16-BE are efforts by the Unicode Consortium to create a encoding that will encompass the world's languages and scripts (Latin alphabet [Americas/Europe/Turkish?], Greek alphabet, African languages[Ethiopian], Native/aboriginal scripts [Cherokee], Arabic/Farsi, Indic scripts [Devanagari, Sanskrit], Cyrillic [Russian, Mongolian, Ukranian, etc], CJK scripts [Chinese, Japanese, Korean], and much more in the works).