Why is it that emails written in non-Latin languages sometimes display correctly, while other times they just appear as squares and question marks?

Given the nature of the Internet, your email might potentially be handled by software which is decades old. Protocols for handling email are very old, and that means that your email might potentially be handled by software that can't deal with any form of Unicode at all -- or any other non-ASCII character set for that matter. The content of email can be mangled by such software.

Newer protocols have been designed to avoid this kind of problem in handling character sets, but because of the widely distributed nature of the email infrastructure, potential points of failure of character set conversion may exist for decades yet, until everything is using Unicode correctly.

There is no way to diagnose a particular email problem without exact details of scenarios, because so many pieces of software and so many different protocols are involved. These kinds of problems can creep in at any of the interfaces between them -- or sometimes even internally to some particular piece of software.

Simply saying "My email doesn't work for script X" is usually all that an end user knows, but that is a little like a patient approaching a doctor saying "I have a fever." Something is clearly wrong, but its cause could be any of hundreds of things and requires detailed diagnosis of what is happening on a case-by-case basis.

There simply is no single satisfactory and satisfying answer to the "My email is broken" query, because nobody in the IT field -- not even the email specialists in the IETF -- has mastery of all the software that could be involved and which could be going haywire in character conversions someplace.

No comments:

topics