From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Emanuel Berg Newsgroups: gmane.emacs.help Subject: Facts for fans: encodings history (was: Re: Getting Emacs to play nice with Hunspell and apostrophes) Date: Sat, 14 Jun 2014 07:19:25 +0200 Organization: Aioe.org NNTP Server Message-ID: <87ioo4ukyq.fsf_-_@debian.uxu> References: <87ha3s71mt.fsf@debian.uxu> <87tx7rsevi.fsf@debian.uxu> <8738fbscao.fsf@debian.uxu> <87sin8use8.fsf@debian.uxu> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1402723223 22930 80.91.229.3 (14 Jun 2014 05:20:23 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 14 Jun 2014 05:20:23 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Sat Jun 14 07:20:19 2014 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WvgO2-0004Oi-B7 for geh-help-gnu-emacs@m.gmane.org; Sat, 14 Jun 2014 07:20:18 +0200 Original-Received: from localhost ([::1]:34232 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WvgO1-0006fK-8E for geh-help-gnu-emacs@m.gmane.org; Sat, 14 Jun 2014 01:20:17 -0400 Original-Path: usenet.stanford.edu!news.kjsl.com!feeder.erje.net!eu.feeder.erje.net!news.stack.nl!aioe.org!.POSTED!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 42 Original-NNTP-Posting-Host: SIvZRMPqRkkTHAHL6NkRuw.user.speranza.aioe.org Original-X-Complaints-To: abuse@aioe.org User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3 (gnu/linux) X-Notice: Filtered by postfilter v. 0.8.2 Cancel-Lock: sha1:91vani0diwycbnKLYaIho8YGI08= Mail-Copies-To: never Original-Xref: usenet.stanford.edu gnu.emacs.help:205966 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:98236 Archived-At: Emanuel Berg writes: >> Yuri Khan writes: >> >> You could order a book in an Internet shop, have >> them completely b0rk up the encoding of the shipping >> address: >> >> http://cdn.imagepush.to/in/625x2090/i/3/30/301/24.jpg >> >> Then somebody at the postal system might decode the >> characters and the package would still be delivered >> at the intended address. > > Ha-ha, unbelievable! How did that happen? First you > wrote in Russian at the Internet shop's web page - > then it got like that because of them translating > Unicode (?) to ISO-8859-1 (which is 8-bit, with the > ASCII as its lower half) - ? Why didn't the Internet > shop do it? Did they actually think that was a > language or some transcription of Russian? How was it > translated to Russian at the postal office? I can > only make out the first line: Russia, Moscow. I read an article on this: Pre-1990s: the 7-bit period. US-ASCII, with ISO 646 in Scandinavia and Finland (with 0x5B-D and 0x7B-D replaced with national chars: those were [ \ ] and { | } respectively in the US-ASCII). The 90s: 8-bits. ISO 8859-1 with the ASCII as its lower half. Russian: KOI8-R, ISO 8859-5, and CP1251. 2000s: the multi-byte era. EUC and ISO 2022-JP for CJK. Linux moves from 8859 to UTF-8, an ASCII-compatible implementation of the likely future standard Unicode/ISO 10646. -- underground experts united: http://user.it.uu.se/~embe8573