From: David Kastrup
Newsgroups: gmane.emacs.help
Subject: Re: Meta-Characters, Special Characters
Date: Sat, 02 Jun 2007 09:45:54 +0200
Organization: Organization?!?
Message-ID: <856466x0i5.fsf@lola.goethe.zz>
References: <5c2mbdF2ung8hU1@mid.individual.net> <1180481373.651591.253210@i38g2000prf.googlegroups.com> <873b1eudkj.fsf@nict.go.jp> <87myzjt57b.fsf@catnip.gol.com> <87myziq3qe.fsf@aikishugyo.dnsdojo.org>
To: help-gnu-emacs@gnu.org
User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.1.50 (gnu/linux)

Gernot Hassenpflug writes:

> Miles Bader writes:
>
>> Gernot Hassenpflug writes:
>>> I am happy to note that Windows too stores its information in UTF-8
>>> internally, no matter what the user's settings for a particular
>>> program may be.
>>
>> I thought windows used something a bit more annoying and ad-hoc, UCS-16
>> or something like that.
>
> Oh, you may be right there, I should have qualified my statement: as
> opposed to a Windows-specific charset I think Windows uses a
> universal charset. I am not sure why UCS-16 is more ad-hoc than
> UTF-8, but I would be more than happy if linux instead of UTF-8
> moved to UTF-16 or UTF-32, in view of the many charsets I need in my
> work. I am not nearly educated enough on this topic to hold a
> coherent conversation however, still reading.
> --
> Grrr!! ...Pick a reason...

As soon as you leave the base plane (the first 65536 code points),
UTF-16 makes you deal with surrogate pairs. The issues are pretty much
the same as when dealing with UTF-8, and you get the additional
complications of wide characters, much more conspicuous byte order
marks, endianness portability problems and so on.

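
To make the surrogate and byte order points concrete, here is a small sketch in Python (picked only because it is convenient for poking at encodings). The helper names `utf16_code_units` and `is_surrogate` and the sample characters are made up for illustration; only the standard `str.encode` and `codecs` BOM constants are real.

```python
# Illustration only: how UTF-8 and UTF-16 encode characters inside and
# outside the 16-bit base plane (U+0000..U+FFFF).
import codecs


def utf16_code_units(ch: str) -> list:
    """Return the 16-bit code units UTF-16 uses for the string ch."""
    data = ch.encode("utf-16-be")  # explicit byte order, so no BOM is added
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


def is_surrogate(unit: int) -> bool:
    """True if a 16-bit code unit is one half of a surrogate pair."""
    return 0xD800 <= unit <= 0xDFFF


# Hypothetical sample characters, chosen for this example.
samples = {
    "latin a": "a",              # U+0061, base plane
    "euro sign": "\u20ac",       # U+20AC, base plane
    "g clef": "\U0001d11e",      # U+1D11E, outside the base plane
}

for name, ch in samples.items():
    units = utf16_code_units(ch)
    print(
        name,
        "utf-8 bytes:", len(ch.encode("utf-8")),
        "utf-16 units:", [hex(u) for u in units],
        "surrogates:", any(is_surrogate(u) for u in units),
    )

# Byte order: the same character gives different bytes per endianness,
# which is why UTF-16 streams tend to carry a byte order mark.
little = "a".encode("utf-16-le")   # b'a\x00'
big = "a".encode("utf-16-be")      # b'\x00a'
print(little, big)
print("BOMs:", codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)

# Plain "utf-16" prepends a BOM in the machine's native byte order.
print("a".encode("utf-16").startswith(codecs.BOM_UTF16))
```

The G clef comes out as two UTF-16 code units, both in the surrogate range, so code that assumes one 16-bit unit per character breaks in much the same way as code that assumes one byte per character with UTF-8.
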
In short: this buys you positively nothing unless you restrict
yourself to the 16-bit base plane (which makes it infeasible for a
number of tasks). And even then, the advantages do not really
outweigh the disadvantages.

-- 
David Kastrup, Kriemhildstr. 15, 44793 Bochum