From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Gernot Hassenpflug Newsgroups: gmane.emacs.help Subject: Re: Meta-Characters, Special Characters Date: Sun, 03 Jun 2007 00:39:28 +0900 Organization: Hase at Home on GNU/Linux Debian unstable Message-ID: <87tztqnz67.fsf@aikishugyo.dnsdojo.org> References: <5c2mbdF2ung8hU1@mid.individual.net> <1180481373.651591.253210@i38g2000prf.googlegroups.com> <873b1eudkj.fsf@nict.go.jp> <87myzjt57b.fsf@catnip.gol.com> <87myziq3qe.fsf@aikishugyo.dnsdojo.org> <856466x0i5.fsf@lola.goethe.zz> Reply-To: Gernot Hassenpflug NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1180798840 16884 80.91.229.12 (2 Jun 2007 15:40:40 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 2 Jun 2007 15:40:40 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Sat Jun 02 17:40:38 2007 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1HuViW-0002Pe-Ku for geh-help-gnu-emacs@m.gmane.org; Sat, 02 Jun 2007 17:40:36 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1HuViW-0000ou-41 for geh-help-gnu-emacs@m.gmane.org; Sat, 02 Jun 2007 11:40:36 -0400 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!news.tele.dk!news.tele.dk!small.news.tele.dk!news-fra1.dfn.de!storethat.news.telefonica.de!telefonica.de!news.germany.com!news.motzarella.org!not-for-mail Original-Newsgroups: gnu.emacs.help,comp.emacs Original-Lines: 40 Original-X-Trace: tomate.motzarella.org U2FsdGVkX1/DXguGnscYzCuu4XvoJpESuNBtcwIrcgKbpO3LxhNHYq2wuAzLWYoO+8CdYxE/nyULME9N2GUHePO97vHrQ+88jadTbVuPAwE0BI5LnHZIYvLW2lDMqLokVn8uKZraYb3p0fncm0RiMw== Original-X-Complaints-To: Please send complaints to abuse@motzarella.org with full headers Original-NNTP-Posting-Date: Sat, 2 Jun 2007 15:40:00 +0000 (UTC) X-Window-System: x X-Auth-Sender: U2FsdGVkX1+WRoFlbQrWip7Dy4IedJjcmd+ndxfDrzRih0VVqy4obA== Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAAElBMVEWSOz8aFCdvW1mtXVmv lpldMjj6TOyFAAACM0lEQVR4nGWUUY7cIAyGPYpyANDy3pjwTvHMAZpwgIRd3/8q/Q1Md6Wi EZP4w/ZvEyBVFpH4Ul5EMBfG2OtGzFFKAWQpMcGofudab9IEC7NwrPjbObmPg/fDEXzN47nX U1JjA5X3zZEyl8j8rFVkjRWgVq43PLhugfWrHhKXeqbbwOYCCR+cIKOeBerodgA2iFvRto+X MynAeCFtqrHOccDu/AQgT3vq0x8A9wbJy4zEanb3NUFs8hzra+rgY+Yo+llPW2/1b+8kFFrk gPpY1SQFTAYOalIktPpqPRabrGAqCG0t915mJdwJHjY6RYq+9lEeVk+9jkpZDnSqHhpURx4A tL2kT3TzTE4vohWkA3YksesPSsN+dYD9sMruaTevCy/PA6DbDaDnlHvjg7ADOB9wRgjsRgbO ik0yIKvrIi9aRC3W6kI2QL1WXQNscECqW/NmwOwBbnqt+HXN9Ab+ergeBakWkpdFIUvRXiMU qWWSnGGjxwA2EB+iTd0/MGpeLxnjtwGrjpF3XcdqfJBiQnvuhPAZVcvSgUUaYOO8UO8JpvMb NDs7qwmCx0+Aj+HMELX0JAAzR2DPMdMYyw/gcV5anmbzoAna1tjHWbXMs2CAN04a31VD7QQ4 TpxSzHO5gUcHHqISji42b8nmRTJAS561TV3Dbe2AIcrjHBe6fs1wLwPBcodmF8U7O24OAN9F mf34To8PlRpEbalfLPIN1C4A6NX/gFJMvjVLgXPQG2IAG/oXQjHCyIHx6H4AAAAASUVORK5C YII= X-Face: @/]4<8I6V+nDm_ddk^dXq4/vw|(yVB5=enLEe^kP8/d{\F&yMgrFJ1:8@ToxjpTte[5n[= Rtl"P=cy[+Qm]{Q$+ftU`X}pfp7hq Cancel-Lock: sha1:QjNC2CGCbT7Qx6yrlQOa06fZZ38= User-Agent: Gnus/5.110006 (No Gnus v0.6) Emacs/22.0.95 (gnu/linux) X-URL: http://aikishugyo.dnsdojo.org X-Jost-Rating: Gernot X-Attribution: Gernot Hassenpflug Original-Xref: shelby.stanford.edu gnu.emacs.help:149029 comp.emacs:94413 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:44617 Archived-At: David Kastrup writes: > Gernot Hassenpflug writes: > >> Miles Bader writes: >> >>> Gernot Hassenpflug writes: >>>> I am happy to note that Windows too stores its iinformation in UTF-8 >>>> internally, no matter what the user's settings for a particular >>>> program may be. >>> >>> I thought windows used something a bit more annoying and ad-hoc, UCS-16 >>> or something like that. >> >> Oh, you may be right there, I should have qualified my statement: as >> opposed to a Windows-specific charset I think Windows uses a >> universal charset. I am not sure why UCS-16 is more ad-hoc than >> UTF-8, but I would be more than happy if linux instead of UTF-8 >> moved to UTF-16 or UTF-32, in view of the many charsets I need in my >> work. I am not nearly educated enough on this topic to hold a >> coherent conversation however, still reading. -- Grrr!! ...Pick a >> reason... > > As soon as you leave the UTF-16 base plane, you need to deal with > surrogate character pairs. The issues are pretty much the same as > when dealing with UTF-8, and you get the additional complications of > wide characters, quite more conspicuous byte order marks, Endianness > portability problems and so on. > > In short: this buys you positively nothing unless you restrict > yourself to the base 16-bit subset (which makes this infeasible for a > number of tasks). And even then, the disadvantages are not really in > a good balance with the advantages. Thanks for the explanation. In view of this, I assume at least some experts are exploring the possibility of introducing 16-bit bytes. Problems with legacy systems are probably unsurmountable at present though... -- Grrr!! ...Pick a reason...