From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: Case mapping of sharp s Date: Sat, 21 Nov 2009 21:15:44 +0200 Message-ID: <833a47pu3j.fsf@gnu.org> References: <4B05A11F.5000700@gmx.de> <87iqd6gmpk.fsf@lola.goethe.zz> <87tywp7tir.fsf@uwakimon.sk.tsukuba.ac.jp> <87aayheki7.fsf@uwakimon.sk.tsukuba.ac.jp> <87fx88aw6a.fsf@lola.goethe.zz> <87zl6fdbeu.fsf@uwakimon.sk.tsukuba.ac.jp> Reply-To: Eli Zaretskii NNTP-Posting-Host: lo.gmane.org X-Trace: ger.gmane.org 1258841503 31740 80.91.229.12 (21 Nov 2009 22:11:43 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 21 Nov 2009 22:11:43 +0000 (UTC) Cc: dak@gnu.org, rms@gnu.org, monnier@iro.umontreal.ca, emacs-devel@gnu.org To: "Stephen J. Turnbull" Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sat Nov 21 23:11:35 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1NByAB-0003SB-U1 for ged-emacs-devel@m.gmane.org; Sat, 21 Nov 2009 23:10:40 +0100 Original-Received: from localhost ([127.0.0.1]:33298 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NBvRz-0007I1-7e for ged-emacs-devel@m.gmane.org; Sat, 21 Nov 2009 14:16:51 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NBvR4-00066z-TT for emacs-devel@gnu.org; Sat, 21 Nov 2009 14:15:54 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1NBvR0-00064v-1o for emacs-devel@gnu.org; Sat, 21 Nov 2009 14:15:54 -0500 Original-Received: from [199.232.76.173] (port=54272 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NBvQz-00064q-U0 for emacs-devel@gnu.org; Sat, 21 Nov 2009 14:15:49 -0500 Original-Received: from mtaout21.012.net.il ([80.179.55.169]:47962) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1NBvQv-00039J-Sh; Sat, 21 Nov 2009 14:15:46 -0500 Original-Received: from conversion-daemon.a-mtaout21.012.net.il by a-mtaout21.012.net.il (HyperSendmail v2007.08) id <0KTH00J003UFNJ00@a-mtaout21.012.net.il>; Sat, 21 Nov 2009 21:15:44 +0200 (IST) Original-Received: from HOME-C4E4A596F7 ([87.70.37.193]) by a-mtaout21.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0KTH00J9B467V600@a-mtaout21.012.net.il>; Sat, 21 Nov 2009 21:15:44 +0200 (IST) In-reply-to: <87zl6fdbeu.fsf@uwakimon.sk.tsukuba.ac.jp> X-012-Sender: halo1@inter.net.il X-detected-operating-system: by monty-python.gnu.org: Solaris 10 (beta) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:117462 Archived-At: > From: "Stephen J. Turnbull" > Date: Sun, 22 Nov 2009 02:40:09 +0900 > Cc: emacs-devel@gnu.org, rms@gnu.org, monnier@iro.umontreal.ca > > UTF-8 doesn't really have any algorithmic string-processing advantages > over Mule code. Even the fact that you can compute the length of a > character algorithmically from a UTF-8 leading byte is unimportant, > since it's much more efficient to use a table lookup for that. The > big advantage of UTF-8 is that it's based on Unicode, so characters > that never should have been distinguished in the first place don't > have to be reidentified in Lisp. Not to mention all of the useful > character data and the bidi algorithm, etc. Not that it's important, but since we are talking principles here: the bidi algorithm has nothing to do with Unicode, let alone UTF-8. It is not even the best algorithm to handle bidirectional text (rumor has it that Microsoft lobbied the consortium into the algorithm they developed for their word processors). Better algorithms with saner results were available years before UAX#9 was published, and they worked with codepoints from DOS the codepage 862 just dandy. You (Stephen) may recall that I presented one of them in Tsukuba 9 years ago.