From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: =?utf-8?q?J=C3=A9r=C3=B4me_Marant?= Newsgroups: gmane.emacs.devel Subject: Re: Fwd: Problem with non-bmp unicode Date: Sat, 18 Nov 2006 14:22:47 +0100 Message-ID: <200611181422.47603.jerome@marant.org> References: <200611101127.47456.jerome@marant.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1163860173 22994 80.91.229.2 (18 Nov 2006 14:29:33 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 18 Nov 2006 14:29:33 +0000 (UTC) Cc: Kenichi Handa Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sat Nov 18 15:29:31 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1GlRCE-00040d-PA for ged-emacs-devel@m.gmane.org; Sat, 18 Nov 2006 15:29:31 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GlRCE-0005uy-6Q for ged-emacs-devel@m.gmane.org; Sat, 18 Nov 2006 09:29:30 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1GlQAD-0007Ur-H6 for emacs-devel@gnu.org; Sat, 18 Nov 2006 08:23:21 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1GlQAB-0007SI-Tn for emacs-devel@gnu.org; Sat, 18 Nov 2006 08:23:20 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GlQAB-0007S5-M5 for emacs-devel@gnu.org; Sat, 18 Nov 2006 08:23:19 -0500 Original-Received: from [212.27.42.35] (helo=smtp5-g19.free.fr) by monty-python.gnu.org with esmtp (Exim 4.52) id 1GlQAA-0006c8-SK for emacs-devel@gnu.org; Sat, 18 Nov 2006 08:23:19 -0500 Original-Received: from localhost.localdomain (mol92-4-82-227-97-206.fbx.proxad.net [82.227.97.206]) by smtp5-g19.free.fr (Postfix) with ESMTP id 25EB627BBA; Sat, 18 Nov 2006 14:22:48 +0100 (CET) Original-Received: by localhost.localdomain (Postfix, from userid 1000) id EEC399812A0; Sat, 18 Nov 2006 14:22:47 +0100 (CET) Original-To: emacs-devel@gnu.org User-Agent: KMail/1.9.5 In-Reply-To: Content-Disposition: inline X-Mailman-Approved-At: Sat, 18 Nov 2006 09:28:20 -0500 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:62420 Archived-At: Le dimanche 12 novembre 2006 03:32, Kenichi Handa a =C3=A9crit=C2=A0: > > I tried a bunch of other characters outside the BMP, all of which > > fail in the same way. Characters in the BMP work nicely. >=20 > Emacs 22 still doesn't support Unicode characters over BMP. > If you really need to handle them, please use the CVS branch > emacs-unicode-2. OK. Noted. > > Apparently, emacs 22 shows a question mark instead of "\360\220\216\2= 00" > > but trying to delete the question mark character with backspace turn = it into > > "\360\220\216". >=20 > This is written in the comment of utf-8.el. >=20 > ;; We compose the untranslatable sequences into a single character, > ;; and move point to the next character. > ;; This is infelicitous for editing, because there's currently no > ;; mechanism for treating compositions as atomic, but is OK for > ;; display. They are composed to U+FFFD with help-echo which > ;; indicates the unicodes they represent. This function GCs too much. >=20 > I tried to fix this editting problem by using > modification-hooks text property, but couldn't accomplish a > good result. Thanks for trying anyway. --=20 J=C3=A9r=C3=B4me Marant