From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.bugs Subject: bug#11073: 24.0.94; BIDI-related crash in redisplay with certain byte sequences Date: Thu, 29 Mar 2012 14:19:50 +0900 Message-ID: References: <83sjgzvb6w.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1332998447 3863 80.91.229.3 (29 Mar 2012 05:20:47 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Thu, 29 Mar 2012 05:20:47 +0000 (UTC) Cc: 11073@debbugs.gnu.org To: Stefan Monnier Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu Mar 29 07:20:46 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1SD7mv-0005A2-NL for geb-bug-gnu-emacs@m.gmane.org; Thu, 29 Mar 2012 07:20:45 +0200 Original-Received: from localhost ([::1]:40455 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SD7mu-0004AR-T2 for geb-bug-gnu-emacs@m.gmane.org; Thu, 29 Mar 2012 01:20:44 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:42369) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SD7mr-00049k-6V for bug-gnu-emacs@gnu.org; Thu, 29 Mar 2012 01:20:42 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1SD7mp-0007kO-Eo for bug-gnu-emacs@gnu.org; Thu, 29 Mar 2012 01:20:40 -0400 Original-Received: from debbugs.gnu.org ([140.186.70.43]:37149) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SD7mp-0007kA-BC for bug-gnu-emacs@gnu.org; Thu, 29 Mar 2012 01:20:39 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1SD8HD-00074h-OW for bug-gnu-emacs@gnu.org; Thu, 29 Mar 2012 01:52:03 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Kenichi Handa Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 29 Mar 2012 05:52:03 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 11073 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 11073-submit@debbugs.gnu.org id=B11073.133300031627181 (code B ref 11073); Thu, 29 Mar 2012 05:52:03 +0000 Original-Received: (at 11073) by debbugs.gnu.org; 29 Mar 2012 05:51:56 +0000 Original-Received: from localhost ([127.0.0.1]:43981 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SD8H4-00074L-Uc for submit@debbugs.gnu.org; Thu, 29 Mar 2012 01:51:56 -0400 Original-Received: from mx1.aist.go.jp ([150.29.246.133]:42669) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SD8GV-00073c-RA for 11073@debbugs.gnu.org; Thu, 29 Mar 2012 01:51:53 -0400 Original-Received: from rqsmtp2.aist.go.jp (rqsmtp2.aist.go.jp [150.29.254.123]) by mx1.aist.go.jp with ESMTP id q2T5JqOJ018654; Thu, 29 Mar 2012 14:19:52 +0900 (JST) env-from (handa@m17n.org) Original-Received: from smtp4.aist.go.jp by rqsmtp2.aist.go.jp with ESMTP id q2T5JqhF029699; Thu, 29 Mar 2012 14:19:52 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp4.aist.go.jp with ESMTP id q2T5Jonp018772; Thu, 29 Mar 2012 14:19:50 +0900 (JST) env-from (handa@m17n.org) In-Reply-To: (message from Stefan Monnier on Mon, 26 Mar 2012 08:23:58 -0400) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:58278 Archived-At: In article , Stefan Monnier writes: > I understand this part. The part I don't understand is why we do > unification when reading a char from the buffer's text. That is: why > unify chars in `int' (or Lisp_Object) form but not in the > internal-utf-8 representation? > I would expect the unification to happen during encoding/decoding Usually, yes. But as far as there is a code space in high area for a CJK charset, it is unavoidable to have a buffer/string that contains a character represented by a byte sequence in that high area as the test case of Bug#11073. And, as "unification" means to treat such a character the same way as the unified character, I thought they both have the same character code. --- Kenichi Handa handa@m17n.org