From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.bugs Subject: bug#11073: 24.0.94; BIDI-related crash in redisplay with certain byte sequences Date: Tue, 03 Apr 2012 14:55:11 +0900 Message-ID: References: <83sjgzvb6w.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1333432552 16549 80.91.229.3 (3 Apr 2012 05:55:52 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Tue, 3 Apr 2012 05:55:52 +0000 (UTC) Cc: 11073@debbugs.gnu.org To: Stefan Monnier Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Apr 03 07:55:49 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1SEwib-0001yN-DU for geb-bug-gnu-emacs@m.gmane.org; Tue, 03 Apr 2012 07:55:49 +0200 Original-Received: from localhost ([::1]:47401 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SEwia-00043x-Qw for geb-bug-gnu-emacs@m.gmane.org; Tue, 03 Apr 2012 01:55:48 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:32819) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SEwiW-00043h-QD for bug-gnu-emacs@gnu.org; Tue, 03 Apr 2012 01:55:47 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1SEwiV-0004Ix-1f for bug-gnu-emacs@gnu.org; Tue, 03 Apr 2012 01:55:44 -0400 Original-Received: from debbugs.gnu.org ([140.186.70.43]:40818) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SEwiU-0004Is-UZ for bug-gnu-emacs@gnu.org; Tue, 03 Apr 2012 01:55:42 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1SEwio-0006cN-C4 for bug-gnu-emacs@gnu.org; Tue, 03 Apr 2012 01:56:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Kenichi Handa Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 03 Apr 2012 05:56:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 11073 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 11073-submit@debbugs.gnu.org id=B11073.133343253925408 (code B ref 11073); Tue, 03 Apr 2012 05:56:02 +0000 Original-Received: (at 11073) by debbugs.gnu.org; 3 Apr 2012 05:55:39 +0000 Original-Received: from localhost ([127.0.0.1]:37356 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SEwiQ-0006bk-EC for submit@debbugs.gnu.org; Tue, 03 Apr 2012 01:55:39 -0400 Original-Received: from mx1.aist.go.jp ([150.29.246.133]:34558) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SEwiN-0006bc-F1 for 11073@debbugs.gnu.org; Tue, 03 Apr 2012 01:55:37 -0400 Original-Received: from rqsmtp2.aist.go.jp (rqsmtp2.aist.go.jp [150.29.254.123]) by mx1.aist.go.jp with ESMTP id q335tDPE010924; Tue, 3 Apr 2012 14:55:13 +0900 (JST) env-from (handa@m17n.org) Original-Received: from smtp4.aist.go.jp by rqsmtp2.aist.go.jp with ESMTP id q335tCK0025726; Tue, 3 Apr 2012 14:55:12 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp4.aist.go.jp with ESMTP id q335tBFn026094; Tue, 3 Apr 2012 14:55:11 +0900 (JST) env-from (handa@m17n.org) In-Reply-To: (message from Stefan Monnier on Tue, 03 Apr 2012 00:22:32 -0400) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:58483 Archived-At: In article , Stefan Monnier writes: > > Please note that not all characters in the code-space of a > > CJK charset are unified. For instance, Big5 has it's own > > PUA (private use area), and characters in PUA are not > > unified by default. So, if Emacs reads a Big5 file that > > contains PUA chars, those chars stay in high-area. Then, > > one can provide his own unification map that also maps PUA > > chars to some Unicode chars as this: > > (unify-charset 'big5 "MyBig5.map") > > After this, I thought that previously read PUA chars staying > > in the high-area should be treated as the corresponding > > Unicode chars (in displaying, search, etc). > But again, this unification takes place during decoding. No. In the above scenario, PUA chars read before the call of unify-charset are not unified. The unification should take place after the call of unify-charset. > Whereas what > I'm talking about takes place when reading the internal utf-8 > representation, which should be already unified. I'm talking about exactly that case. --- Kenichi Handa handa@m17n.org