From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Thomas Morgan Newsgroups: gmane.emacs.bugs Subject: Re: decode-char & utf-8-fragment-on-decoding Date: 07 Sep 2002 21:07:35 -0400 Sender: bug-gnu-emacs-admin@gnu.org Message-ID: <87n0qtnvag.fsf@cricket.magic.csuhayward.edu> References: <87y9aii9hc.fsf@cricket.magic.csuhayward.edu> <200209040818.RAA12414@etlken.m17n.org> <87vg5lovra.fsf@cricket.magic.csuhayward.edu> <87k7lzq5se.fsf@cricket.magic.csuhayward.edu> NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1031447648 31716 127.0.0.1 (8 Sep 2002 01:14:08 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Sun, 8 Sep 2002 01:14:08 +0000 (UTC) Cc: handa@etl.go.jp, bug-gnu-emacs@gnu.org Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17nqeM-0008FH-00 for ; Sun, 08 Sep 2002 03:14:06 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10) id 17nqeN-0006KO-00; Sat, 07 Sep 2002 21:14:07 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10) id 17nqeH-0006J3-00 for bug-gnu-emacs@gnu.org; Sat, 07 Sep 2002 21:14:01 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10) id 17nqeF-0006Ir-00 for bug-gnu-emacs@gnu.org; Sat, 07 Sep 2002 21:14:00 -0400 Original-Received: from hawk.mail.pas.earthlink.net ([207.217.120.22]) by monty-python.gnu.org with esmtp (Exim 4.10) id 17nqeE-0006In-00 for bug-gnu-emacs@gnu.org; Sat, 07 Sep 2002 21:13:58 -0400 Original-Received: from user-uive8l4.dsl.mindspring.com ([165.247.34.164] helo=cricket.magic.csuhayward.edu) by hawk.mail.pas.earthlink.net with esmtp (Exim 3.33 #1) id 17nqe9-0002yv-00; Sat, 07 Sep 2002 18:13:54 -0700 In-Reply-To: (message from Dave Love on 08 Sep 2002 00:14:57 +0100) Original-To: d.love@dl.ac.uk Original-Lines: 29 Errors-To: bug-gnu-emacs-admin@gnu.org X-BeenThere: bug-gnu-emacs@gnu.org X-Mailman-Version: 2.0.11 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Bug reports for GNU Emacs, the Swiss army knife of text editors List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.bugs:3452 X-Report-Spam: http://spam.gmane.org/gmane.emacs.bugs:3452 It's indeed a bug that (eq utf-8-translation-table-for-decode (get 'utf-8-translation-table-for-decode 'translation-table)) => nil I noticed this code in international/characters.el: (modify-category-entry (make-char 'greek-iso8859-7) ?g) (let ((c #x370)) (while (<= c #x3ff) (modify-category-entry (decode-char 'ucs c) ?g) (setq c (1+ c)))) My understanding is that the first line is sufficient to put all characters from the charset `greek-iso8859-7' into the category ?g. Then the purpose of the remaining lines must be to add Greek Unicode characters into the category ?g, but they do not do that; instead, they add the corresponding characters from greek-iso8859-7 in again. So it looks like this bug has at least one practical consequence: Greek Unicode characters are not put into the category ?g. It occurred to me that even after this bug is fixed, the problem would remain if utf-8-fragment-on-decoding were enabled while this code is executed. However, this code is executed before the user has a chance to enable the option, right? So as long as fragmentation is not the default, that will be ok.