From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: multiple destinations in charset mapping files Date: Thu, 25 Jun 2009 09:36:36 +0900 Message-ID: References: NNTP-Posting-Host: lo.gmane.org X-Trace: ger.gmane.org 1245890220 528 80.91.229.12 (25 Jun 2009 00:37:00 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 25 Jun 2009 00:37:00 +0000 (UTC) Cc: emacs-devel@gnu.org To: YAMAMOTO Mitsuharu Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Jun 25 02:36:53 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1MJcxQ-0003oV-Lm for ged-emacs-devel@m.gmane.org; Thu, 25 Jun 2009 02:36:52 +0200 Original-Received: from localhost ([127.0.0.1]:54684 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MJcxQ-0003Xj-5h for ged-emacs-devel@m.gmane.org; Wed, 24 Jun 2009 20:36:52 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1MJcxL-0003Xe-Jb for emacs-devel@gnu.org; Wed, 24 Jun 2009 20:36:47 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1MJcxG-0003VH-4t for emacs-devel@gnu.org; Wed, 24 Jun 2009 20:36:46 -0400 Original-Received: from [199.232.76.173] (port=45366 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MJcxF-0003VE-Uy for emacs-devel@gnu.org; Wed, 24 Jun 2009 20:36:41 -0400 Original-Received: from mx1.aist.go.jp ([150.29.246.133]:53767) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1MJcxF-0008Gj-7S for emacs-devel@gnu.org; Wed, 24 Jun 2009 20:36:41 -0400 Original-Received: from rqsmtp1.aist.go.jp (rqsmtp1.aist.go.jp [150.29.254.115]) by mx1.aist.go.jp with ESMTP id n5P0aa1R009016; Thu, 25 Jun 2009 09:36:36 +0900 (JST) env-from (handa@m17n.org) Original-Received: from smtp3.aist.go.jp by rqsmtp1.aist.go.jp with ESMTP id n5P0aavS014726; Thu, 25 Jun 2009 09:36:36 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp3.aist.go.jp with ESMTP id n5P0aaer026746; Thu, 25 Jun 2009 09:36:36 +0900 (JST) env-from (handa@m17n.org) Original-Received: from handa by etlken with local (Exim 4.69) (envelope-from ) id 1MJcxA-0000sT-5x; Thu, 25 Jun 2009 09:36:36 +0900 In-reply-to: (message from YAMAMOTO Mitsuharu on Wed, 24 Jun 2009 17:11:56 +0900) X-detected-operating-system: by monty-python.gnu.org: Solaris 9 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:111706 Archived-At: In article , YAMAMOTO Mitsuharu writes: > I noticed that some charset mapping files such as > etc/charsets/symbol.map contain entries where the same source is > mapped to multiple destinations, and the latter one is preferred in > decoding in such cases. > 0x20 0x0020 > 0x20 0x00A0 > (decode-char 'symbol #x20) -> 160 > But at least for symbol.map, it seems to be more natural to prefer the > former entry (e.g., SPACE vs. NO-BRAKE SPACE, GREEK CAPITAL LETTER > DELTA vs. INCREMENT). WDYT? I agree. By this script in etc/charsets: % for f in *.map; do awk '{print $1}' < $f | sort | uniq -c | grep '^ *[2-9] 0' && echo $f; done I confirmed only symbol.map and stdenc.map contain such duplications, so I regenrated those maps (simply by doing sort -r) and committed to EMACS_23_1_RC and trunk. Do you find any other maps that have duplications? --- Kenichi Handa handa@m17n.org