From: Harald Hanche-Olsen
Newsgroups: gmane.emacs.devel
Subject: Re: Intermittent problem with unencodable-char-position
Date: Wed, 14 Apr 2010 11:42:52 -0400 (EDT)
Message-ID: <20100414.114252.374870777.hanche@math.ntnu.no>
References: <20100414.001955.448182314.hanche@math.ntnu.no> <20100414.003857.429770267.hanche@math.ntnu.no>
To: emacs-devel@gnu.org
In-Reply-To: <20100414.003857.429770267.hanche@math.ntnu.no>
X-Mailer: Mew version 6.3.50 on Emacs 24.0.50 / Mule 6.0 (HANACHIRUSATO)

+ Harald Hanche-Olsen :

> + Harald Hanche-Olsen :
>
> > Evaluating the form
> >
> >   (unencodable-char-position 0 5 'iso-latin-1-unix 1 "100 Ω")
> >
> > normally returns the list (4), since capital Omega is not encodable in
> > latin-1. However, after I have run emacs for a while, it happens that
> > this form begins to return nil [*]. I have no idea what triggers this
> > behaviour, [...]
>
> Well, lo and behold, after sending the above mail I immediately
> discovered how to trigger the problem: Sending mail does it.

After a couple hours of debugging effort I managed to drill down to
the code in mew that triggers the problem: It is this little snippet

  (apply 'set-charset-priority charset-list)

in which charset-list is a humongous list of charset names. (Included
below my signature in order to not interrupt your train of thought.)
I can undo the damage by running set-charset-priority on a much
shorter list, snipped from the head of the big one.

I have no idea why the author of mew thinks he needs to do this, but
in any case, having it influence the behaviour of
unencodable-char-position must surely be a bug?
I'll submit a bug report to that effect unless someone here jumps up
and explains why it is not a bug.

- Harald

PS. Damaging value of charset-list:

(unicode-bmp unicode iso-8859-1 ascii latin-iso8859-1 control-1
 iso-8859-2 latin-iso8859-2 iso-8859-3 latin-iso8859-3 iso-8859-4
 latin-iso8859-4 iso-8859-5 cyrillic-iso8859-5 iso-8859-6
 arabic-iso8859-6 iso-8859-7 greek-iso8859-7 iso-8859-8
 hebrew-iso8859-8 iso-8859-9 latin-iso8859-9 iso-8859-10
 latin-iso8859-10 iso-8859-11 thai-iso8859-11 iso-8859-13
 latin-iso8859-13 iso-8859-14 latin-iso8859-14 iso-8859-15
 latin-iso8859-15 iso-8859-16 latin-iso8859-16 thai-tis620
 tis620-2533 jisx0201 chinese-gb2312 chinese-gbk chinese-cns11643-1
 chinese-cns11643-2 chinese-cns11643-3 chinese-cns11643-4
 chinese-cns11643-5 chinese-cns11643-6 chinese-cns11643-7 big5
 japanese-jisx0208 japanese-jisx0208-1978 japanese-jisx0212
 japanese-jisx0213-1 japanese-jisx0213-2 japanese-jisx0213.2004-1
 cp932 korean-ksc5601 big5-hkscs cp949 viscii vscii vscii-2 koi8-r
 alternativnyj cp866 koi8-u koi8-t georgian-ps georgian-academy
 windows-1250 windows-1251 windows-1252 windows-1253 windows-1254
 windows-1255 windows-1256 windows-1257 windows-1258 next cp1125
 cp437 cp720 cp737 cp775 cp851 cp852 cp855 cp857 cp858 cp860 cp861
 cp862 cp863 cp864 cp865 cp869 cp874 unicode-smp unicode-sip
 unicode-ssp mac-roman ebcdic-us ebcdic-uk ibm1047 hp-roman8
 adobe-standard-encoding symbol ibm850 mik ptcp154 gb18030
 chinese-cns11643-15 emacs eight-bit eight-bit-control
 eight-bit-graphic latin-jisx0201 katakana-jisx0201 chinese-big5-1
 chinese-big5-2 japanese-jisx0213-a katakana-sjis cp932-2-byte
 cp949-2-byte chinese-sisheng ipa vietnamese-viscii-lower
 vietnamese-viscii-upper arabic-digit arabic-1-column arabic-2-column
 lao mule-lao indian-is13194 devanagari-cdac sanskrit-cdac
 bengali-cdac tamil-cdac telugu-cdac assamese-cdac oriya-cdac
 kannada-cdac malayalam-cdac gujarati-cdac punjabi-cdac
 devanagari-akruti bengali-akruti punjabi-akruti gujarati-akruti
 oriya-akruti tamil-akruti telugu-akruti kannada-akruti
 malayalam-akruti indian-glyph indian-1-column indian-2-column
 tibetan tibetan-1-column mule-unicode-2500-33ff
 mule-unicode-e000-ffff mule-unicode-0100-24ff ethiopic
 gb18030-2-byte gb18030-4-byte-bmp gb18030-4-byte-smp
 gb18030-4-byte-ext-1 gb18030-4-byte-ext-2)
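
For anyone who wants to check their own Emacs, the sequence described in this message can be sketched roughly as follows. This is only a sketch under assumptions: `my-omega-check` is a made-up helper name, the `charset-list` used here is assumed to be Emacs's global variable of that name (which may not hold exactly the value listed above), and I have not confirmed that reapplying the saved priority order repairs the damage the same way the shorter list does.

```elisp
;; Hypothetical helper (not part of Emacs or Mew): run the test form
;; from the original report.
(defun my-omega-check ()
  "Return the positions in \"100 Ω\" that `iso-latin-1-unix' cannot encode."
  (unencodable-char-position 0 5 'iso-latin-1-unix 1 "100 Ω"))

;; Remember the current priority order so it can be put back afterwards.
(let ((saved (charset-priority-list)))
  (unwind-protect
      (list :before (my-omega-check)
            :after (progn
                     ;; The call found in Mew.  Assumption: `charset-list'
                     ;; is the global list of defined charsets.
                     (apply #'set-charset-priority charset-list)
                     (my-omega-check)))
    ;; Assumption: reapplying the saved order is enough to restore things.
    (apply #'set-charset-priority saved)))
```

If the `:before` and `:after` values differ, that Emacs shows the behaviour reported here; if both are the same list, it does not.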