From: Stephen Berman
Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs
Subject: 23.0.60; Non-ascii character string weirdness in Emacs Lisp mode
Date: Thu, 05 Jun 2008 11:59:36 +0200
Message-ID: <87skvs6w3b.fsf@escher.local.home>
To: emacs-pretest-bug@gnu.org
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.60 (gnu/linux)

Some change after 2008-05-29 has resulted in strange behavior of non-ASCII characters in strings in Emacs Lisp mode. I have an elisp file that contains the string "§" (that's the section character, Unicode U+00A7). When I visit that file in my Emacs builds from yesterday or today, that character appears as "\247" (backslash 247).
C-u C-x = says this:

  character: \247 (4194215, #o17777647, #x3fffa7)
  preferred charset: eight-bit (Raw bytes 0-255)
  code point: 0xA7
  syntax: w    which means: word
  buffer code: #xA7
  file code: not encodable by coding system utf-8-unix
  display: no font available

Yet I can type "§" (the section character) in Emacs Lisp mode in these builds, and it appears as it should and describe-character recognizes it.

Then, in my build from 2008-05-29, which does not have the above problem, I wrote an elisp file containing the following string:

  "§ßäöüµ"
  (in words: section German-eszet a-umlaut o-umlaut u-umlaut mu)

When I visit that file in my builds from yesterday or today, I see this:

  "Â§Ã\237Ã¤Ã¶Ã¼Âµ"
  (in words: A-with-circumflex section A-with-tilde \237 A-with-tilde currency-character A-with-tilde pilcrow A-with-tilde one-over-four A-with-circumflex mu)

When I change this file (but not the string) and resave it in my 2008-05-29 build, and then in the newer builds invoke revert-buffer on that already visited file, the string changes to the same as in the earlier build, i.e., the characters now look like they should. If, in these newer builds, I now kill the buffer and then revisit the file, the string no longer looks strange, but is the same as in the earlier build.

I am using the Xft font backend and start Emacs with -fn "Dejavu Sans Mono-10".
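For illustration only, the garbled six-character string described above is what one gets when the UTF-8 bytes of the string are read as Latin-1, and the number reported by C-u C-x = is the Emacs raw-byte character code for byte 0xA7. The Python sketch below reproduces both outside Emacs. The variable names are made up for this example, and it says nothing about which Emacs change causes the behavior; it only shows that the symptoms are consistent with a decoding mismatch.

```python
# Illustration only: reproduce the reported symptoms outside Emacs.
# Variable names are hypothetical; this is not a diagnosis of the Emacs change.

# The test string: section, eszett, a-umlaut, o-umlaut, u-umlaut, mu.
original = "\u00a7\u00df\u00e4\u00f6\u00fc\u00b5"

# UTF-8 encodes each of these characters as two bytes.
utf8_bytes = original.encode("utf-8")

# Reading those bytes as Latin-1 turns every byte into its own character,
# which yields the "A-with-circumflex section A-with-tilde \237 ..." sequence.
misdecoded = utf8_bytes.decode("latin-1")

# The damage is reversible as long as the bytes themselves are untouched.
recovered = misdecoded.encode("latin-1").decode("utf-8")

# Emacs represents raw bytes 0x80..0xFF as characters 0x3FFF80..0x3FFFFF,
# so the raw byte 0xA7 has the character code shown by C-u C-x =.
raw_byte_char = 0x3FFF00 + 0xA7

print([hex(b) for b in utf8_bytes])
print(misdecoded.encode("unicode_escape").decode("ascii"))
print(raw_byte_char, oct(raw_byte_char), hex(raw_byte_char))
```

The round trip through `recovered` matches the observation that the file on disk is intact and only its interpretation on visiting differs.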
In GNU Emacs 23.0.60.1 (i686-pc-linux-gnu, GTK+ Version 2.12.0)
 of 2008-06-05 on escher
Windowing system distributor `The X.Org Foundation', version 11.0.70200000
Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: en_US.UTF-8
  value of $XMODIFIERS: @im=local
  locale-coding-system: utf-8-unix
  default-enable-multibyte-characters: t

Major mode: Emacs-Lisp

Minor modes in effect:
  tooltip-mode: t
  tool-bar-mode: t
  mouse-wheel-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  global-auto-composition-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Recent input:
C-x C-f ~ / . e m a . d . t o d y C-x k C-x C-f m o s i t o d o C-s t C-w - p C-w C-s C-s C-s C-s C-s C-x C-f d i a d i - l o n g g C-x k C-x C-f g b C-s $ C-g C-s § C-g C-g C-x C-f t e s c h C-x C-f t e s t . e C-x k C-x k C-x k C-x C-f C-x C-f M-x r e p o r