From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: owner@emacsbugs.donarmstrong.com (Emacs bug Tracking System) Newsgroups: gmane.emacs.bugs Subject: bug#2354: marked as done (23.0.90; Emacs fails to detect utf-8 encoding with language environment Latin-1) Date: Sat, 28 Feb 2009 12:30:04 +0000 Message-ID: References: <87y6w5jqqo.fsf@engster.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----------=_1235824204-12677-0" X-Trace: ger.gmane.org 1235825164 8369 80.91.229.12 (28 Feb 2009 12:46:04 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 28 Feb 2009 12:46:04 +0000 (UTC) To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Sat Feb 28 13:47:20 2009 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1LdOb6-0000Zh-QR for geb-bug-gnu-emacs@m.gmane.org; Sat, 28 Feb 2009 13:47:17 +0100 Original-Received: from localhost ([127.0.0.1]:59318 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LdOZl-0005DY-Sr for geb-bug-gnu-emacs@m.gmane.org; Sat, 28 Feb 2009 07:45:53 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1LdOXk-0004kh-KT for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2009 07:43:48 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1LdOXj-0004jv-Le for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2009 07:43:47 -0500 Original-Received: from [199.232.76.173] (port=34695 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LdOXj-0004jr-2B for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2009 07:43:47 -0500 Original-Received: from rzlab.ucr.edu ([138.23.92.77]:46496) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1LdOXd-0004k3-8a; Sat, 28 Feb 2009 07:43:41 -0500 Original-Received: from rzlab.ucr.edu 
(rzlab.ucr.edu [127.0.0.1]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id n1SChc0b016460; Sat, 28 Feb 2009 04:43:39 -0800 Original-Received: (from debbugs@localhost) by rzlab.ucr.edu (8.13.8/8.13.8/Submit) id n1SCU4NZ012793; Sat, 28 Feb 2009 04:30:04 -0800 X-Mailer: MIME-tools 5.420 (Entity 5.420) X-Loop: owner@emacsbugs.donarmstrong.com X-Emacs-PR-Message: closed 2354 X-Emacs-PR-Package: emacs X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: bug-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:25843 Archived-At: This is a multi-part message in MIME format... ------------=_1235824204-12677-0 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 Your message dated Sat, 28 Feb 2009 14:21:08 +0200 with message-id and subject line Re: bug#2497: 23.0.91; Fails to read UTF-8 on Win2k has caused the Emacs bug report #2354, regarding 23.0.90; Emacs fails to detect utf-8 encoding with language environment Latin-1 to be marked as done. This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the bug report if necessary, and/or fix the problem forthwith. (NB: If you are a system administrator and have no idea what this message is talking about, this may indicate a serious mail system misconfiguration somewhere. Please contact owner@emacsbugs.donarmstrong.com immediately.) 
-- 2354: http://emacsbugs.donarmstrong.com/cgi-bin/bugreport.cgi?bug=2354 Emacs Bug Tracking System Contact owner@emacsbugs.donarmstrong.com with problems ------------=_1235824204-12677-0 Content-Type: message/rfc822 Content-Disposition: inline Content-Transfer-Encoding: 7bit Received: (at submit) by emacsbugs.donarmstrong.com; 17 Feb 2009 10:35:40 +0000 X-Spam-Checker-Version: SpamAssassin 3.2.5-bugs.debian.org_2005_01_02 (2008-06-10) on rzlab.ucr.edu X-Spam-Level: X-Spam-Bayes: score:0.5 Bayes not run. spammytokens:Tokens not available. hammytokens:Tokens not available. X-Spam-Status: No, score=0.1 required=4.0 tests=FOURLA autolearn=no version=3.2.5-bugs.debian.org_2005_01_02 Received: from lists.gnu.org (lists.gnu.org [199.232.76.165]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id n1HAZVl4028159 for ; Tue, 17 Feb 2009 02:35:33 -0800 Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1LZNIY-0003fK-Sq for bug-gnu-emacs@gnu.org; Tue, 17 Feb 2009 05:35:30 -0500 Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1LZNIR-0003VJ-Qq for bug-gnu-emacs@gnu.org; Tue, 17 Feb 2009 05:35:25 -0500 Received: from [199.232.76.173] (port=51423 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LZNIQ-0003UY-NL for bug-gnu-emacs@gnu.org; Tue, 17 Feb 2009 05:35:22 -0500 Received: from m61s02.vlinux.de ([83.151.21.164]:48492) by monty-python.gnu.org with esmtps (TLS-1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1LZNIQ-0007hp-1t for bug-gnu-emacs@gnu.org; Tue, 17 Feb 2009 05:35:22 -0500 Received: from dslb-082-083-056-080.pools.arcor-ip.net ([82.83.56.80] helo=void) by m61s02.vlinux.de with esmtpsa (TLS-1.0:DHE_RSA_AES_128_CBC_SHA1:16) (Exim 4.63) (envelope-from ) id 1LZNKC-0001iN-Ng for bug-gnu-emacs@gnu.org; Tue, 17 Feb 2009 11:37:12 +0100 From: David Engster To: bug-gnu-emacs@gnu.org Subject: 23.0.90; Emacs fails to detect utf-8 encoding with language environment Latin-1 User-Agent: 
Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.90 (gnu/linux) Date: Tue, 17 Feb 2009 11:35:11 +0100 Message-ID: <87y6w5jqqo.fsf@engster.org> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6 (newer, 3) This is what I believe to be a regression in CVS Emacs since the 23.0.90 pretest. I'm using a fresh CVS checkout from 2009-02-17, compiled with 'make bootstrap'. You can reproduce it as follows: 1. emacs -Q 2. M-x set-language-environment RET Latin-1 RET 3. In some buffer write: (ucs-insert "2500") 4. Eval it, so that the unicode character is inserted into the buffer. 5. Save the file and choose utf-8 as encoding. 6. Kill the buffer. 7. Load the file you just saved. Result: Emacs displays "â\224\200" for the unicode character. Expected behaviour: Emacs should detect utf-8 encoding and display correct character. Please note that this has worked without problems with the Emacs 23.0.90 pretest, so it must be due to some change(s) since then in CVS. 
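The result string in the report can be checked by hand. What follows is a minimal sketch in Python, not Emacs code: it assumes the file's UTF-8 bytes were read one byte per character, as a Latin-1 reading would do, and that bytes in the range 0x80-0x9F are shown as octal escapes, which matches the display quoted in the report. The helper name show_undecoded is hypothetical.

```python
# U+2500 is the character inserted by (ucs-insert "2500") in the steps above.
char = "\u2500"
utf8_bytes = char.encode("utf-8")


def show_undecoded(data: bytes) -> str:
    """Render bytes one at a time as Latin-1 text.

    Assumption for illustration: bytes 0x80-0x9F (the C1 control range)
    are printed as three-digit octal escapes, as in the reported display.
    """
    out = []
    for b in data:
        if 0x80 <= b <= 0x9F:
            out.append("\\%03o" % b)
        else:
            out.append(bytes([b]).decode("latin-1"))
    return "".join(out)


misread = show_undecoded(utf8_bytes)   # what a byte-per-character reading yields
correct = utf8_bytes.decode("utf-8")   # what UTF-8 detection should yield

print(" ".join("%02x" % b for b in utf8_bytes))
print(misread)
```

The three bytes are e2 94 80. Byte 0xE2 is the letter â in Latin-1, and 0x94 and 0x80 are octal 224 and 200, so the output seen in step 7 is consistent with the file having been decoded as Latin-1 rather than as UTF-8.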
In GNU Emacs 23.0.90.1 (i686-pc-linux-gnu, GTK+ Version 2.12.11) of 2009-02-17 on void Windowing system distributor `The X.Org Foundation', version 11.0.10402000 configured using `configure '--prefix=/usr/local/emacs'' Important settings: value of $LC_ALL: nil value of $LC_COLLATE: nil value of $LC_CTYPE: nil value of $LC_MESSAGES: nil value of $LC_MONETARY: nil value of $LC_NUMERIC: nil value of $LC_TIME: nil value of $LANG: nil value of $XMODIFIERS: nil locale-coding-system: nil default-enable-multibyte-characters: t Major mode: Lisp Interaction Minor modes in effect: tooltip-mode: t tool-bar-mode: t mouse-wheel-mode: t menu-bar-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t global-auto-composition-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t line-number-mode: t transient-mark-mode: t Recent input: M-x r e p o r C-g M-x s e t - l a n L a t i n w - w 1 M-x r e p o r Recent messages: For information about GNU Emacs and the GNU system, type C-h C-a. Making completion list... Quit Making completion list... ------------=_1235824204-12677-0 Content-Type: message/rfc822 Content-Disposition: inline Content-Transfer-Encoding: 7bit Received: (at 2354-done) by emacsbugs.donarmstrong.com; 28 Feb 2009 12:21:20 +0000 X-Spam-Checker-Version: SpamAssassin 3.2.5-bugs.debian.org_2005_01_02 (2008-06-10) on rzlab.ucr.edu X-Spam-Level: X-Spam-Bayes: score:0.5 Bayes not run. spammytokens:Tokens not available. hammytokens:Tokens not available. 
X-Spam-Status: No, score=-2.9 required=4.0 tests=FOURLA,HAS_BUG_NUMBER autolearn=ham version=3.2.5-bugs.debian.org_2005_01_02 Received: from mtaout2.012.net.il (mtaout2.012.net.il [84.95.2.4]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id n1SCLBmc011258; Sat, 28 Feb 2009 04:21:12 -0800 Received: from conversion-daemon.i_mtaout2.012.net.il by i_mtaout2.012.net.il (HyperSendmail v2004.12) id <0KFR00H00ZLJR800@i_mtaout2.012.net.il>; Sat, 28 Feb 2009 14:21:42 +0200 (IST) Received: from HOME-C4E4A596F7 ([77.127.167.119]) by i_mtaout2.012.net.il (HyperSendmail v2004.12) with ESMTPA id <0KFR0028OZO478C2@i_mtaout2.012.net.il>; Sat, 28 Feb 2009 14:21:42 +0200 (IST) Date: Sat, 28 Feb 2009 14:21:08 +0200 From: Eli Zaretskii Subject: Re: bug#2497: 23.0.91; Fails to read UTF-8 on Win2k In-reply-to: <87d4d3u61n.fsf@engster.org> X-012-Sender: halo1@inter.net.il To: 2497-done@emacsbugs.donarmstrong.com, 2354-done@emacsbugs.donarmstrong.com Reply-to: Eli Zaretskii Message-id: References: <877i3c55tg.fsf@tum.de> <87d4d3u61n.fsf@engster.org> X-CrossAssassin-Score: 2 > From: David Engster > Date: Fri, 27 Feb 2009 18:46:12 +0100 > Cc: emacs-pretest-bug@gnu.org, 2497@emacsbugs.donarmstrong.com > > Uwe Siart writes: > > I'm using the windows port of 23.0.91 on Win2k SP4 and I found that it > > fails to read utf-8 encoded files correctly. When visiting a file in > > utf-8 encoding all characters above 255 are screwed up and "C-h C RET" > > indicates iso-latin1-dos for saving the file. This has not been an > > issue in 23.0.90. > > Maybe this is a duplicate of what I reported in > > http://emacsbugs.donarmstrong.com/cgi-bin/bugreport.cgi?bug=2354 > > As I write later in that bug report, I think I could track down this > issue to the change in revision 1.413 of src/coding.c. Maybe you could > try if the same applies to your problem. Should be fixed by this change: 2009-02-28 Eli Zaretskii * coding.c (detect_coding_charset): Fix change from 2008-10-21. 
Also, check iso-latin-*, not only iso-8859-*. ------------=_1235824204-12677-0--