From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Jason Rumney Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs Subject: Re: 23.0.60; Defaut encoding for XML files should be undefined (instead of utf-8) Date: Fri, 15 Feb 2008 22:54:40 +0000 Message-ID: <47B61830.6020005@gnu.org> References: <87odaifv16.fsf@mundaneum.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1203116103 11143 80.91.229.12 (15 Feb 2008 22:55:03 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 15 Feb 2008 22:55:03 +0000 (UTC) Cc: emacs-pretest-bug@gnu.org, emacs-devel@gnu.org To: Edward O'Connor Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Feb 15 23:55:27 2008 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1JQ9So-0003lG-Jb for ged-emacs-devel@m.gmane.org; Fri, 15 Feb 2008 23:55:26 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JQ9SK-0002Cb-Gz for ged-emacs-devel@m.gmane.org; Fri, 15 Feb 2008 17:54:56 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1JQ9SG-0002CW-7E for emacs-devel@gnu.org; Fri, 15 Feb 2008 17:54:52 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1JQ9SE-0002C4-Qd for emacs-devel@gnu.org; Fri, 15 Feb 2008 17:54:51 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JQ9SE-0002C1-Kt for emacs-devel@gnu.org; Fri, 15 Feb 2008 17:54:50 -0500 Original-Received: from fencepost.gnu.org ([140.186.70.10]) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1JQ9SE-0001ip-Cm for emacs-devel@gnu.org; Fri, 15 Feb 2008 17:54:50 -0500 Original-Received: from mx10.gnu.org ([199.232.76.166]) by fencepost.gnu.org with esmtp (Exim 4.67) (envelope-from ) id 1JQ9SD-00015a-WF for emacs-pretest-bug@gnu.org; Fri, 15 Feb 2008 17:54:50 -0500 Original-Received: from Debian-exim by monty-python.gnu.org with spam-scanned (Exim 4.60) (envelope-from ) id 1JQ9SB-0001iL-48 for emacs-pretest-bug@gnu.org; Fri, 15 Feb 2008 17:54:50 -0500 Original-Received: from mk-outboundfilter-4.mail.uk.tiscali.com ([212.74.114.32]) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1JQ9SA-0001i5-Pv; Fri, 15 Feb 2008 17:54:46 -0500 Original-X-Trace: 7107250/mk-outboundfilter-2.mail.uk.tiscali.com/F2S/$ACCEPTED/freedom2Surf-customers/83.67.23.108 X-SBRS: None X-RemoteIP: 83.67.23.108 X-IP-MAIL-FROM: jasonr@gnu.org X-IP-BHB: Once X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgAAAGCmtUdTQxds/2dsb2JhbAAIri8 X-IronPort-AV: E=McAfee;i="5200,2160,5231"; a="7107250" X-IronPort-AV: E=Sophos;i="4.25,360,1199664000"; d="scan'208";a="7107250" X-IP-Direction: OUT Original-Received: from i-83-67-23-108.freedom2surf.net (HELO [127.0.0.1]) ([83.67.23.108]) by smtp.f2s.tiscali.co.uk with ESMTP/TLS/DHE-RSA-AES256-SHA; 15 Feb 2008 22:54:44 +0000 User-Agent: Thunderbird 2.0.0.9 (Windows/20071031) In-Reply-To: X-detected-kernel: by monty-python.gnu.org: Genre and OS details not recognized. X-detected-kernel: by monty-python.gnu.org: Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:89187 gmane.emacs.pretest.bugs:21118 Archived-At: Edward O'Connor wrote: > My understanding is that the XML specification requires XML documents > lacking an explicit instruction to be UTF-8 or > UTF-16, so Emacs does the right thing already. > Emacs goes beyond doing the right thing at the moment. The right thing would be to guide users into using utf-8 by making that the default encoding for *new* XML files, and perhaps warning if an existing file was detected as non-utf-8 without a charset declaration in the header. Forcing users into using utf-8 by ignoring explicit requests to save the file as latin-1 and by opening latin-1 encoded files as utf-8 even when the decoding fails is not the right behaviour. Our users are not slaves to specifications.