From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#20623: XML and HTML files with encoding/charset="utf-8" declaration loose BOM; Coding system is reset from utf-8-with-signature to utf-8 on save Date: Sun, 12 Aug 2018 22:07:57 +0300 Message-ID: <83wosvjtde.fsf@gnu.org> References: <555E2912.7060509@gmx.net> <83iobl67ao.fsf@gnu.org> <555E44EB.6070604@gmx.net> <83egm95boc.fsf@gnu.org> <555F2D3C.6090608@gmx.net> <8660oxdyxy.fsf@realize.ch> <457eu2h1sk.fsf@fencepost.gnu.org> <837eu2xmor.fsf@gnu.org> <838teatmtv.fsf@gnu.org> <83va8hko03.fsf@gnu.org> NNTP-Posting-Host: blaine.gmane.org X-Trace: blaine.gmane.org 1534100871 2561 195.159.176.226 (12 Aug 2018 19:07:51 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Sun, 12 Aug 2018 19:07:51 +0000 (UTC) Cc: a.s@realize.ch, 20623@debbugs.gnu.org, sledergerber@gmx.net To: Stefan Monnier Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Sun Aug 12 21:07:46 2018 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1foviH-0000Wd-S6 for geb-bug-gnu-emacs@m.gmane.org; Sun, 12 Aug 2018 21:07:42 +0200 Original-Received: from localhost ([::1]:36068 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fovkN-0003HR-TT for geb-bug-gnu-emacs@m.gmane.org; Sun, 12 Aug 2018 15:09:51 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:47090) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fovje-0002p5-PE for bug-gnu-emacs@gnu.org; Sun, 12 Aug 2018 15:09:07 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fovja-0000od-N5 for bug-gnu-emacs@gnu.org; Sun, 12 Aug 2018 15:09:06 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:44004) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1fovja-0000oZ-IT for bug-gnu-emacs@gnu.org; Sun, 12 Aug 2018 15:09:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1fovja-0001yh-DK for bug-gnu-emacs@gnu.org; Sun, 12 Aug 2018 15:09:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sun, 12 Aug 2018 19:09:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 20623 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 20623-submit@debbugs.gnu.org id=B20623.15341008877474 (code B ref 20623); Sun, 12 Aug 2018 19:09:02 +0000 Original-Received: (at 20623) by debbugs.gnu.org; 12 Aug 2018 19:08:07 +0000 Original-Received: from localhost ([127.0.0.1]:49021 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1fovih-0001wT-7a for submit@debbugs.gnu.org; Sun, 12 Aug 2018 15:08:07 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:45183) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1fovif-0001vz-Gi for 20623@debbugs.gnu.org; Sun, 12 Aug 2018 15:08:05 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1foviZ-0000RY-Fw for 20623@debbugs.gnu.org; Sun, 12 Aug 2018 15:08:00 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:39047) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1foviT-0000QE-Hk; Sun, 12 Aug 2018 15:07:53 -0400 Original-Received: from [176.228.60.248] (port=4609 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1foviS-0002VT-NF; Sun, 12 Aug 2018 15:07:53 -0400 In-reply-to: (message from Stefan Monnier on Sat, 11 Aug 2018 20:04:05 -0400) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:149470 Archived-At: > From: Stefan Monnier > Cc: rgm@gnu.org, a.s@realize.ch, 20623@debbugs.gnu.org, sledergerber@gmx.net > Date: Sat, 11 Aug 2018 20:04:05 -0400 > > You say that the code I wrote is not needed to make sure an existing > latin-1-mac setting isn't overwritten by a latin-1 guess. I expect this > is indeed true (otherwise I think we'd have had bug-reports about it), > but I don't know where that is handled. It is handled inside select-safe-coding-system, which first invokes find-auto-coding to decide which encoding is appropriate (and as part of that, looks at XML or HTML charset information declared by the text), and then, if the encoding it got doesn't specify the EOL conversion, it uses the EOL conversion from the buffer's encoding or from the appropriate defaults. Since XML/HTML charset tags never specify the EOL conversion, it follows that Emacs will never override the EOL conversion of the buffer, it will only use the charset for "text conversion". I hope this answers your question.