From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Stephen J. Turnbull" Newsgroups: gmane.emacs.bugs Subject: bug#20385: missing charset for non-ASCII text/x-patch MIME parts in Thunderbird Date: Thu, 14 May 2015 17:28:43 +0900 Message-ID: <87k2wb1s1w.fsf__26608.329304918$1431592164$gmane$org@uwakimon.sk.tsukuba.ac.jp> References: <1429555155-4695-1-git-send-email-eggert@cs.ucla.edu> <5552FDAC.4080004@cs.ucla.edu> <55534080.6010400@yandex.ru> <555369FD.30701@cs.ucla.edu> <5553D12F.7000809@yandex.ru> <87h9rfiqh1.fsf_-_@violet.siamics.net> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1431592164 28613 80.91.229.3 (14 May 2015 08:29:24 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 14 May 2015 08:29:24 +0000 (UTC) Cc: 20385@debbugs.gnu.org, Paul Eggert , emacs-devel@gnu.org, Dmitry Gutov To: Ivan Shmakov Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu May 14 10:29:12 2015 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YsoVy-0006Dx-4r for geb-bug-gnu-emacs@m.gmane.org; Thu, 14 May 2015 10:29:10 +0200 Original-Received: from localhost ([::1]:53094 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YsoVx-0002b2-Cz for geb-bug-gnu-emacs@m.gmane.org; Thu, 14 May 2015 04:29:09 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:47367) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YsoVs-0002Xl-5K for bug-gnu-emacs@gnu.org; Thu, 14 May 2015 04:29:04 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YsoVq-0000lb-RJ for bug-gnu-emacs@gnu.org; Thu, 14 May 2015 04:29:04 -0400 Original-Received: from debbugs.gnu.org ([140.186.70.43]:34426) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YsoVq-0000l6-Hp for bug-gnu-emacs@gnu.org; Thu, 14 May 2015 04:29:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.80) (envelope-from ) id 1YsoVq-0000R9-0q for bug-gnu-emacs@gnu.org; Thu, 14 May 2015 04:29:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: "Stephen J. Turnbull" Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 14 May 2015 08:29:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 20385 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: patch Original-Received: via spool by 20385-submit@debbugs.gnu.org id=B20385.14315921301649 (code B ref 20385); Thu, 14 May 2015 08:29:01 +0000 Original-Received: (at 20385) by debbugs.gnu.org; 14 May 2015 08:28:50 +0000 Original-Received: from localhost ([127.0.0.1]:44401 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YsoVd-0000QT-PZ for submit@debbugs.gnu.org; Thu, 14 May 2015 04:28:50 -0400 Original-Received: from shako.sk.tsukuba.ac.jp ([130.158.97.161]:33490) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YsoVa-0000QC-G4 for 20385@debbugs.gnu.org; Thu, 14 May 2015 04:28:47 -0400 Original-Received: from uwakimon.sk.tsukuba.ac.jp (uwakimon.sk.tsukuba.ac.jp [130.158.99.156]) (using TLSv1.2 with cipher DHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by shako.sk.tsukuba.ac.jp (Postfix) with ESMTPS id B2E4B1C3944; Thu, 14 May 2015 17:28:43 +0900 (JST) Original-Received: by uwakimon.sk.tsukuba.ac.jp (Postfix, from userid 1000) id 99D641A3398; Thu, 14 May 2015 17:28:43 +0900 (JST) In-Reply-To: <87h9rfiqh1.fsf_-_@violet.siamics.net> X-Mailer: VM undefined under 21.5 (beta34) "kale" 83e5c3cd6be6 XEmacs Lucid (x86_64-unknown-linux) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:102765 Archived-At: Ivan Shmakov writes: > As I=E2=80=99ve pointed earlier [1], Thunderbird (on the /sending/ side) > for some reason chooses /not/ to file the =E2=80=98charset=E2=80=99 File a bug on Thunderbird, then. Absence of a charset parameter means charset=3DUS-ASCII, and Thunderbird should not be emitting US-ASCII MIME parts with non-ASCII characters present. Not even if the MTAs agree to use SMTP8. > In the absence of the explicitly-stated encoding, the > receiving side may resort to guessing, A conformant receiver SHOULD NOT guess, unless the user has given it explicit permission to do that (of course, then anything is OK). From RFC 2046: 4.1.2. Charset Parameter A critical parameter that may be specified in the Content-Type field for "text/plain" data is the character set. This is specified with a "charset" parameter, as in: Content-type: text/plain; charset=3Diso-8859-1 Unlike some other parameter values, the values of the charset parameter are NOT case sensitive. The default character set, which must be assumed in the absence of a charset parameter, is US-ASCII. Note that technically speaking the MUST in this section only applies to text/plain, and not to any other text content-type. However, given that the section says The specification for any future subtypes of "text" must specify whether or not they will also utilize a "charset" parameter, and may possibly restrict its values as well. For other subtypes of "text" than "text/plain", the semantics of the "charset" parameter should be defined to be identical to those specified here for "text/plain", i.e., the body consists entirely of characters in the given charset. Pretty clearly the intent is that the behavior of text/plain is to be the default for other text content-types, unless *explicitly* stated in the content-type spec. See also section 4.1.4. Unrecognized Subtypes Unrecognized subtypes of "text" should be treated as subtype "plain" as long as the MIME implementation knows how to handle the charset. This only makes sense when charset is unspecified if charset is assumed to be US-ASCII. > I presume this issue (the one of /not/ including the =E2=80=98charset= =E2=80=99) > is specific to Thunderbird. As an example, please look at a > fragment of the original patch thus MIMEd from Gnus. File a bug on Gnus, too. :-) Of course Emacs should do what its user asks, but the default should be to assume US-ASCII if there is no charset parameter, and to bitch (not guess) if non-ASCII octets are seen.