From mboxrd@z Thu Jan 1 00:00:00 1970
From: Kenichi Handa
Newsgroups: gmane.emacs.bugs
Subject: Re: Broken charset=utf-16be articles with Gnus and Emacs 21.3
Date: Mon, 31 Mar 2003 10:51:20 +0900 (JST)
Message-ID: <200303310151.KAA14926@etlken.m17n.org>
In-reply-to: (message from Reiner Steib on Sat, 29 Mar 2003 12:11:11 +0100)
User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/21.2.92 (sparc-sun-solaris2.6) MULE/5.0 (SAKAKI)
Original-To: reiner.steib@gmx.de
Original-cc: bugs@gnus.org
Original-cc: bug-gnu-emacs@gnu.org
Original-cc: mtr-dev0@gmx.de
Cc: overlord@gmx.li
Xref: main.gmane.org gmane.emacs.bugs:4671

In article , Reiner Steib <4.uce.03.r.s@nurfuerspam.de> writes:

> Actions to reproduce the bug:
> - Start Gnus `M-x gnus RET' and compose an article `a'.
> - Switch to TeX input method: `M-x set-input-method RET TeX RET'
> - Enter some characters:
>   \sigma ä (a with diaeresis) \omega
>   (Note: \sigma and \omega (without ä) is not sufficient.)
>   .. *or* ...
>   \sigma \omega \alpha \o \int
> - Preview `M-m P' (or `C-u M-m P') or send the message.

> The preview (and the outgoing message) will be encoded with
> "Content-Type: text/plain; charset=utf-16be" and it will not be
> readable with Gnus and most other MUAs or newsreaders.

> I've been told (-> Simon Krahnke) that the result isn't even correct
> UTF-16.

Oops, I've just found that Emacs' coding systems utf-16-le and
utf-16-be produce a BOM (Byte Order Mark), which is a bug according
to their specifications. I've just installed a fix.

> Expected behavior:
> - The article should be encoded with
> "Content-Type: text/plain; charset=utf-8".

I don't know why GNUS prefers utf-16-X to utf-8.
At least, sort-coding-systems prefers utf-8.

---
Ken'ichi HANDA
handa@m17n.org
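
For reference, the BOM rule mentioned above can be checked outside
Emacs. The following is only a minimal sketch in Python, not Emacs
Lisp, and it assumes that Python's standard codecs follow the same
convention: the explicit-order names "utf-16-be" and "utf-16-le" write
no BOM, while the generic "utf-16" does. The sample string (the
characters from the report) and the helper has_bom are illustrative
and not part of the original report.

```python
import codecs

# Sample text from the report: sigma, a-with-diaeresis, omega.
text = "\u03c3\u00e4\u03c9"

# Explicit byte order: the name already fixes the order, so no BOM is written.
be = text.encode("utf-16-be")
le = text.encode("utf-16-le")

# Generic "utf-16": the byte order is not implied by the name,
# so the codec prepends a BOM to announce it.
generic = text.encode("utf-16")


def has_bom(data: bytes) -> bool:
    """Return True if data starts with a UTF-16 byte order mark (hypothetical helper)."""
    return data.startswith((codecs.BOM_UTF16_BE, codecs.BOM_UTF16_LE))


print("utf-16-be:", be.hex(), "BOM:", has_bom(be))
print("utf-16-le:", le.hex(), "BOM:", has_bom(le))
print("utf-16:   ", generic.hex(), "BOM:", has_bom(generic))
```

A message labelled charset=utf-16be whose body nevertheless starts with
the bytes FE FF would therefore carry an extra U+FEFF character at the
front, which may be what the "isn't even correct UTF-16" remark refers
to.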