From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: undecided vs utf-8 Date: Fri, 05 Nov 2010 09:56:58 +0200 Message-ID: <83wros4305.fsf@gnu.org> References: Reply-To: Eli Zaretskii NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: QUOTED-PRINTABLE X-Trace: dough.gmane.org 1288943863 16165 80.91.229.12 (5 Nov 2010 07:57:43 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Fri, 5 Nov 2010 07:57:43 +0000 (UTC) Cc: emacs-devel@gnu.org To: Lars Magne Ingebrigtsen Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Nov 05 08:57:38 2010 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PEHB4-0008T8-0B for ged-emacs-devel@m.gmane.org; Fri, 05 Nov 2010 08:57:38 +0100 Original-Received: from localhost ([127.0.0.1]:45099 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PEHB2-0002c7-1w for ged-emacs-devel@m.gmane.org; Fri, 05 Nov 2010 03:57:36 -0400 Original-Received: from [140.186.70.92] (port=48838 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PEHAs-0002bP-HX for emacs-devel@gnu.org; Fri, 05 Nov 2010 03:57:31 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PEHAj-0006uT-Jb for emacs-devel@gnu.org; Fri, 05 Nov 2010 03:57:26 -0400 Original-Received: from mtaout20.012.net.il ([80.179.55.166]:56883) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PEHAj-0006th-Bz for emacs-devel@gnu.org; Fri, 05 Nov 2010 03:57:17 -0400 Original-Received: from conversion-daemon.a-mtaout20.012.net.il by a-mtaout20.012.net.il (HyperSendmail v2007.08) id <0LBE00900J0DK100@a-mtaout20.012.net.il> for emacs-devel@gnu.org; Fri, 05 Nov 2010 09:56:56 +0200 (IST) Original-Received: from HOME-C4E4A596F7 ([77.126.249.126]) by a-mtaout20.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0LBE008VJJESHHD0@a-mtaout20.012.net.il>; Fri, 05 Nov 2010 09:56:54 +0200 (IST) In-reply-to: X-012-Sender: halo1@inter.net.il X-detected-operating-system: by eggs.gnu.org: Solaris 10 (beta) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:132389 Archived-At: > From: Lars Magne Ingebrigtsen > Date: Thu, 04 Nov 2010 23:27:57 +0100 >=20 > When using erc, it decodes iso-8859-1 fine with the default `undeci= ded' > into encoding. However, any utf-8 strings are, sort of, just trans= lated > into the same coding system: >=20 > (decode-coding-string "u-te-\303\246ff \303\245tte" 'undecided) > =3D> "u-te-=C3=A6ff =C3=A5tte" >=20 > (decode-coding-string "u-te-\303\246ff \303\245tte" 'utf-8) > =3D> "u-te-=E6ff =E5tte" Please show the output of "M-x mule-diag RET" on the machine where this happens. > Shouldn't `decode-coding-string' prefer utf-8 if it's actually > valid? Depending on the user's locale and preferences, this could easily backfire, especially if the text is insufficiently long to distinguis= h between the two. Using incorrect decoder in a small fraction of cases is a fact of life; every program out there hits this from time to time. What we need is good defaults, and ways to customize those in specific situations. In this case, perhaps erc should use its own defaults, i= f UTF-8 is widely (or solely) used there.