From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Chad Brown Newsgroups: gmane.emacs.devel Subject: Re: [Emacs-diffs] master db828f6: Don't rely on defaults in decoding UTF-8 encoded Lisp files Date: Sun, 27 Sep 2015 12:52:15 -0700 Message-ID: <773FFA8D-BA87-406D-A6CC-8DC8BFA0951C@gmail.com> References: <20150921165211.20434.28114@vcs.savannah.gnu.org> <83fv27mt7r.fsf@gnu.org> <83wpvfix7i.fsf@gnu.org> <83fv23hr0z.fsf@gnu.org> <5605CB6B.4000102@cs.ucla.edu> <83twqhhf0g.fsf@gnu.org> <5606AC48.7090801@cs.ucla.edu> <83zj09fbzp.fsf@gnu.org> <5606C140.6090309@cs.ucla.edu> <878u7trwlb.fsf@fencepost.gnu.org> <5606E995.2000102@cs.ucla.edu> <83si61ezxd.fsf@gnu.org> <2FCA0DAD-831B-4B0A-B948-9F31DE52B6F0@gmail.com> <83mvw8fh8g.fsf@gnu.org> <60126F7B-7C7B-41F9-BF0E-F5FE27374F9D@gmail.com> <83zj07emd8.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2104\)) Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1443383553 19465 80.91.229.3 (27 Sep 2015 19:52:33 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 27 Sep 2015 19:52:33 +0000 (UTC) Cc: emacs-devel@gnu.org To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Sep 27 21:52:29 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1ZgHzo-0000GW-TV for ged-emacs-devel@m.gmane.org; Sun, 27 Sep 2015 21:52:29 +0200 Original-Received: from localhost ([::1]:58543 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZgHzo-0003l4-Bz for ged-emacs-devel@m.gmane.org; Sun, 27 Sep 2015 15:52:28 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:52135) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZgHzl-0003kv-Fx for emacs-devel@gnu.org; Sun, 27 Sep 2015 15:52:26 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZgHzk-0002ab-FS for emacs-devel@gnu.org; Sun, 27 Sep 2015 15:52:25 -0400 Original-Received: from mail-pa0-x22d.google.com ([2607:f8b0:400e:c03::22d]:36556) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZgHze-0002Zc-RE; Sun, 27 Sep 2015 15:52:18 -0400 Original-Received: by pablk4 with SMTP id lk4so56269868pab.3; Sun, 27 Sep 2015 12:52:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=content-type:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=pF2GR3L5pqg3fEswCkfLsXlFfL1J4tGLjNQaE6cIerk=; b=yBrz5ofJOiFjo2G4NF4E8VHYAxP0qBQQIZZtDNQR8x6X++dWw2OtEmxTZx+XQTHpTe PMkPYeJyZlj933cQxwNLnx7qzr3JVt9hj0wH3BT4/cKL/DA37+0xqkAqssWzo7kqmml5 OaGVgEr1rS10y8dNgLGgnLwqC39e0lcrgwpkp6+289Kr4p7Z5Sh7MgJAxptS7fl1lqts piMUmDeeIaYrUzje1x/6PlrIbVI0+ubaNFm0XWaWQ9Eb7X4DIg8/me/BkP2pxE+2LJ9y dC2wrY5sGHBI8YbtlzsGvYCHdOOhTwk9yj2NqpJ8aM11eMPu0xcL78kJjcZMtiW416Jb MOfA== X-Received: by 10.66.193.132 with SMTP id ho4mr22266383pac.154.1443383538020; Sun, 27 Sep 2015 12:52:18 -0700 (PDT) Original-Received: from [10.0.1.31] (174-21-111-247.tukw.qwest.net. [174.21.111.247]) by smtp.gmail.com with ESMTPSA id u3sm15088084pbs.33.2015.09.27.12.52.16 (version=TLS1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Sun, 27 Sep 2015 12:52:16 -0700 (PDT) In-Reply-To: <83zj07emd8.fsf@gnu.org> X-Mailer: Apple Mail (2.2104) X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2607:f8b0:400e:c03::22d X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:190415 Archived-At: > On 27 Sep 2015, at 11:41, Eli Zaretskii wrote: >=20 >> From: Chad Brown >> Date: Sun, 27 Sep 2015 09:03:54 -0700 >> Cc: emacs-devel@gnu.org >>=20 >> -finput-charset=3Dcharset >> Set the input character set, used for translation from the character >> set of the input file to the source character set used by GCC. If >> the locale does not specify, or GCC cannot get this information >> from the locale, the default is UTF-8. This can be overridden by >> either the locale or this command line option. Currently the command >> line option takes precedence if there's a conflict. charset can be >> any encoding supported by the system's iconv library routine. >=20 > Note the "if the locale does not specify" clause. That should almost > never happen. Sure. I almost mentioned that, but at the time it seemed clear to me that we were talking about the defaults for each. I used to deal with this issue =E2=80=98back in the day=E2=80=99, so it provoked = my curiosity=20 enough to look. Roughly speaking, the modern =E2=80=98programming languages=E2=80=99 these days are UTF-8, while a decent chunk of the=20 =E2=80=98scripting languages=E2=80=99 seem to be in a messier state, but = with=20 established methods (coding cookies, odd quoting, ascii by fiat,=20 try not to look at comments, etc). Since then, exchanges on this thread have suggested that maybe I was wrong about the topic at hand, but the data still seemed useful, so I pushed it along, with the full quote for context. Sorry if it = caused confusion. Thanks, ~Chad