From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: character sets as they relate to =?utf-8?B?4oCcUmF34oCd?= string literals for elisp Date: Mon, 04 Oct 2021 21:57:02 +0300 Message-ID: <8335pg62kh.fsf@gnu.org> References: <4209edd83cfee7c84b2d75ebfcd38784fa21b23c.camel@crossproduct.net> <87v92ft9z6.fsf@db48x.net> <87o885tyle.fsf@db48x.net> <83k0it6lu5.fsf@gnu.org> <87k0isu7hz.fsf_-_@db48x.net> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="34528"; mail-complaints-to="usenet@ciao.gmane.io" Cc: anna@crossproduct.net, rms@gnu.org, emacs-devel@gnu.org To: Daniel Brooks Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Mon Oct 04 20:58:48 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mXTAm-0008m3-QL for ged-emacs-devel@m.gmane-mx.org; Mon, 04 Oct 2021 20:58:48 +0200 Original-Received: from localhost ([::1]:47718 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mXTAl-0005Hi-Ia for ged-emacs-devel@m.gmane-mx.org; Mon, 04 Oct 2021 14:58:47 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:58502) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mXT9O-00049a-OK for emacs-devel@gnu.org; Mon, 04 Oct 2021 14:57:23 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:33248) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mXT9N-00073C-6r; Mon, 04 Oct 2021 14:57:21 -0400 Original-Received: from 84.94.185.95.cable.012.net.il ([84.94.185.95]:4505 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mXT9G-00065j-QR; Mon, 04 Oct 2021 14:57:15 -0400 In-Reply-To: <87k0isu7hz.fsf_-_@db48x.net> (message from Daniel Brooks on Mon, 04 Oct 2021 08:36:40 -0700) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:276245 Archived-At: > From: Daniel Brooks > Cc: emacs-devel@gnu.org, rms@gnu.org, anna@crossproduct.net > Date: Mon, 04 Oct 2021 08:36:40 -0700 > > Eli Zaretskii writes: > > > We can only do this much. We don't develop any terminal emulators > > here, except the two built into Emacs. > > I was referring broadly to the whole GNU project, not trying to assign > the work specifically to the Emacs project. :) Then this is not necessarily the best place to raise these issues. > Suppose our hypothetical contributor wanted to contribute a new mode > with this type of code in it: > > (defun 日本 () (message "日本")) > > That is, all of the identifiers in the source code for this mode are > named in some horrible foreign script that you cannot read. Is it so > much more unreadable if it sometimes has to be displayed like this? > > (defun \u65E5\u672C () (message "\u65E5\u672C")) > > More to the point, do we turn away this contributor or ask them to > rewrite their code? My preference is that we simply accept the > contribution as–is. It would be very inconvenient to have such code. We have that where it's inevitable (like in some packages that define features specific to some languages), but even there we prefer to use the likes of \u672c instead of the literal characters. At the very least, that avoids the problem with not having a suitable font to display them. > If we could see our way to accepting such code, then I don’t see why we > couldn’t accept code that uses Unicode in much smaller ways, such as > this: > > (defvar variable-containing-html #r「click here」) If we avoid non-ASCII characters, we avoid some problems, so all else being equal, it's better. > PS: it occurs to me to wonder if my use of Unicode in the prose of this > message, outside of the examples, detracted from its readability in any > way? If someone is reading this on a text-mode terminal, it could.