From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: character sets as they relate to =?utf-8?B?4oCcUmF34oCd?= string literals for elisp Date: Fri, 08 Oct 2021 09:53:54 +0300 Message-ID: <838rz4ypkt.fsf@gnu.org> References: <4209edd83cfee7c84b2d75ebfcd38784fa21b23c.camel@crossproduct.net> <87v92ft9z6.fsf@db48x.net> <87o885tyle.fsf@db48x.net> <83k0it6lu5.fsf@gnu.org> <87k0isu7hz.fsf_-_@db48x.net> <87a6jotszy.fsf@db48x.net> <877der8smr.fsf@mail.linkov.net> <83y2772y0s.fsf@gnu.org> <83sfxd1g05.fsf@gnu.org> <83tuhtyn46.fsf@gnu.org> <83o880zuxs.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="10789"; mail-complaints-to="usenet@ciao.gmane.io" Cc: rms@gnu.org, yuri.v.khan@gmail.com, juri@linkov.net, db48x@db48x.net, monnier@iro.umontreal.ca, emacs-devel@gnu.org To: Stefan Kangas Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Fri Oct 08 08:55:56 2021 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mYjnN-0002a2-Vg for ged-emacs-devel@m.gmane-mx.org; Fri, 08 Oct 2021 08:55:53 +0200 Original-Received: from localhost ([::1]:34340 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mYjnM-0001dV-KH for ged-emacs-devel@m.gmane-mx.org; Fri, 08 Oct 2021 02:55:52 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:41324) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mYjlw-0000w3-Ol for emacs-devel@gnu.org; Fri, 08 Oct 2021 02:54:24 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:59150) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mYjlu-0006pM-0P; Fri, 08 Oct 2021 02:54:22 -0400 Original-Received: from 84.94.185.95.cable.012.net.il ([84.94.185.95]:2952 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mYjln-0004JC-95; Fri, 08 Oct 2021 02:54:15 -0400 In-Reply-To: (message from Stefan Kangas on Thu, 7 Oct 2021 20:37:19 -0400) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:276539 Archived-At: > From: Stefan Kangas > Date: Thu, 7 Oct 2021 20:37:19 -0400 > Cc: rms@gnu.org, db48x@db48x.net, yuri.v.khan@gmail.com, emacs-devel@gnu.org, > monnier@iro.umontreal.ca, juri@linkov.net > > >> What I mean is that I think it would be better if our manuals displayed > >> em dash (written as "---") as they are displayed in the texinfo manual: > >> "--" (HYPHEN-MINUS, HYPHEN-MINUS), instead of as "—" (EM DASH). I find > >> the former way to display this character easier to read in the monospace > >> fonts that we typically use. > > > > Others disagreed at the time, and so we decided quite some time ago to > > use @documentencoding UTF-8 in all our manuals. (It was not only > > about the dashes; UTF-8 encoding causes quite a lot of other Unicode > > characters to be output by makeinfo.) I see no reason to reverse that > > decision (and start all those arguments all over again). > > I also see no reason to reverse that decision, if the particular case of > how em dash is displayed was already considered in detail as part of > that discussion. > > If that case was not considered in detail, perhaps we could discuss it > now. I'd rather not start another discussion of this, as opinions tend to be polarized about it, and IME nothing can bridge over the differences of opinions in this matter. So I prefer a different way of handling this, see below. > I would hope that we could agree that how em dash is displayed is > not necessarily strictly connected to "@documentencoding UTF-8"; and > that it would be useful to continue using UTF-8 encoding, but also get > the "old" way of displaying em dash. Many people want to use and see Unicode punctuation characters in human-readable text. You can see that clearly in the Emacs mailing lists: people use Unicode quotes “..”, dashes, Emoji, and other special characters. Since Info is largely such a human-readable text, those people want to see the same there. I don't see any way of convincing them to change their views, nor do I think we should try. > Maybe that would require us to use an existing option in texinfo, or > maybe this would need the texinfo developers to provide a new option > that could support it. Even if such an option existed, it would still beg the question: how to produce the Info manuals we provide as part of the Emacs release tarballs? The downside of any decision in this matter is that it is imposed on everyone, no matter what their views on this. So I'd prefer to deal with this differently: introduce a new (buffer-local) minor mode, which will install a display-table, whereby "problematic" Unicode characters will be displayed as their ASCII equivalents or equivalent ASCII strings. We already set that up automatically on terminals that are incapable of displaying those characters, but nothing precludes us from having such a feature on demand for capable displays as well. Then users who don't want the effects of these characters on display could activate such a mode, and solve their problems without affecting the actual contents of the Info files.