From: Alan Mackenzie
Newsgroups: gmane.emacs.devel
Subject: Re: character sets as they relate to “Raw” string literals for elisp
Date: Sat, 9 Oct 2021 15:07:28 +0000
References: <838rz4ypkt.fsf@gnu.org> <83k0inxvjf.fsf@gnu.org> <83ily7xsjw.fsf@gnu.org> <83ee8uyb3w.fsf@gnu.org> <83tuhqwh87.fsf@gnu.org> <83r1cuwd89.fsf@gnu.org>
To: Eli Zaretskii
Cc: emacs-devel@gnu.org, rms@gnu.org, yuri.v.khan@gmail.com, db48x@db48x.net, stefankangas@gmail.com, juri@linkov.net, monnier@iro.umontreal.ca

Hello, Eli.

On Sat, Oct 09, 2021 at 16:15:50 +0300, Eli Zaretskii wrote:
> > Date: Sat, 9 Oct 2021 13:08:35 +0000
> > Cc: emacs-devel@gnu.org, rms@gnu.org, juri@linkov.net, db48x@db48x.net,
> >   stefankangas@gmail.com, yuri.v.khan@gmail.com, monnier@iro.umontreal.ca
> > From: Alan Mackenzie

> > > > --- (EM DASH) appears as an inverse question mark on my screen.  So do
> > > > several other punctuation marks, I think.

> > > That means the display-time replacement doesn't happen, I think.  What
> > > is your terminal-coding-system?

> > M-: default-terminal-coding-system says utf-8-unix.  I haven't set this
> > in my site-start.el or .emacs.

> That explains it, I think: info.el thinks your console can display
> those characters.

I vaguely remember looking into this before.  I think the problem was
that there was no facility in Linux for determining whether a particular
character could be displayed on its console.  The low level interface
simply wasn't there.

> > I have my font set to Latin-1.  More precisely,
> >   consolefont="lat1-16"
> > in my /etc/conf.d/consolefont.

> Are there any other possible values that will cause these characters
> display correctly?  Or is the Linux console unable to display them no
> matter what?

The Linux console is limited to 256 glyphs, some of which are used by two
or several Unicode characters.  It would be possible but time consuming
to amend the font to display the EM-DASH as, say, a minus sign.

> Regardless, we could have a customizable option in info.el to force
> display of the Unicode punctuation as their ASCII equivalents, even if
> the terminal seems capable of the Unicode display.

Again, this would only solve half of the problem, the display half, but
might be worthwhile.  I would prefer, though, to find a way of
preventing these awkward Unicode punctuation characters from being in
info buffers in the first place.

-- 
Alan Mackenzie (Nuremberg, Germany).
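
For illustration only, the display half discussed above could be
approximated from a user's init file with a buffer-local display table.
This is a minimal sketch, not the option proposed for info.el: the names
my-info-ascii-punctuation and my-info-force-ascii-punctuation are
hypothetical, and the ASCII replacements chosen are assumptions.  It
changes only what is shown on screen; the buffer text keeps the Unicode
characters, so searching and copying are unaffected.

```elisp
;; Hypothetical user-level workaround, not part of info.el.
;; The character-to-ASCII choices below are assumptions.
(defvar my-info-ascii-punctuation
  '((?\u2014 . "--")    ; EM DASH
    (?\u2013 . "-")     ; EN DASH
    (?\u2018 . "`")     ; LEFT SINGLE QUOTATION MARK
    (?\u2019 . "'")     ; RIGHT SINGLE QUOTATION MARK
    (?\u201C . "\"")    ; LEFT DOUBLE QUOTATION MARK
    (?\u201D . "\""))   ; RIGHT DOUBLE QUOTATION MARK
  "Alist mapping Unicode punctuation to ASCII display strings.")

(defun my-info-force-ascii-punctuation ()
  "Display the characters in `my-info-ascii-punctuation' as ASCII.
Only the display changes; the buffer text still holds the originals."
  ;; Reuse the buffer's display table if it has one, else make a new one.
  (let ((table (or buffer-display-table (make-display-table))))
    (dolist (entry my-info-ascii-punctuation)
      ;; A display-table entry is a vector of glyphs; plain
      ;; characters serve as glyphs here.
      (aset table (car entry) (vconcat (cdr entry))))
    (setq buffer-display-table table)))

;; Apply the replacements in every Info buffer.
(add-hook 'Info-mode-hook #'my-info-force-ascii-punctuation)
```

Because the replacement is applied unconditionally, it does not depend on
whether Emacs believes the console can display these characters.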