From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#50983: 28.0.50; [REGRESSION, BUG] Display bugs with uncommon characters Date: Sun, 03 Oct 2021 15:02:20 +0300 Message-ID: <83mtnq8gfn.fsf@gnu.org> References: <83lf3aac5w.fsf@gnu.org> <835yuea0gb.fsf@gnu.org> <8335pi9ydi.fsf@gnu.org> Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="19898"; mail-complaints-to="usenet@ciao.gmane.io" Cc: 50983@debbugs.gnu.org, rudiwillalwaysloveyou@gmail.com To: Alan Third Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Sun Oct 03 14:03:12 2021 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mX0D2-0004zT-Ei for geb-bug-gnu-emacs@m.gmane-mx.org; Sun, 03 Oct 2021 14:03:12 +0200 Original-Received: from localhost ([::1]:44126 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mX0D0-0004xv-Fk for geb-bug-gnu-emacs@m.gmane-mx.org; Sun, 03 Oct 2021 08:03:10 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:36652) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mX0Cs-0004xk-IW for bug-gnu-emacs@gnu.org; Sun, 03 Oct 2021 08:03:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:49737) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mX0Cs-0005jL-As for bug-gnu-emacs@gnu.org; Sun, 03 Oct 2021 08:03:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1mX0Cs-0005G3-8Q for bug-gnu-emacs@gnu.org; Sun, 03 Oct 2021 08:03:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sun, 03 Oct 2021 12:03:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 50983 X-GNU-PR-Package: emacs Original-Received: via spool by 50983-submit@debbugs.gnu.org id=B50983.163326256420183 (code B ref 50983); Sun, 03 Oct 2021 12:03:02 +0000 Original-Received: (at 50983) by debbugs.gnu.org; 3 Oct 2021 12:02:44 +0000 Original-Received: from localhost ([127.0.0.1]:33050 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mX0CZ-0005FS-Ns for submit@debbugs.gnu.org; Sun, 03 Oct 2021 08:02:43 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:59718) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mX0CX-0005FG-9k for 50983@debbugs.gnu.org; Sun, 03 Oct 2021 08:02:42 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:46562) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mX0CR-0005Se-0M; Sun, 03 Oct 2021 08:02:35 -0400 Original-Received: from 84.94.185.95.cable.012.net.il ([84.94.185.95]:1792 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mX0CQ-0007nm-K1; Sun, 03 Oct 2021 08:02:34 -0400 In-Reply-To: (message from Alan Third on Sun, 3 Oct 2021 12:26:22 +0100) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:216259 Archived-At: > Date: Sun, 3 Oct 2021 12:26:22 +0100 > From: Alan Third > Cc: rudiwillalwaysloveyou@gmail.com, 50983@debbugs.gnu.org > > > And what do you mean by "underscore character"? What is its Unicode > > codepoint? > > In the screenshot (and in my own iTerm2 session) there is an > underscore character after "note-". I think it's inserted by the > terminal as a placeholder for something it doesn't understand. No, it's a special face we use to display some characters that may look like ASCII, but aren't. See nobreak-char-display. > In GUI Emacs that position in the file has a zero width character. > > If I do describe-char on the underscore it says it's a plain ascii > "o", which is clearly incorrect. In GUI it says it's 8203 (0x200B), > "ZERO WIDTH SPACE", and as I said it displays as a zero width space. Can you show the output of "C-x =" on all the characters, one by one, starting from "n" in "note" and ending with "t" in "taking" after it? Are they all incorrect, i.e. do not correspond to the place the cursor is on? That is, does the corruption start around there or does it start much earlier (and if the latter, where does it start)? > I think I agree with your other email that it's down to the terminal > doing something strange with characters it doesn't understand. If this is the case, the only way to fix the display is to use us-ascii as terminal encoding. Or maybe set up the terminal for a "simpler" encoding, like latin-1, and then set up Emacs to that using set-terminal-coding-system.