unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
From: Will Bush <will.g.bush@gmail.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: Robert Pluim <rpluim@gmail.com>, 40733@debbugs.gnu.org
Subject: bug#40733: 28.0.50; Emacs locks up on paste (yank) of unicode characters
Date: Mon, 20 Apr 2020 16:27:54 -0500	[thread overview]
Message-ID: <CA+aYz4QFc903jfCJd37a9X+5tsirCj63_xg9VMhwA-vM4Wwq+A@mail.gmail.com> (raw)
In-Reply-To: <83a736gmnu.fsf@gnu.org>


[-- Attachment #1.1.1: Type: text/plain, Size: 4265 bytes --]

>
> Does it require all of them, or is there a specific character in that
> sequence that triggers it?
>

I was able to narrow it down to this character (between the back-ticks): `︵`

Gmail (webapp) is not rendering that for me even if I change the font. In
fact, I'm starting to realize I hate using gmail for writing emails because
it doesn't support code blocks either. I haven't gotten around to trying
out Emacs email clients yet, but I'm starting to wish that I had.

So I looked for a program that can turn a character into its codes and
found unum (https://www.fourmilab.ch/webtools/unum/). I inserted a
screenshot of the output (my terminal renders the character fine) because
gmail is mangling the output. I also attached a text file with the same
content as the screenshot just in case.

[image: Screenshot from 2020-04-20 14-15-29.png]

Any chance of running emacs using gdb so we can see where the CPU
> usage is?
>

Sure. I'm pretty rusty with gdb, but I'll start looking into it tonight and
get back with you. If you could give me some pointers or link me to a tips
on debugging Emacs that would help a lot.

macOS defines?
>

Looks like that comes from here:

https://github.com/NixOS/nixpkgs/blob/3bbd074217cd11b6e14abec24655091b83aacc6f/pkgs/applications/editors/emacs/default.nix#L58

Was added in this commit:
aa2160e1b62bdc6795c465e68301ec8684540b24
Author:     Matthew Bauer <mjbauer95@gmail.com>
AuthorDate: Mon May 28 13:33:08 2018 -0400
Commit:     Matthew Bauer <mjbauer95@gmail.com>
CommitDate: Mon May 28 13:35:10 2018 -0400

Parent:     a87b50bc634 emacs: readd version 25
Contained:  master
Follows:    18.03-beta (10672)
Precedes:   18.09-beta (9925)

emacs26: add some tweaks from jwiegley’s overlay

Interestingly,  jwiegley has since removed it from his repo
https://github.com/jwiegley/nix-config in this commit:

69bb0c3ae6985f09ee2f27cea4621db21fcf0474
Author:     John Wiegley <john@dfinity.org>
AuthorDate: Tue Oct 22 16:40:15 2019 -0700
Commit:     John Wiegley <john@dfinity.org>
CommitDate: Tue Oct 22 16:40:15 2019 -0700

Parent:     711ed41 updates
Contained:  master

updates

I guess OSX users that install Nix as a package manager can also install
Emacs using that same nix expression. I found what this flag does in the
"Targeting different macOS versions" section of the
`emacs/nextstep/INSTALL` file in the emacs repository. Kinda makes me
wonder if this should be reviewed in the nixpkgs repository.

do 'C-u C-x =' on them so we know what font
> is being used, that would help.
>

Oh cool. I didn't know about that command. So in trying to do this I
realized that Emacs did not completely lock up it was just taking a really
long time. It took a long time to display the character, for `C-u C-x =` to
finish`, and for me to finally select the text and `M-w`. Basically any
time I was interacting with that character it caused long delays.

The output below is from `emacs -Q` again in version 28 like before.
Something interesting to note is that does display the vertical left paren
after some time, but in Emacs 26, which had no lag, only displayed
whitespace.

             position: 146 of 157 (92%), column: 0
            character: ︵ (displayed as ︵) (codepoint 65077, #o177065,
#xfe35)
              charset: unicode (Unicode (ISO10646))
code point in charset: 0xFE35
               script: han
               syntax: (︶    which means: open, matches ︶
             category: .:Base, c:Chinese
             to input: type "C-x 8 RET fe35" or "C-x 8 RET PRESENTATION
FORM FOR VERTICAL LEFT PARENTHESIS"
          buffer code: #xEF #xB8 #xB5
            file code: #xEF #xB8 #xB5 (encoded by coding system utf-8-unix)
              display: by this font (glyph code)

ftcrhb:-GNU-Unifont-normal-normal-normal-Sans-Serif-16-*-*-*-c-80-iso10646-1
(#xDD36)

Character code properties: customize what to show
  name: PRESENTATION FORM FOR VERTICAL LEFT PARENTHESIS
  old-name: GLYPH FOR VERTICAL OPENING PARENTHESIS
  general-category: Ps (Punctuation, Open)
  decomposition: (vertical 40) (vertical '(')

There are text properties here:
  fontified            nil
  rear-nonsticky       t

[-- Attachment #1.1.2: Type: text/html, Size: 6292 bytes --]

[-- Attachment #1.2: Screenshot from 2020-04-20 14-15-29.png --]
[-- Type: image/png, Size: 114102 bytes --]

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: unum-output.txt --]
[-- Type: text/plain; charset="US-ASCII"; name="unum-output.txt", Size: 914 bytes --]

λ ~/Downloads/ wget http://www.fourmilab.ch/webtools/unum/download/unum.tar.gz
--2020-04-20 13:44:11--  http://www.fourmilab.ch/webtools/unum/download/unum.tar.gz
Resolving www.fourmilab.ch (www.fourmilab.ch)... 52.28.236.0, 2a05:d014:d43:3101:c6ee:ea42:3836:6cbf
Connecting to www.fourmilab.ch (www.fourmilab.ch)|52.28.236.0|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 1416635 (1.4M) [application/x-gzip]
Saving to: ‘unum.tar.gz’

unum.tar.gz                 100%[========================================>]   1.35M   627KB/s    in 2.2s

2020-04-20 13:44:13 (627 KB/s) - ‘unum.tar.gz’ saved [1416635/1416635]

λ ~/Downloads/ tar -zxvf unum.tar.gz
unum.pl
λ ~/Downloads/ perl -CA unum.pl  ︵
   Octal  Decimal      Hex        HTML    Character   Unicode
 0177065    65077   0xFE35    &#65077;    "︵"         PRESENTATION FORM FOR VERTICAL LEFT PARENTHESIS
λ ~/Downloads/

  reply	other threads:[~2020-04-20 21:27 UTC|newest]

Thread overview: 30+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-04-20 11:05 bug#40733: 28.0.50; Emacs locks up on paste (yank) of unicode characters Will Bush
2020-04-20 15:52 ` Robert Pluim
2020-04-20 16:13   ` Eli Zaretskii
2020-04-20 21:27     ` Will Bush [this message]
2020-04-20 20:20   ` Alan Third
2020-04-20 22:48 ` Basil L. Contovounesios
2020-04-21 10:01   ` Robert Pluim
2020-04-21 12:19     ` Will Bush
2020-04-21 13:19       ` Robert Pluim
2020-04-21 19:35         ` James Cloos
2020-04-22  7:35           ` Robert Pluim
2020-04-25 10:34             ` Will Bush
     [not found]               ` <CA+aYz4RNB1-g5uUz-M-XuJEhZPGpA4X6n8NSiTCUdOMkpReFng@mail.gmail.com>
2020-04-25 13:34                 ` bug#40733: Fwd: " Will Bush
2020-04-25 13:50               ` Eli Zaretskii
2020-04-29 11:59                 ` Will Bush
2020-04-29 12:16                   ` Eli Zaretskii
2020-04-29 12:42                     ` Will Bush
2020-04-29 12:50                       ` Robert Pluim
2020-04-29 14:30                       ` Eli Zaretskii
2020-06-01 11:19                         ` Will Bush
2020-06-01 11:44                           ` Pip Cet
2020-06-01 15:15                             ` Eli Zaretskii
2020-06-01 15:50                               ` Pip Cet
2022-04-24 14:20                           ` Lars Ingebrigtsen
2022-05-18  3:39                             ` Will Bush
2022-05-18 11:18                               ` Eli Zaretskii
2022-06-15 12:40                               ` Lars Ingebrigtsen
2022-06-19 21:05                                 ` Will Bush
2022-06-19 22:25                                   ` Lars Ingebrigtsen
2020-04-21 14:29       ` Eli Zaretskii

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CA+aYz4QFc903jfCJd37a9X+5tsirCj63_xg9VMhwA-vM4Wwq+A@mail.gmail.com \
    --to=will.g.bush@gmail.com \
    --cc=40733@debbugs.gnu.org \
    --cc=eliz@gnu.org \
    --cc=rpluim@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).