From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Pip Cet Newsgroups: gmane.emacs.devel Subject: Re: [scratch/igc] 985247b6bee crash on Linux, KDE, Wayland Date: Fri, 06 Sep 2024 19:29:28 +0000 Message-ID: <87v7z8eg2y.fsf@protonmail.com> References: <8734mezkgo.fsf@gmail.com> <875xrani8k.fsf@gmail.com> <86bk122azc.fsf@gnu.org> <87v7zagcal.fsf@gmail.com> <867cbp3nw7.fsf@gnu.org> <87v7z9msrl.fsf@gmail.com> <874j6tqxyg.fsf@gmail.com> <87jzfpfhmf.fsf@protonmail.com> <87o751z3zr.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="20955"; mail-complaints-to="usenet@ciao.gmane.io" Cc: Eli Zaretskii , gerd.moellmann@gmail.com, emacs-devel@gnu.org To: Eval EXEC Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Sat Sep 07 07:40:39 2024 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1smoBb-0005Ik-0H for ged-emacs-devel@m.gmane-mx.org; Sat, 07 Sep 2024 07:40:39 +0200 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1smoAw-0003WZ-Sg; Sat, 07 Sep 2024 01:39:58 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1smeeH-0004jd-NJ for emacs-devel@gnu.org; Fri, 06 Sep 2024 15:29:37 -0400 Original-Received: from mail-40131.protonmail.ch ([185.70.40.131]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1smeeF-0003yg-BR for emacs-devel@gnu.org; Fri, 06 Sep 2024 15:29:37 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=protonmail.com; s=protonmail3; t=1725650973; x=1725910173; bh=+XfhF0yF4i9/RLFiFvrf5JPefl8AXAki4PwGei0ZC4E=; h=Date:To:From:Cc:Subject:Message-ID:In-Reply-To:References: Feedback-ID:From:To:Cc:Date:Subject:Reply-To:Feedback-ID: Message-ID:BIMI-Selector; b=Npk+hgQgWCC0ETEaL7oRFbkfONHhR0SpFMZ3LJQMtdVapgA94wfHkGRDdD9GnrvJO Xv2ci/09kuwQT46j15S6Ob3Wvla3jUz+Rt/AJvtHa1mNfb7plUv/6cbB8gu2C2irqy kYNOl23kfjxUjgsO2KA5fT4KHSlkY0KCZGB8dVO+fo2wGGJIePyf4p0IQmUAtch2Y4 31tdf2dr+xf3nkpRAN+iukTHJ74nkwDUgae7Oe+kU48ISTY8epgCg6YCq3V3Gz0kkK 6Wpxs21aHKRNhFhUVTlIjM0ITDCYvv5cSqvULSJMrmouAYMEx3P0ew424KdL+BSBvR Xh3BWcCAb0G4w== In-Reply-To: <87o751z3zr.fsf@gmail.com> Feedback-ID: 112775352:user:proton X-Pm-Message-ID: 36d8d8baa55c2cc4506aa22b3a72b05ba3d4f0c8 Received-SPF: pass client-ip=185.70.40.131; envelope-from=pipcet@protonmail.com; helo=mail-40131.protonmail.ch X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=unavailable autolearn_force=no X-Spam_action: no action X-Mailman-Approved-At: Sat, 07 Sep 2024 01:39:57 -0400 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.devel:323482 Archived-At: "Eval EXEC" writes: > Pip Cet writes: > >> "Eval EXEC" writes: >> >>> Eval EXEC writes: >>> >>>> I recompiled commit 95a30325a84 (HEAD -> scratch/igc, origin/scratch/i= gc) >>>> * src/igc.c (fix_frame): Correct the previous change. >>>> >>>> After testing, I believe the issue has been resolved. >>> >>> scratch/igc 95a30325 crash again: >>> >>> I use latest scratch/igc commit: * 95a30325a84 - (HEAD -> scratch/igc, = origin/scratch/igc) * src/igc.c (fix_frame): Fix last change. (8 hours ago)= >>> >>> build it by: >>> >>> #!/usr/bin/env bash >>> set -ex >>> >>> make extraclean >>> >>> BRANCH_NAME=3D$(git branch --show-current | sed 's/\//_/g') >>> COMMIT_ID=3D$(git rev-parse --short=3D8 HEAD) >>> BUILD_DIR=3D${BRANCH_NAME}-commit-${COMMIT_ID} >>> INSTALL_PREFIX=3D$(realpath ../emacs-build/${BUILD_DIR}) >>> >>> ./autogen.sh >>> ./configure CFLAGS=3D'-g3 -ggdb -O2 -fno-omit-frame-pointer -mtune=3Dna= tive -march=3Dnative' \ >>> --prefix=3D${INSTALL_PREFIX} \ >>> --with-mps=3Dyes \ >>> --with-imagemagick \ >>> --with-modules \ >>> --without-compress-install \ >>> --with-native-compilation --with-mailutils\ >>> --enable-link-time-optimization \ >>> --with-tree-sitter --with-xinput2 \ >>> --with-dbus --with-native-compilation=3Daot \ >>> --with-file-notification=3Dinotify\ >>> && make -j30 install >>> >>> rm ../emacs-build/emacs >>> ln -s ${INSTALL_PREFIX} ../emacs-build/emacs >> >> Thanks, and sorry you're seeing so many crashes. It's stable here, so >> we're going to need your help debugging this :-) > > Thanks for the update. No problem. I'm really happy to debug with you all= . > =E5=85=A8=E4=B8=96=E7=95=8C Emacs =E7=94=A8=E6=88=B7=E8=81=94=E5=90= =88=E8=B5=B7=E6=9D=A5=EF=BC=8C Make Emacs Great Again! > >> Did you use any special options or patches when building mps (link time >> optimization or -O3, in particular)? > > No special options or patches > >> Could you please try (but please >> keep the core file and binary for this crash) rebuilding mps with "-g3 >> -ggdb -O0" to see whether the problem is, maybe, mps rather than Emacs? >> That should also improve the debugging output, so please provide further >> crashes if they do happen with those options. > > OK, I will rebuild mps and emacs with "-g3 -ggdb -O0" now Has the crash happened again with those settings? >>> #9 0x0000000000690c05 in fix_lisp_obj (ss=3Dss@entry=3D0x7ffc7653e6f8,= pobj=3Dpobj@entry=3D0x7f08addd06b8) at /home/exec/Projects/git.savannah.gn= u.org/git/emacs/src/igc.c:975 >> >> This is the car of a cons cell, and it looks like it was freed by a >> previous garbage collection so it's no longer valid. >> >>> off =3D >>> client =3D >>> base =3D 0x7f088a262a50 >> >> Could you open the coredump again and run >> >> (gdb) x/32gx 0x7f088a262a00 > It's: > (gdb) x/32gx 0x7f088a262a00 > 0x7f088a262a00: 0x00000000005a2a06 0x0000000000000000 > 0x7f088a262a10: 0x0000000322e4d00d 0x0000000000004c58 > 0x7f088a262a20: 0x0000000000000000 0x0000000322e4d10d > 0x7f088a262a30: 0x00000000005a2a06 0x0000000000000000 > 0x7f088a262a40: 0x0000000322e4d20d 0x00000000005a2a0a > 0x7f088a262a50: 0x0000000000000000 0x0000000322e4d30d > 0x7f088a262a60: 0x0000000000004c58 0x0000000000000000 > 0x7f088a262a70: 0x0000000322e4d40d 0x00000000005a2a0a > 0x7f088a262a80: 0x0000000000000000 0x0000000322e4d50d > 0x7f088a262a90: 0x00000000005a2a0e 0x0000000000000000 > 0x7f088a262aa0: 0x0000000322e4d60d 0x0000000000004c58 > 0x7f088a262ab0: 0x0000000000000000 0x0000000322e4d70d > 0x7f088a262ac0: 0x00000000005a2a0e 0x0000000000000000 > 0x7f088a262ad0: 0x0000000322e4d80d 0x00000000005a2a12 > 0x7f088a262ae0: 0x0000000000000000 0x0000000322e4d90d > 0x7f088a262af0: 0x0000000000004c58 0x0000000000000000 > > >> so we can get an idea of what was allocated around that time? >> >>> res =3D >>> new_off =3D >>> p =3D 0x7f08addd06b8 >>> word =3D >>> tag =3D >>> _ss =3D 0x7ffc7653e6f8 >>> _mps_zs =3D >>> _mps_ufs =3D 36029896530599944 >>> _mps_wt =3D >>> _mps_w =3D >>> #10 0x00000000006919f8 in fix_cons (cons=3D0x7f08addd06b0, ss=3D0x7ffc7= 653e6f8) at /home/exec/Projects/git.savannah.gnu.org/git/emacs/src/igc.c:17= 51 >> >> This is the cons cell itself (the IGC header is what "cons" points to). >> >> Please also run >> >> (gdb) x/64gx 0x7f08addd0600 > It's > (gdb) x/64gx 0x7f08addd0600 > 0x7f08addd0600: 0x00007f08addd063b 0x00000003df45210d So we can decode those to three interleaved lists reading, in part: (nil font-lock-face (:foreground ...)) (rear-nonsticky t ...) (nil font-lock-face (...)) is a pointer to what looks like the nursery generation, but one which we must have failed to trace (presumably the symbol was either uninterned and freed or interned and moved to an older generation) and which was subsequently reused for cons cells by composite.c Going back to the original report, I notice that it was trying to print an "error in process filter: " message while handling what looks like a (long) sequence of terminal escape codes. Were you using M-x term at the time? Did you notice such error messages? I'll have another look at the process filter/longjmp code, but I suspect we're going to have to wait for further crashes to get to the bottom of this. Thanks Pip