From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Lars Ingebrigtsen Newsgroups: gmane.emacs.bugs Subject: bug#38191: incorrect text properties in result of `format' with multibyte(?) characters Date: Thu, 14 Nov 2019 06:30:30 +0100 Message-ID: <87k183rpmx.fsf@gnus.org> References: Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="44687"; mail-complaints-to="usenet@blaine.gmane.org" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux) Cc: 38191@debbugs.gnu.org To: Paul Pogonyshev Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu Nov 14 06:31:21 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1iV7iy-000BSp-2M for geb-bug-gnu-emacs@m.gmane.org; Thu, 14 Nov 2019 06:31:20 +0100 Original-Received: from localhost ([::1]:53528 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1iV7iw-0006Wi-S7 for geb-bug-gnu-emacs@m.gmane.org; Thu, 14 Nov 2019 00:31:18 -0500 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:37237) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1iV7il-0006WY-78 for bug-gnu-emacs@gnu.org; Thu, 14 Nov 2019 00:31:09 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1iV7ii-0000hQ-N7 for bug-gnu-emacs@gnu.org; Thu, 14 Nov 2019 00:31:06 -0500 Original-Received: from debbugs.gnu.org ([209.51.188.43]:51793) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1iV7if-0000gC-Q3 for bug-gnu-emacs@gnu.org; Thu, 14 Nov 2019 00:31:03 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1iV7if-0008J8-LD for bug-gnu-emacs@gnu.org; Thu, 14 Nov 2019 00:31:01 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Lars Ingebrigtsen Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 14 Nov 2019 05:31:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 38191 X-GNU-PR-Package: emacs Original-Received: via spool by 38191-submit@debbugs.gnu.org id=B38191.157370944531908 (code B ref 38191); Thu, 14 Nov 2019 05:31:01 +0000 Original-Received: (at 38191) by debbugs.gnu.org; 14 Nov 2019 05:30:45 +0000 Original-Received: from localhost ([127.0.0.1]:60614 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iV7iM-0008IU-TI for submit@debbugs.gnu.org; Thu, 14 Nov 2019 00:30:44 -0500 Original-Received: from quimby.gnus.org ([95.216.78.240]:49206) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iV7iK-0008IG-AO for 38191@debbugs.gnu.org; Thu, 14 Nov 2019 00:30:40 -0500 Original-Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=marnie) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1iV7iA-0000d7-RB; Thu, 14 Nov 2019 06:30:33 +0100 In-Reply-To: (Paul Pogonyshev's message of "Wed, 13 Nov 2019 01:31:25 +0100") X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:171528 Archived-At: Paul Pogonyshev writes: > "Multibyte" is a guess, I don't really know the underlying reason. > > Examples: > > (format (propertize "`foo' %s bar" 'face 'bold) "xxx") > =3D> #("`foo' xxx bar" 0 13 (face bold)) > > (format (propertize "=E2=80=98foo=E2=80=99 %s bar" 'face 'bold) "xxx") > =3D> #("=E2=80=98foo=E2=80=99 xxx bar" 0 10 (face bold)) > > Length of the string is the same in both cases. In the first example > the face is correctly applied to the whole string, in the second > example 3 last characters incorrectly lack a face. > > This is a regression, it used to work correctly before, but I don't > know when it became broken. It's always off by the length of the inserted string, so it's at least systematic: (format (propertize "=C3=A7=C3=A7foo %s bar" 'face 'bold) "xxx") =3D> #("=C3=A7=C3=A7foo xxx bar" 0 10 (face bold)) (format (propertize "=C3=A7=C3=A7foo %s bar" 'face 'bold) "xxxx") =3D> #("=C3=A7=C3=A7foo xxxx bar" 0 10 (face bold)) It doesn't happen if there's just one multibyte character in the format spec -- there has to be two or more. --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no