From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#68751: 29.1; "\x0e0" is a multibyte string Date: Sat, 27 Jan 2024 10:18:26 +0200 Message-ID: <864jez9okt.fsf@gnu.org> References: <3848edca-6d0b-4875-944d-c6ee193b92db@stegny.2a.pl> <924db1f7-1bee-41cb-bd59-c54a9572b830@stegny.2a.pl> Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="35519"; mail-complaints-to="usenet@ciao.gmane.io" Cc: 68751@debbugs.gnu.org To: Christopher Yeleighton Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Sat Jan 27 09:19:17 2024 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1rTduG-0008z6-NE for geb-bug-gnu-emacs@m.gmane-mx.org; Sat, 27 Jan 2024 09:19:16 +0100 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1rTdtw-0004CX-M5; Sat, 27 Jan 2024 03:18:56 -0500 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1rTdtv-0004At-SS for bug-gnu-emacs@gnu.org; Sat, 27 Jan 2024 03:18:55 -0500 Original-Received: from debbugs.gnu.org ([2001:470:142:5::43]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1rTdtv-0008Nj-Jy for bug-gnu-emacs@gnu.org; Sat, 27 Jan 2024 03:18:55 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1rTdu2-0004k3-Bx for bug-gnu-emacs@gnu.org; Sat, 27 Jan 2024 03:19:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sat, 27 Jan 2024 08:19:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 68751 X-GNU-PR-Package: emacs Original-Received: via spool by 68751-submit@debbugs.gnu.org id=B68751.170634352618203 (code B ref 68751); Sat, 27 Jan 2024 08:19:02 +0000 Original-Received: (at 68751) by debbugs.gnu.org; 27 Jan 2024 08:18:46 +0000 Original-Received: from localhost ([127.0.0.1]:53384 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rTdtl-0004jW-K9 for submit@debbugs.gnu.org; Sat, 27 Jan 2024 03:18:45 -0500 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:45500) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rTdtj-0004jI-FR for 68751@debbugs.gnu.org; Sat, 27 Jan 2024 03:18:44 -0500 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1rTdtU-0008NE-Hm; Sat, 27 Jan 2024 03:18:28 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org; s=fencepost-gnu-org; h=References:Subject:In-Reply-To:To:From:Date: mime-version; bh=/jYeF5ARS2+3KQv6mVJZzAEQBtUYGCyRK9RzzshZp+s=; b=Ti1WG+ZTl9Q6 wsKKNgx9kksshMNHjnfOD9YEWGEw9xyXkVAeZyMENMjFuxr35auzoL+g9Peuf6ca0COY9Mkujb58t bOKuNqYXO3pg7282FZYIqtr5eddugcTLPSewFk/5kyd0oI2OlCj0mdIYiqs+yQxD27q+8s5cNEAGj nGgR123okXYDSlKgf3rPRYsHvTdOBDD6ynX5r3G+9JJ+8ckaZgRE/Wumdfc34z2bUQRUEhF7sEgdD +vj2jBythl/SsYdJCYYe/krl3zfxyNW6mWzwB4tPi0xhcvpY//aEpY8/J11uIgWZKpJ3IT/Apx8p4 D7f2RsmKXVycIoasVy97+w==; In-Reply-To: <924db1f7-1bee-41cb-bd59-c54a9572b830@stegny.2a.pl> (message from Christopher Yeleighton on Sat, 27 Jan 2024 06:46:36 +0000) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.bugs:278969 Archived-At: > Date: Sat, 27 Jan 2024 06:46:36 +0000 > From: Christopher Yeleighton > > Info (elisp) Non-ASCII in Strings says: > > > If a string constant contains hexadecimal or octal escape sequences, > and these > > escape sequences all specify unibyte characters (i.e., less than 256), > > and there are no other literal non-ASCII characters or Unicode-style > > escape sequences in the string, then Emacs automatically assumes that it > > is a unibyte string. > > I believe it should say: > > | (i.e., less than 256 and octal or written with 2 hexadecimal digits), Right. I modified the text to that effect. > and additionally > > | Unibyte characters embedded in multibyte string constants evaluate to > private character codes, > | e.g. "\x0a0\xa0" equals "\x0a0\x3fffa0". I didn't make this change because I don't see how it is useful. First, "evaluate" is confusing here. Also, "private character codes" is confusing/incorrect, as it could be interpreted to mean Emacs somehow uses the PUA of Unicode codespace, which it doesn't. Finally, when Emacs converts from a single-byte representation of a raw byte to its multibyte representation is an obscure matter largely defined by ad-hoc compatibility considerations, and doesn't belong to the ELisp manual. I think this bug can be closed now.