From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Juri Linkov Newsgroups: gmane.emacs.bugs Subject: bug#69968: Case-folding of Mathematical Alphanumeric Symbols Date: Sun, 24 Mar 2024 19:09:10 +0200 Organization: LINKOV.NET Message-ID: <86le67abtb.fsf@mail.linkov.net> References: <86zfuoua66.fsf@mail.linkov.net> <86r0g0xhyc.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="5658"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/30.0.50 (x86_64-pc-linux-gnu) Cc: 69968@debbugs.gnu.org To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Sun Mar 24 18:22:00 2024 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1roRXj-0001D7-LZ for geb-bug-gnu-emacs@m.gmane-mx.org; Sun, 24 Mar 2024 18:21:59 +0100 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1roRX9-0006rE-Ec; Sun, 24 Mar 2024 13:21:23 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1roRX7-0006qd-Mp for bug-gnu-emacs@gnu.org; Sun, 24 Mar 2024 13:21:21 -0400 Original-Received: from debbugs.gnu.org ([2001:470:142:5::43]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1roRX7-0000nd-Ep for bug-gnu-emacs@gnu.org; Sun, 24 Mar 2024 13:21:21 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1roRXn-0004uQ-9f for bug-gnu-emacs@gnu.org; Sun, 24 Mar 2024 13:22:03 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Juri Linkov Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sun, 24 Mar 2024 17:22:03 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 69968 X-GNU-PR-Package: emacs Original-Received: via spool by 69968-submit@debbugs.gnu.org id=B69968.171130091018807 (code B ref 69968); Sun, 24 Mar 2024 17:22:03 +0000 Original-Received: (at 69968) by debbugs.gnu.org; 24 Mar 2024 17:21:50 +0000 Original-Received: from localhost ([127.0.0.1]:45378 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1roRXZ-0004tH-LG for submit@debbugs.gnu.org; Sun, 24 Mar 2024 13:21:50 -0400 Original-Received: from relay9-d.mail.gandi.net ([217.70.183.199]:55397) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1roRXW-0004sY-DE for 69968@debbugs.gnu.org; Sun, 24 Mar 2024 13:21:48 -0400 Original-Received: by mail.gandi.net (Postfix) with ESMTPSA id F3A9BFF803; Sun, 24 Mar 2024 17:20:37 +0000 (UTC) In-Reply-To: <86r0g0xhyc.fsf@gnu.org> (Eli Zaretskii's message of "Sun, 24 Mar 2024 08:27:39 +0200") X-GND-Sasl: juri@linkov.net X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.bugs:282025 Archived-At: >> I wonder why case-folding is not supported for letters from >> the Unicode block "Mathematical Alphanumeric Symbols": >> https://en.wikipedia.org/wiki/Mathematical_Alphanumeric_Symbols > > These are not letters, they are symbols. And letter-case is not > defined for symbols. π˜‹π˜° 𝘺𝘰𝘢 𝘳𝘦𝘒𝘭𝘭𝘺 𝘡𝘩π˜ͺ𝘯𝘬 𝘡𝘩π˜ͺ𝘴 𝘡𝘦𝘹𝘡 π˜ͺ𝘴 𝘯𝘰𝘡 𝘸𝘳π˜ͺ𝘡𝘡𝘦𝘯 𝘸π˜ͺ𝘡𝘩 π™‘π™šπ™©π™©π™šπ™§π™¨? >> Is it because the Unicode standard doesn't provide information >> about their case-folding? And indeed they are missing from >> https://unicode.org/Public/UNIDATA/CaseFolding.txt > > Unicode doesn't consider them letters. Ок, if Unicode doesn't consider them letters, let's stick to the Unicode standard. >> But OTOH, I can't find the file CaseFolding.txt in admin/unidata. >> This means Emacs doesn't use this file? > > We don't. We use the case-conversion information in UnicodeData.txt, > as it tells us everything we need to know. Thanks, I didn't remember that case-conversion is in UnicodeData.txt. I checked admin/unidata/UnicodeData.txt and indeed there is no case-conversion for Mathematical Alphanumeric Symbols. >> Then should we add more case-folding information explicitly >> for this Unicode block? > > What is the rationale for doing so? It's against Unicode, so we need > to have a good reason, as this will have to be maintained by hand, and > also because some users might be surprised. I don't think that some users might be surprised because when they don't need to change case, they just don't use case-changing functions. But when they expect that case should be changed, then indeed they will be surprised that case is not changed. >> Case-folding is already supported for some characters from other >> Unicode blocks such e.g. FULLWIDTH LATIN CAPITAL LETTERs, >> CIRCLED LATIN CAPITAL LETTERs, etc. > > That's because UnicodeData.txt defines their letter-case conversions. Ok, then it's very strange that the Unicode standard doesn't define letter-case conversions for other letters. But what can we do. >> But e.g. PARENTHESIZED LATIN CAPITAL LETTERs are missing too. >> What is worse is that in Emacs β’œ doesn't have even a word syntax >> like its counterpart πŸ„. > > I think the fact that πŸ„ has the word syntax might be a mistake. These > are both symbols, so why would we want them to have the word syntax? Because they look like letters with diacritics.