From mboxrd@z Thu Jan 1 00:00:00 1970
From: Eli Zaretskii
Newsgroups: gmane.emacs.bugs
Subject: bug#11309: 24.1.50; Case problems with [:upper:] and Cyrillic, Greek
Date: Tue, 08 Dec 2020 18:02:05 +0200
Message-ID: <83ft4g70ci.fsf@gnu.org>
References: <5D75AE9F-F1F7-4A7E-A135-0071E03369AA@acm.org> <70DAA5B7-B336-4E8E-A342-05BD46BC0472@acm.org>
In-Reply-To: <70DAA5B7-B336-4E8E-A342-05BD46BC0472@acm.org> (message from Mattias Engdegård on Tue, 8 Dec 2020 15:48:42 +0100)
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
To: Mattias Engdegård
Cc: kehoea@parhasard.net, larsi@gnus.org, 11309@debbugs.gnu.org

> From: Mattias Engdegård
> Date: Tue, 8 Dec 2020 15:48:42 +0100
> Cc: 11309@debbugs.gnu.org
>
> The remaining problem seems to be that the upcase table maps ß to
> itself, which is wrong -- as long as we don't upcase ß to U+1E9E, it
> should not have an upcase table entry at all. I'll see what can be
> done about that.

Why is this a problem?  AFAIR characters that don't have an upper-case
form map to themselves when upcased.  E.g.

  (upcase ?1) => ?1

Why should ß violate this convention?

> * src/regex-emacs.c (execute_charset): Add canon_table argument to
> allow expression of a correct predicate for [:upper:] and [:lower:].
> (mutually_exclusive_p, re_match_2_internal): Pass extra argument.
> * test/src/regex-emacs-tests.el (regexp-case-fold, regexp-eszett):
> New tests.  Parts of regexp-eszett still fail and are commented out.

Thanks, LGTM.
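
For reference, the self-mapping convention mentioned above can be checked with a small sketch like the one below. The helper name `my-char-has-upcase-p` is hypothetical and not part of Emacs; it only wraps the built-in `upcase`, which uses the case table of the current buffer, so results assume the standard case table.

```elisp
;; Hypothetical helper for illustration only; it is not part of Emacs.
(defun my-char-has-upcase-p (ch)
  "Return non-nil if `upcase' maps character CH to a different character."
  (/= ch (upcase ch)))

(upcase ?1)                  ; => ?1, i.e. 49: no upper-case form
(upcase ?a)                  ; => ?A, i.e. 65
(my-char-has-upcase-p ?1)    ; => nil
(my-char-has-upcase-p ?a)    ; => t

;; For ?ß the result depends on the case table in effect, so no
;; expected value is given here.
(my-char-has-upcase-p ?ß)
```

With a helper like this, a character that maps to itself is indistinguishable from one that has no upper-case form at all, which seems to be the distinction the two positions above disagree on.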