From mboxrd@z Thu Jan 1 00:00:00 1970
From: Sam Steingold
Newsgroups: gmane.emacs.devel
Subject: Re: case-insensitive string comparison
Date: Mon, 25 Jul 2022 10:23:30 -0400
References: <87ilnsq4cr.fsf@gnu.org>
Reply-To: sds@gnu.org
To: emacs-devel@gnu.org
In-Reply-To: (Sam Steingold's message of "Wed, 20 Jul 2022 12:22:33 -0400")
Content-Type: text/plain; charset=utf-8
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (darwin)

> * Sam Steingold [2022-07-20 12:22:33 -0400]:
>
>> * Stefan Monnier [2022-07-19 23:01:31 -0400]:
>>
>>> PS. Actually, compare-strings/ignore_case is broken because it does,
>>> essentially, upcase both arguments, see https://stackoverflow.com/q/319426/850781
>>
>> Hmm... `string-collate-equalp`?
>
> (string-collate-equalp "a" "A" current-locale-environment t)
> ==> nil
> current-locale-environment
> ==> "en_US.UTF-8"

So, how do we do case-insensitive string comparison in Emacs?
Is it okay to add a `string-equal-ignore-case' based on `compare-strings'?
(even though it does not recognize "SS" and "ß" as equal)
Or should we first implement something like casefold in Python?
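
To make the "SS" vs "ß" point concrete, here is a minimal Python sketch of the difference between comparing after a plain case mapping and comparing after case folding. The two helper names are hypothetical, made up for illustration only; they are not an existing Python or Emacs API, and this says nothing about how an Emacs implementation would look.

```python
def equal_by_lower(a: str, b: str) -> bool:
    """Hypothetical helper: compare after lowercasing both strings."""
    return a.lower() == b.lower()


def equal_by_casefold(a: str, b: str) -> bool:
    """Hypothetical helper: compare after Unicode case folding (str.casefold)."""
    return a.casefold() == b.casefold()


if __name__ == "__main__":
    # Print both results for an ASCII pair and for the "SS" / "ß" pair.
    for left, right in [("a", "A"), ("SS", "ß")]:
        print(left, right,
              equal_by_lower(left, right),
              equal_by_casefold(left, right))
```

Both helpers treat "a" and "A" as equal, but only the casefold-based one treats "SS" and "ß" as equal, because `casefold()` maps "ß" to "ss" while `lower()` leaves it unchanged.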
https://docs.python.org/3/library/stdtypes.html#str.casefold

-- 
Sam Steingold (http://sds.podval.org/) on darwin Ns 10.3.2113
http://childpsy.net http://calmchildstories.com http://steingoldpsychology.com
https://camera.org https://honestreporting.com https://www.memritv.org
Warning! Dates in calendar are closer than they appear!