From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Ludovic =?UTF-8?Q?Court=C3=A8s?= Newsgroups: gmane.lisp.guile.bugs Subject: bug#57507: Regular expression matching depends on locale encoding Date: Mon, 05 Sep 2022 09:48:36 +0200 Message-ID: <87mtbe5kiz.fsf@gnu.org> References: Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="3562"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.1 (gnu/linux) Cc: 57507@debbugs.gnu.org To: Jean Abou Samra Original-X-From: bug-guile-bounces+guile-bugs=m.gmane-mx.org@gnu.org Mon Sep 05 09:49:13 2022 Return-path: Envelope-to: guile-bugs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1oV6r2-0000l0-Nu for guile-bugs@m.gmane-mx.org; Mon, 05 Sep 2022 09:49:12 +0200 Original-Received: from localhost ([::1]:35344 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1oV6r1-00065D-Cp for guile-bugs@m.gmane-mx.org; Mon, 05 Sep 2022 03:49:11 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:33086) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oV6qs-00062C-D8 for bug-guile@gnu.org; Mon, 05 Sep 2022 03:49:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:57644) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1oV6qs-00077e-2x for bug-guile@gnu.org; Mon, 05 Sep 2022 03:49:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1oV6qr-00029Z-LU for bug-guile@gnu.org; Mon, 05 Sep 2022 03:49:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Ludovic =?UTF-8?Q?Court=C3=A8s?= Original-Sender: "Debbugs-submit" Resent-CC: bug-guile@gnu.org Resent-Date: Mon, 05 Sep 2022 07:49:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 57507 X-GNU-PR-Package: guile Original-Received: via spool by 57507-submit@debbugs.gnu.org id=B57507.16623641278251 (code B ref 57507); Mon, 05 Sep 2022 07:49:01 +0000 Original-Received: (at 57507) by debbugs.gnu.org; 5 Sep 2022 07:48:47 +0000 Original-Received: from localhost ([127.0.0.1]:46343 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oV6qd-000291-Iw for submit@debbugs.gnu.org; Mon, 05 Sep 2022 03:48:47 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:40328) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oV6qb-00028m-52 for 57507@debbugs.gnu.org; Mon, 05 Sep 2022 03:48:46 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:41818) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oV6qV-00074l-H6; Mon, 05 Sep 2022 03:48:39 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org; s=fencepost-gnu-org; h=MIME-Version:In-Reply-To:Date:References:Subject:To: From; bh=yjLZpPpqJjrzvrwGwTJso5nzx7vIJSnh4hcje0lCoyg=; b=el5seU5/+mmjrJa+VYWn BaR1Cib8BUPnNJuzRHN+UdY2gv0L3OHo9kB2948dU9Nbz3EGsxjrQVi7TH1TVkFHRnceCJPZApB81 bjcs6FXJgOJ2knj101zxr73hjnipfKVsrrYj0/CrUvlbZpeW0ITEzYH7l8OWzh+mMU9drO7Ghd4R/ xYclP2/0tY81R2t5EwTTH3WKGBT5rYMSXDQqb1UgXyTDusHR9r8B5nl8XQYv0p/J2046YZfTGXb4i 4U6KpRsp89wVK/HY/rFgexnqqzpT6Y9VyWchhp0xUYo7dlvMbqegACeHdZ9nwZIKk8fCl2Ug6+XPa uTLK8eUPK7fgoA==; Original-Received: from [2001:660:6102:320:e120:2c8f:8909:cdfe] (port=39144 helo=ribbon) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oV6qV-0008FW-1s; Mon, 05 Sep 2022 03:48:39 -0400 In-Reply-To: (Jean Abou Samra's message of "Wed, 31 Aug 2022 18:54:50 +0200") X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-guile@gnu.org List-Id: "Bug reports for GUILE, GNU's Ubiquitous Extension Language" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-guile-bounces+guile-bugs=m.gmane-mx.org@gnu.org Original-Sender: "bug-guile" Xref: news.gmane.io gmane.lisp.guile.bugs:10365 Archived-At: Hi Jean, Jean Abou Samra skribis: > Regular expressions do funky things with Unicode if a non-Unicode-aware > locale is set. Yet, they're purely string operations, so I don't think > it's expected that they depend on the locale encoding. This is the expected behavior: first because (ice-9 regex) is implemented in terms of the libc regex functions, as Dale put (but that could be thought as an implementation detail), and second because things such as character classes are necessarily locale-dependent (this has bitten us in the past, for instance with ). I hope that makes sense. Thanks, Ludo=E2=80=99.