From: Kaelyn
To: Eidvilas Markevičius
Cc: guix-devel@gnu.org
Subject: Re: Relaxing the restrictions for store item names
Date: Thu, 24 Aug 2023 16:30:26 +0000

Hi,

On Tuesday, August 22nd, 2023 at 6:49 AM, Eidvilas Markevičius wrote:

> Therefore, my proposal is to relax these limitations as much as
> possible (or at least somewhat) and to allow some more freedom when it
> comes to naming packages and other kinds of items in the store. We
> could, of course, still disallow all the main problematic characters,
> such as NUL, /, $, ~, space, newline and a few others, but other than
> that, I don't see any reason to forbid any of the remaining ones from
> being used.

While I don't really have an opinion on the matter aside from the biases
of growing up in the US, one non-trivial issue with Unicode store paths
and package names which hasn't been mentioned is that of Unicode
equivalence[1] and the closely related problem of homographs[2]. For
example, U+0061 and U+0430 (the Latin and Cyrillic small letter "a",
respectively) are often visually identical but programmatically
distinct. If not handled well, this could lead to untypable package or
store item names, because the user would have to guess which Unicode
code point(s) are the correct ones for a given glyph.

Cheers,
Kaelyn

[1] https://en.wikipedia.org/wiki/Unicode_equivalence
[2] https://en.wikipedia.org/wiki/IDN_homograph_attack
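
To make the U+0061 / U+0430 example concrete, here is a minimal Python
sketch using only the standard unicodedata module. It is an illustration
under assumptions, not anything Guix does: the helper scripts_used() and
the sample names are hypothetical, and guessing a script from the first
word of a character's Unicode name is only a rough heuristic. It shows
that normalization merges canonically equivalent sequences (precomposed
vs. decomposed "é") but does not merge look-alike letters from different
scripts.

```python
import unicodedata

latin_a = "\u0061"     # LATIN SMALL LETTER A
cyrillic_a = "\u0430"  # CYRILLIC SMALL LETTER A

# Homographs: the two letters compare unequal, and even the most
# aggressive normalization form (NFKC) leaves them distinct.
homographs_differ = (
    unicodedata.normalize("NFKC", latin_a)
    != unicodedata.normalize("NFKC", cyrillic_a)
)

# Canonical equivalence: "é" as one code point vs. "e" + combining acute.
precomposed = "\u00e9"
decomposed = "e\u0301"
equal_raw = precomposed == decomposed
equal_nfc = (
    unicodedata.normalize("NFC", precomposed)
    == unicodedata.normalize("NFC", decomposed)
)


def scripts_used(name):
    """Hypothetical helper: rough set of scripts among a name's letters.

    Heuristic only: takes the first word of each letter's Unicode name
    (e.g. "LATIN", "CYRILLIC") as its script.
    """
    return {
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in name
        if ch.isalpha()
    }


if __name__ == "__main__":
    print(homographs_differ, equal_raw, equal_nfc)
    # Made-up names: the second one ends in Cyrillic U+043E, not Latin "o".
    print(scripts_used("hello"))
    print(scripts_used("hell\u043e"))
```

A check like scripts_used() could flag names that mix scripts, but it
would not catch a name written entirely in look-alike letters from a
single other script, so it is at best a partial mitigation.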