From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS3215 2.6.0.0/16 X-Spam-Status: No, score=-3.6 required=3.0 tests=AWL,BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from mail-qt1-x834.google.com (mail-qt1-x834.google.com [IPv6:2607:f8b0:4864:20::834]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id 671971F910 for ; Thu, 24 Nov 2022 15:37:19 +0000 (UTC) Authentication-Results: dcvr.yhbt.net; dkim=pass (1024-bit key; unprotected) header.d=linuxfoundation.org header.i=@linuxfoundation.org header.b="RwJYXn+Z"; dkim-atps=neutral Received: by mail-qt1-x834.google.com with SMTP id cg5so1180226qtb.12 for ; Thu, 24 Nov 2022 07:37:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linuxfoundation.org; s=google; h=content-disposition:mime-version:message-id:subject:to:from:date :from:to:cc:subject:date:message-id:reply-to; bh=BvIMbpx0J0fY2pC7o2SnDB8csGCGKtMR6Idwi98uyrI=; b=RwJYXn+ZibOgcVIxG7DdwkACGcTfmFXqzriUdik6EVMCe+zeSDnd684if9DcbYkige NsXX5h53lTV1CGb1uj3ZX4m6N5NcLj9V/UcGqp7MbkEQ1LdSs8+J1XAgQ5Rw07V607FR iWQva9Ex4qg3tJG4atg9aVttbS+oiXp4kc4VU= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-disposition:mime-version:message-id:subject:to:from:date :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=BvIMbpx0J0fY2pC7o2SnDB8csGCGKtMR6Idwi98uyrI=; b=hlv+DYrRHG4bTHJ7GSN3OjJI04oL/AOP5X1spn5xk+BpRrjTeniOmwemklZHMECxqF ziROqdwQ1H0wCosyk94HrFTyZ/DrV59Fe5IOCboifo3KZVHfyDoJQq4sCKw2hFHnhR23 tXz9SeJ71uEna2KrztopKFM0HzfG+/8/by/Yn4jBKYfk+Bn6ZE8CNIhXAw6eHoFVKeL1 APlBrzB+ZjltbBNQi04jCko6DRVdSUp9h2s/9raXwfI+fDC0HxXf/Pm8eTCRZetkHwak etfTHcq3LlJQcta7Hu3wyRYG5R3sjsZVvNNfA66qtmRI4PpMlrZHvRydLu+G84Z2o8k5 TtyA== X-Gm-Message-State: ANoB5plyvIxWLpsf4TD6oskANufDSKZnhGafJdZbFegQ1FE40lNiNz19 DjxuQaxjx6PeRoMs8BD5gMxADEP+wb5ZiA== X-Google-Smtp-Source: AA0mqf6am8ayTzGKvp+ccaKLDHQ8+kXM2Kr5JDdm5ERoeUV/r8sv04q3PIU94Ln2kbMYd5PKz3iDnA== X-Received: by 2002:ac8:5441:0:b0:3a5:50ba:b20c with SMTP id d1-20020ac85441000000b003a550bab20cmr31808147qtq.588.1669304237604; Thu, 24 Nov 2022 07:37:17 -0800 (PST) Received: from meerkat.local (bras-base-mtrlpq5031w-grc-33-142-113-79-147.dsl.bell.ca. [142.113.79.147]) by smtp.gmail.com with ESMTPSA id s18-20020a05620a255200b006bbf85cad0fsm1046398qko.20.2022.11.24.07.37.17 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 24 Nov 2022 07:37:17 -0800 (PST) Date: Thu, 24 Nov 2022 10:37:15 -0500 From: Konstantin Ryabitsev To: meta@public-inbox.org Subject: handling unquoted utf8 in the headers Message-ID: <20221124153715.3nenjpjzj43vqxr2@meerkat.local> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline List-Id: Hello: There's a bit of inconsistency handling messages with utf8 content in the headers: https://lore.kernel.org/b4-sent/20221122-gud-shadow-plane-v1-0-9de3afa3383e@tronnes.org/ You can see that the name in the From: line is mangled, but in the thread overview it is displayed correctly. I know older SMTP standards still require 7bit escaping in the headers, but with SMTPUTF8 being very widely available, it should be possible to store and properly display messages with 8bit unicode in the headers. -K