From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Robert Pluim Newsgroups: gmane.emacs.bugs Subject: bug#36852: 27.0.50; ietf-drums-parse-address doesn't handle non-ascii properly Date: Tue, 30 Jul 2019 11:53:36 +0200 Message-ID: References: <871ry7g9ve.fsf@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="29067"; mail-complaints-to="usenet@blaine.gmane.org" Cc: 36852@debbugs.gnu.org To: =?UTF-8?Q?=C5=A0t=C4=9Bp=C3=A1n_?= =?UTF-8?Q?N=C4=9Bmec?= Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Jul 30 11:54:12 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1hsOpe-0007R6-Kx for geb-bug-gnu-emacs@m.gmane.org; Tue, 30 Jul 2019 11:54:10 +0200 Original-Received: from localhost ([::1]:59278 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hsOpd-0001OI-HM for geb-bug-gnu-emacs@m.gmane.org; Tue, 30 Jul 2019 05:54:09 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:37092) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hsOpX-0001Nz-PK for bug-gnu-emacs@gnu.org; Tue, 30 Jul 2019 05:54:04 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hsOpW-0004LC-OA for bug-gnu-emacs@gnu.org; Tue, 30 Jul 2019 05:54:03 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:40740) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1hsOpW-0004L6-J4 for bug-gnu-emacs@gnu.org; Tue, 30 Jul 2019 05:54:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1hsOpW-0000Vz-DN for bug-gnu-emacs@gnu.org; Tue, 30 Jul 2019 05:54:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Robert Pluim Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 30 Jul 2019 09:54:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 36852 X-GNU-PR-Package: emacs Original-Received: via spool by 36852-submit@debbugs.gnu.org id=B36852.15644804291962 (code B ref 36852); Tue, 30 Jul 2019 09:54:02 +0000 Original-Received: (at 36852) by debbugs.gnu.org; 30 Jul 2019 09:53:49 +0000 Original-Received: from localhost ([127.0.0.1]:49561 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hsOpI-0000Va-N4 for submit@debbugs.gnu.org; Tue, 30 Jul 2019 05:53:48 -0400 Original-Received: from mail-wr1-f49.google.com ([209.85.221.49]:35931) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hsOpG-0000VK-VD for 36852@debbugs.gnu.org; Tue, 30 Jul 2019 05:53:47 -0400 Original-Received: by mail-wr1-f49.google.com with SMTP id n4so65119977wrs.3 for <36852@debbugs.gnu.org>; Tue, 30 Jul 2019 02:53:46 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:references:mail-copies-to:gmane-reply-to-list :date:in-reply-to:message-id:mime-version:content-transfer-encoding; bh=bSNz1Ud/6hHenYocG3EbWUu1OVuyprdkVT+IdvUaZ28=; b=pfL5Owkgs2H6kln9Vq+tDYHWxzv4GTaEnrxqG/fZBmnpWkJ+R/2JSsAirdwXmdjE8V rq5wR4tuBn8+ZK4LSYaSx0ww8dP/xgakWQplc7FUrKgYllrmcbrwD7uMrlj1VrhFYTwI BY5B/Cs0obxvlmqOAOQTHCvIxt6won+FMXlZxnPe2IPbhyGWsqwzgVPpd8I2U5Pu90gZ GQeM9ZxMRvEkR8HQk1yUeyKId+CV8jWfsKqiHAMgm++vyl2WKz1YQwXmA3YrcbRDYPAp VROgjlNOIexWr2w5fzZrPafGVGeTYW6u731NKMmx+fQrC7yyX2vSiNShSy9fXsr09VlW ae0Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:references:mail-copies-to :gmane-reply-to-list:date:in-reply-to:message-id:mime-version :content-transfer-encoding; bh=bSNz1Ud/6hHenYocG3EbWUu1OVuyprdkVT+IdvUaZ28=; b=arao3cz7X4uHujwei4HEpVJT5V1XnM/QTYBfxjMlmNPSyVjvVOEGdw0d0/mrgZSPMY 6+wjJ4BWeNEELJe40bpvZQIK/xFZRGU5aUhNJw6mT0xti5y6Mzfx+nNnN+lW3DFqFKxX sPR/+3G5t7wNpHXq0QnJiVEZQosFRvj/2rSiuwA2mHpgxw+GMBxzmFwHX6urBtwQQITJ yoRP0fvXpCRTtIBNzYdctSLE0cq49Gua/PKCGNONuVMIAd4DgBqfFo+LU6haw38sSO6k Tr2L2O8sf+M3dGTqYz5coTMEXmQTD4/0m7tTY/AGxkjU9GBEj/NbRZjNz8tCcKNnXX75 8waA== X-Gm-Message-State: APjAAAVp4oQqIEi48kTBdip/0Lo1JhV9pz3mGcWHWs5fjUZT1fxiOrlB GsWcHUTf2YTliZTnSLi2nkQlYY7o X-Google-Smtp-Source: APXvYqx0E7Kgzgmu1McY84HTWHCq9FB0jSxDRc/F1X4tqHBbkTSKGYwL9WE9rP8KHOdPUez/5VO+3A== X-Received: by 2002:a5d:51c8:: with SMTP id n8mr113914565wrv.46.1564480420546; Tue, 30 Jul 2019 02:53:40 -0700 (PDT) Original-Received: from rpluim-mac ([149.5.228.1]) by smtp.gmail.com with ESMTPSA id a2sm66748862wmj.9.2019.07.30.02.53.39 (version=TLS1_3 cipher=AEAD-AES256-GCM-SHA384 bits=256/256); Tue, 30 Jul 2019 02:53:39 -0700 (PDT) Mail-Copies-To: never Gmane-Reply-To-List: yes In-Reply-To: <871ry7g9ve.fsf@gmail.com> ("=?UTF-8?Q?=C5=A0t=C4=9Bp=C3=A1n_?= =?UTF-8?Q?N=C4=9Bmec?="'s message of "Tue, 30 Jul 2019 11:16:53 +0200") X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:164099 Archived-At: >>>>> On Tue, 30 Jul 2019 11:16:53 +0200, =C5=A0t=C4=9Bp=C3=A1n N=C4=9Bmec = said: =C5=A0t=C4=9Bp=C3=A1n> ietf-drums-parse-address (AKA mail-header-parse-= address) uses =C5=A0t=C4=9Bp=C3=A1n> ietf-drums-atext-token to parse display-name, bu= t the regexp range only =C5=A0t=C4=9Bp=C3=A1n> contains ASCII characters, so e.g. as used in de= bbugs-gnu-show-reports, =C5=A0t=C4=9Bp=C3=A1n> the following happens: =C5=A0t=C4=9Bp=C3=A1n> (mail-header-parse-address =C5=A0t=C4=9Bp=C3=A1n> (decode-coding-string "=C3=81a=C3=A1=C3=A1=C3= =A1 =C5=AE=C5=AF=C5=AF=C5=AF=C5=AF " 'utf-8)) =C5=A0t=C4=9Bp=C3=A1n> ;;=3D> ("aaa@example.net" . "a=C3=A1=C3=A1=C3= =A1") =C5=A0t=C4=9Bp=C3=A1n> It actually only cares about the first char of a= word: =C5=A0t=C4=9Bp=C3=A1n> (let ((ietf-drums-atext-token "-=C3=81=C5=AE^a= -zA-Z0-9!#$%&'*+/=3D?_`{|}~")) =C5=A0t=C4=9Bp=C3=A1n> (mail-header-parse-address =C5=A0t=C4=9Bp=C3=A1n> (decode-coding-string "=C3=81a=C3=A1=C3=A1= =C3=A1 =C5=AE=C5=AF=C5=AF=C5=AF=C5=AF " 'utf-8))) =C5=A0t=C4=9Bp=C3=A1n> ;;=3D> ("aaa@example.net" . "=C3=81a=C3=A1=C3= =A1=C3=A1 =C5=AE=C5=AF=C5=AF=C5=AF=C5=AF") =C5=A0t=C4=9Bp=C3=A1n> I'm not quite sure what the proper fix is, as th= e ASCII-only thing seems =C5=A0t=C4=9Bp=C3=A1n> to be intentional. Maybe it's just not supposed = to be used the way it is =C5=A0t=C4=9Bp=C3=A1n> used in debbugs-gnu.el? Mail headers are defined to be ascii-only, although as I=CA=BCve just discovered, gmail undoes Gnus' perfectly formatted RFC 2047 encoding and replaces it with UTF-8 characters. Bad Google, bad. Perhaps mail-header-parse-address could just discard the complete display string if it finds a non-ascii char? That would at least prevent it from propagating. Robert