From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Brian Elmegaard via "Bug reports for GNU Emacs, the Swiss army knife of text editors" Newsgroups: gmane.emacs.bugs Subject: bug#47455: 27.1; bibtex mode - citation key generation - non-ascii characters Date: Fri, 28 May 2021 07:17:10 +0000 Message-ID: <730b1644528547f78ea6910f882591d6@mek.dtu.dk> References: <87wnrwc9ia.fsf@gnus.org> <3816.41298.758059.24740@gargle.gargle.HOWL> <878s43aivd.fsf@gnus.org> <39413.75034.396197.24750@gargle.gargle.HOWL> Reply-To: Brian Elmegaard Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="10764"; mail-complaints-to="usenet@ciao.gmane.io" Cc: "47455@debbugs.gnu.org" <47455@debbugs.gnu.org> To: Roland Winkler , Lars Ingebrigtsen Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Fri May 28 09:18:16 2021 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1lmWl6-0002XJ-KF for geb-bug-gnu-emacs@m.gmane-mx.org; Fri, 28 May 2021 09:18:16 +0200 Original-Received: from localhost ([::1]:56306 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1lmWl4-0001ml-NQ for geb-bug-gnu-emacs@m.gmane-mx.org; Fri, 28 May 2021 03:18:14 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:33264) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lmWks-0001mE-9V for bug-gnu-emacs@gnu.org; Fri, 28 May 2021 03:18:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:41975) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1lmWks-0002cK-0z for bug-gnu-emacs@gnu.org; Fri, 28 May 2021 03:18:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1lmWkr-0002iD-Pw for bug-gnu-emacs@gnu.org; Fri, 28 May 2021 03:18:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Brian Elmegaard Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 28 May 2021 07:18:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 47455 X-GNU-PR-Package: emacs Original-Received: via spool by 47455-submit@debbugs.gnu.org id=B47455.162218624510378 (code B ref 47455); Fri, 28 May 2021 07:18:01 +0000 Original-Received: (at 47455) by debbugs.gnu.org; 28 May 2021 07:17:25 +0000 Original-Received: from localhost ([127.0.0.1]:53521 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lmWkG-0002hJ-M6 for submit@debbugs.gnu.org; Fri, 28 May 2021 03:17:24 -0400 Original-Received: from mail-vi1eur05on2054.outbound.protection.outlook.com ([40.107.21.54]:55521 helo=EUR05-VI1-obe.outbound.protection.outlook.com) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lmWkA-0002h1-TQ for 47455@debbugs.gnu.org; Fri, 28 May 2021 03:17:23 -0400 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=CYOLIMhCg8FYh7O8bBlxivXRBUS2Xw/ylX3ylYeQRGN8de90MP4BakoRGW/kHI9DfsLZv0rntlgTWDFgw3uHA/012IGJ3EE1T55eHSHv9MyI5CIB+U3LCcoLoWLVLKxQYnNqZMS0C6ZC5DcR0PlUB6jvKCh6FR09GaZL9T42mW+2wb1FaIpPeBUhiip5hck9F9twJ/VfR749Y2Ck/QQuDM/YJPOcePaHIETTx0Kv6bHgfoqNk84h8c+VBYIiK51Jn47FwVS90p2pUfljcCnz7uQ6HPdOIDDH6ti31og0W532eObc5/t9WJs8mFZOY6bVZelN6P5nqhiDNHmfMXnMlQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=1g4IZBWOzVQIkLYQ4D51hhsa98HCYsQPB3AjRfrrq3E=; b=HhTRnU+6leFartpW1zOFd/0CS5WuOwyDXqz81UxlRZxLl5Ie9y/TmYhUQ/p8rfCWdJcfAnRi4aCCl0Ew/zqWW2T8C9rZQ4wtfO+9uRtrqO0fRmUl2ZMLrotrBlVgZToKsw3Tbu9bbvKLRtlTQxl+bnAnYMjN31Re025bL3DZiCBVfZ7sNjXaieJhG4eNPLu53VojMGs8WKk3TJVOJp3ieLfJdw+RyYNz2eEIYANJdx+TtoNCvQs3Yc2qXBxO4wtJWwM0o2mr4ONwnQPPHgCV0AIsOY8hbSljP1IWksmSfHZgyPD21qNd+KcRtWVfrBDL4JndPksFcocTISoWUtz4Zw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 192.38.82.194) smtp.rcpttodomain=gnu.org smtp.mailfrom=mek.dtu.dk; dmarc=pass (p=quarantine sp=quarantine pct=100) action=none header.from=mek.dtu.dk; dkim=none (message not signed); arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mek.dtu.dk; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=1g4IZBWOzVQIkLYQ4D51hhsa98HCYsQPB3AjRfrrq3E=; b=Mt8Lf4BxtZQ6eTOHQBsEh2M1DF4JGoT96eU5M7m8XHdDcIs1+xQyW+H/0F2Wzf6KAiwe7Rv/NiuHjUPSxD1Mygb1T8Z/puBnA6o+kEGihE4d9TMJKMMtjAhIXX2P+TcNysJwlP5rsHYH/Uf73ga71FbjcFAkbpap62kkNpxTidA= Original-Received: from DB6PR0501CA0015.eurprd05.prod.outlook.com (2603:10a6:4:8f::25) by PR3P192MB0556.EURP192.PROD.OUTLOOK.COM (2603:10a6:102:47::20) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4150.27; Fri, 28 May 2021 07:17:11 +0000 Original-Received: from DB5EUR01FT049.eop-EUR01.prod.protection.outlook.com (2603:10a6:4:8f:cafe::b8) by DB6PR0501CA0015.outlook.office365.com (2603:10a6:4:8f::25) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4173.20 via Frontend Transport; Fri, 28 May 2021 07:17:11 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 192.38.82.194) smtp.mailfrom=mek.dtu.dk; gnu.org; dkim=none (message not signed) header.d=none;gnu.org; dmarc=pass action=none header.from=mek.dtu.dk; Received-SPF: Pass (protection.outlook.com: domain of mek.dtu.dk designates 192.38.82.194 as permitted sender) receiver=protection.outlook.com; client-ip=192.38.82.194; helo=mail.win.dtu.dk; Original-Received: from mail.win.dtu.dk (192.38.82.194) by DB5EUR01FT049.mail.protection.outlook.com (10.152.5.138) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256) id 15.20.4150.30 via Frontend Transport; Fri, 28 May 2021 07:17:11 +0000 Original-Received: from ait-pexsrv01.win.dtu.dk (192.38.82.194) by ait-pexsrv01.win.dtu.dk (192.38.82.194) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.2176.14; Fri, 28 May 2021 09:17:10 +0200 Original-Received: from ait-pexsrv01.win.dtu.dk ([192.38.82.194]) by ait-pexsrv01.win.dtu.dk ([192.38.82.194]) with mapi id 15.01.2176.014; Fri, 28 May 2021 09:17:10 +0200 Thread-Topic: bug#47455: 27.1; bibtex mode - citation key generation - non-ascii characters Thread-Index: AdckGPjNpDRmkKalTUe/PdbUWtNj/Qn5rnl/AAHq4gABOFvvjQBZ1K6AAFALgzA= In-Reply-To: <39413.75034.396197.24750@gargle.gargle.HOWL> Accept-Language: da-DK, en-US Content-Language: en-US x-originating-ip: [192.38.82.8] X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id: 80d0054e-d7a5-4d93-09c3-08d921a89ca6 X-MS-TrafficTypeDiagnostic: PR3P192MB0556: X-Microsoft-Antispam-PRVS: X-MS-Oob-TLC-OOBClassifiers: OLM:873; X-MS-Exchange-SenderADCheck: 1 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: 5li0t8kJp9eG4aqtLN3uJsOzDp7v52S/SStfNx/EPZPLoYMDmKVw1xfQ3sRQ08GUnVW9RAdITjYx79vmV6DTC2y//RjHQu5soByh/+M7suJ3Idz4RwNLTd988SngHW5F/1be6SzrOUsWg4dQ0V4uSog8mUx/Q4GWaNvMlZh6ajmPreZmklM/urtB7gkkuqI7clfooknjVbMd69QhhOJDItLklzm1tPh8bXrKUL48w5KJbAFc6joju6SGvZ1IB6MbVHLvf0skO6ksz2CRliIhKFNlYQcFSq0n8+Y/JXwQ7MNMGZ7oFbc5R1busLhtW4/LlIUzLH8PD7O0WdlUuAWepK5BSFRa517QeZ4cSN0WJxHc1A3TM4XwjgmiGL6+cdq6HHzsh61d7KhWnxhDlY3hqqlUkdJGbCyeHYbWCthCq8xIoP6Szj6BGaNpkqCRaJcF29rADN4HqPUOpKz0T0cqkwPmOYF/BgLt3p+nxlItfDdt6qJW2FJ4dqkkk6eD4fxHu8y4kcLHBpPICGV9FmVsvslugtcKFnVV98cCtMyjKKwo2S6Ulftt6efgAcXw+ZkcsQAgA3CN4m/cxC5UxqFr9gqlBc6ZTUbe/7g/Ysb8K36htdk7LSUgYMgwjcC3KKjxqyNinrewUDix1PbL5ktuwJ3hBMoQANWoa3KxkrXOfpo= X-Forefront-Antispam-Report: CIP:192.38.82.194; CTRY:DK; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:mail.win.dtu.dk; PTR:ait-pexsrv01.win.dtu.dk; CAT:NONE; SFS:(376002)(136003)(39850400004)(346002)(396003)(36840700001)(46966006)(36756003)(26005)(316002)(82310400003)(70586007)(110136005)(82740400003)(53546011)(2906002)(336012)(47076005)(956004)(478600001)(356005)(2616005)(5660300002)(70206006)(83380400001)(4326008)(108616005)(24736004)(8936002)(8676002)(86362001)(36860700001)(786003)(186003); DIR:OUT; SFP:1101; X-OriginatorOrg: mek.dtu.dk X-MS-Exchange-CrossTenant-OriginalArrivalTime: 28 May 2021 07:17:11.5215 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 80d0054e-d7a5-4d93-09c3-08d921a89ca6 X-MS-Exchange-CrossTenant-Id: f251f123-c9ce-448e-9277-34bb285911d9 X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f251f123-c9ce-448e-9277-34bb285911d9; Ip=[192.38.82.194]; Helo=[mail.win.dtu.dk] X-MS-Exchange-CrossTenant-AuthSource: DB5EUR01FT049.eop-EUR01.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: PR3P192MB0556 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:207444 Archived-At: Hi Thanks for looking into this. I understand your reasoning about this being an issue with the tools used. In auctex I can also enter \newcommand{\=FC}{u} without being warned that i= t will not work with latex. The warning seems to be a good idea to me as well. Brian -----Original Message----- From: Roland Winkler =20 Sent: 26. maj 2021 20:57 To: Lars Ingebrigtsen Cc: 47455@debbugs.gnu.org; Brian Elmegaard Subject: Re: bug#47455: 27.1; bibtex mode - citation key generation - non-a= scii characters On Tue May 25 2021 Lars Ingebrigtsen wrote: > And this gets even more difficult to deal with for non-Latin scripts. >=20 > So I'm not sure anything here can be done programmatically... the=20 > command could output a warning? "Probably invalid key"? The warning is a good idea. Actually, the warning should be issued if ther= e are non-ascii characters anywhere in a BibTeX key because (oldfashioned) BibTeX will choke on those no matter where they appear. So = I'll add a new element for the user variable bibtex-entry-format for this. = Then users can enable these warnings if they use oldfashioned BibTeX. (Th= ose who use modern variants of BibTeX need not enable these warnings.) I believe that the real problem here lies in the fact that many publishers = of scientific journals let you download citation records for their journal = articles. When they offer not only BibTeX-formatted records but other form= ats, too, the BibTeX records are often malformed, decorated with non-ascii = characters that BibTeX (and LaTeX) cannot handle and other things. I have been fooled a number of time= s by "invisible" non-ascii characters. So I will enable the new option for= myself!