From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: "Roland Winkler" Newsgroups: gmane.emacs.bugs Subject: bug#47455: 27.1; bibtex mode - citation key generation - non-ascii characters Date: Wed, 26 May 2021 13:56:53 -0500 Message-ID: <39413.75034.396197.24750@gargle.gargle.HOWL> References: <87wnrwc9ia.fsf@gnus.org> <3816.41298.758059.24740@gargle.gargle.HOWL> <878s43aivd.fsf@gnus.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="17289"; mail-complaints-to="usenet@ciao.gmane.io" Cc: 47455@debbugs.gnu.org, Brian Elmegaard To: Lars Ingebrigtsen Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Wed May 26 20:58:39 2021 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1llyjn-0004KA-Q9 for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 26 May 2021 20:58:39 +0200 Original-Received: from localhost ([::1]:38262 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1llyjm-0000g2-Pd for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 26 May 2021 14:58:38 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:38576) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1llyjC-00008d-NZ for bug-gnu-emacs@gnu.org; Wed, 26 May 2021 14:58:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:38405) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1llyjC-00060t-GB for bug-gnu-emacs@gnu.org; Wed, 26 May 2021 14:58:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1llyjC-0000cg-ES for bug-gnu-emacs@gnu.org; Wed, 26 May 2021 14:58:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: "Roland Winkler" Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 26 May 2021 18:58:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 47455 X-GNU-PR-Package: emacs Original-Received: via spool by 47455-submit@debbugs.gnu.org id=B47455.16220554242324 (code B ref 47455); Wed, 26 May 2021 18:58:02 +0000 Original-Received: (at 47455) by debbugs.gnu.org; 26 May 2021 18:57:04 +0000 Original-Received: from localhost ([127.0.0.1]:49951 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1llyiF-0000bQ-OC for submit@debbugs.gnu.org; Wed, 26 May 2021 14:57:03 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:44684) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1llyiE-0000am-1R for 47455@debbugs.gnu.org; Wed, 26 May 2021 14:57:02 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:43244) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1llyi8-0005My-1N; Wed, 26 May 2021 14:56:56 -0400 Original-Received: from [2600:1700:5650:f790:7ccd:4158:4a46:b3e0] (port=48070 helo=regnitz) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1llyi6-0002eU-Hy; Wed, 26 May 2021 14:56:55 -0400 In-Reply-To: <878s43aivd.fsf@gnus.org> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:207312 Archived-At: On Tue May 25 2021 Lars Ingebrigtsen wrote: > And this gets even more difficult to deal with for non-Latin scripts. > > So I'm not sure anything here can be done programmatically... the > command could output a warning? "Probably invalid key"? The warning is a good idea. Actually, the warning should be issued if there are non-ascii characters anywhere in a BibTeX key because (oldfashioned) BibTeX will choke on those no matter where they appear. So I'll add a new element for the user variable bibtex-entry-format for this. Then users can enable these warnings if they use oldfashioned BibTeX. (Those who use modern variants of BibTeX need not enable these warnings.) I believe that the real problem here lies in the fact that many publishers of scientific journals let you download citation records for their journal articles. When they offer not only BibTeX-formatted records but other formats, too, the BibTeX records are often malformed, decorated with non-ascii characters that BibTeX (and LaTeX) cannot handle and other things. I have been fooled a number of times by "invisible" non-ascii characters. So I will enable the new option for myself!