From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: "Roland Winkler" Newsgroups: gmane.emacs.bugs Subject: bug#47455: 27.1; bibtex mode - citation key generation - non-ascii characters Date: Tue, 18 May 2021 14:00:56 -0500 Message-ID: <3816.41298.758059.24740@gargle.gargle.HOWL> References: <87wnrwc9ia.fsf@gnus.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="19083"; mail-complaints-to="usenet@ciao.gmane.io" Cc: 47455@debbugs.gnu.org, Brian Elmegaard To: Lars Ingebrigtsen Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Tue May 18 21:06:19 2021 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1lj52p-0004fW-0W for geb-bug-gnu-emacs@m.gmane-mx.org; Tue, 18 May 2021 21:06:19 +0200 Original-Received: from localhost ([::1]:58252 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1lj52m-00089P-G1 for geb-bug-gnu-emacs@m.gmane-mx.org; Tue, 18 May 2021 15:06:16 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:55052) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lj4yi-00027M-0I for bug-gnu-emacs@gnu.org; Tue, 18 May 2021 15:02:04 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:45766) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1lj4yh-0005HP-8R for bug-gnu-emacs@gnu.org; Tue, 18 May 2021 15:02:03 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1lj4yg-0004Wt-5d for bug-gnu-emacs@gnu.org; Tue, 18 May 2021 15:02:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: "Roland Winkler" Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 18 May 2021 19:02:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 47455 X-GNU-PR-Package: emacs Original-Received: via spool by 47455-submit@debbugs.gnu.org id=B47455.162136447117347 (code B ref 47455); Tue, 18 May 2021 19:02:02 +0000 Original-Received: (at 47455) by debbugs.gnu.org; 18 May 2021 19:01:11 +0000 Original-Received: from localhost ([127.0.0.1]:57312 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lj4xq-0004Vg-6W for submit@debbugs.gnu.org; Tue, 18 May 2021 15:01:11 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:32918) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lj4xk-0004V3-2K for 47455@debbugs.gnu.org; Tue, 18 May 2021 15:01:07 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:47608) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1lj4xe-0004wn-1x; Tue, 18 May 2021 15:00:58 -0400 Original-Received: from [2600:1700:5650:f790:7ccd:4158:4a46:b3e0] (port=52356 helo=regnitz) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1lj4xd-0002Vy-Sa; Tue, 18 May 2021 15:00:57 -0400 In-Reply-To: <87wnrwc9ia.fsf@gnus.org> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:206831 Archived-At: On Tue May 18 2021 Lars Ingebrigtsen wrote: > Brian Elmegaard writes: >=20 > > Using C-c C-c in a bibtex cleans the entry and generates a citation key. > > If the author name includes non-ascii characters these are included in > > the key, even though BibTeX does not accept this. >=20 > Is this the case for all versions of BibTeX? I believe the problem lies here already in BibTeX itself, that is, BibTeX [like conventional (La)TeX] does not like non-ascii characters anywhere, not in the key nor anywhere else. Of course, there is biblatex and also new versions of (La)TeX that can handle non-ascii characters. But that's a separate story. > > For example: > > > > @Article{=C3=A4=C3=B6=C3=BC21, > > author =3D {=C3=A6=C3=B8=C3=A5 =C3=A4=C3=B6=C3=BC}, >=20 > I guess Emacs could an asciification of some sort here, but I'm > not sure there's any that's universally accepted? The default of the user variable bibtex-autokey-transcriptions handles "LaTeX non-ascii" characters like \"a. You can customize these rules to your liking. I vaguely remember an old thread that started from the very question raised here and expanding on how asciification can be encapsulated in some generic piece of elisp code. But I cannot find it anymore and I do not know either whether this would be possible at all. I believe, everyone agrees on asciification of German umlaute like =C3=A4 -> ae But beyond that, I do not know how to do this satisfactorily.