From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Ulrich Mueller Newsgroups: gmane.emacs.devel Subject: Re: Disambiguate modeline character for UTF-8? Date: Wed, 05 Jul 2023 12:08:59 +0200 Message-ID: References: <83wo1p73d2.fsf@gnu.org> <6ccde339-2bf1-3a4d-61bb-734046bf02d5@cs.ucla.edu> <83r1rx6vgv.fsf@gnu.org> <83lfi56te9.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="20640"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) Cc: Drew Adams , Eli Zaretskii , eggert@cs.ucla.edu, Stefan Monnier To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Wed Jul 05 12:10:19 2023 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1qGzSl-0005HI-HH for ged-emacs-devel@m.gmane-mx.org; Wed, 05 Jul 2023 12:10:19 +0200 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1qGzRp-0004Cq-DS; Wed, 05 Jul 2023 06:09:21 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1qGzRk-0004BX-7o for emacs-devel@gnu.org; Wed, 05 Jul 2023 06:09:17 -0400 Original-Received: from woodpecker.gentoo.org ([140.211.166.183] helo=smtp.gentoo.org) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_CHACHA20_POLY1305:256) (Exim 4.90_1) (envelope-from ) id 1qGzRi-0006Au-9w; Wed, 05 Jul 2023 06:09:15 -0400 In-Reply-To: (Ulrich Mueller's message of "Mon, 24 Aug 2020 17:23:18 +0200") Received-SPF: pass client-ip=140.211.166.183; envelope-from=ulm@gentoo.org; helo=smtp.gentoo.org X-Spam_score_int: -41 X-Spam_score: -4.2 X-Spam_bar: ---- X-Spam_report: (-4.2 / 5.0 requ) BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.devel:307453 Archived-At: >>>>> On Mon, 24 Aug 2020, Ulrich Mueller wrote: >>>>> On Mon, 24 Aug 2020, Drew Adams wrote: >> I'll just say this, as some have suggested that >> one main thing they want is to be able to easily >> and quickly tell whether the encoding is NOT >> utf-8 (and not ASCII, presumably): >> The characters "u" and "U" are not so easily >> distinguished. You might want to pick some >> other, quite different looking, character for >> the non-UTF-8 (i.e., UTF-16 etc.). > Another idea: Since "-" is used for ASCII, maybe use "+" for UTF-8? > This would be visually unobtrusive, so any uncommon coding system would > stand out against it. Coming back to this thread (which at the time ended in bikeshedding). The goal I had in mind was to disambiguate UTF-8, i.e. a unique modeline character would be used for it. Currently this is not the case: U -- utf-8* (all variants) U -- utf-16* (all variants) U -- utf-7 U -- koi8-u So, I propose to change this to either: + -- utf-8* (all variants) (everything else unchanged) or: U -- utf-8* (all variants) u -- utf-16* (all variants) u -- utf-7 K -- koi8-u Note that "K" is also used for Korean. I think that's not a real conflict, because normally it would be clear from context whether the buffer's content is Korean or Ukrainian.