From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#57531: 28.1; Character encoding missing for "eo" Date: Mon, 05 Sep 2022 16:11:05 +0300 Message-ID: <83sfl6j79y.fsf@gnu.org> References: <87h71r0w5z.fsf@jonreeve.com> <83h71qqq5e.fsf@gnu.org> <878rn1p7oz.fsf@jonreeve.com> <8335d8o6ow.fsf@gnu.org> <877d2kpfe1.fsf@jonreeve.com> <83o7vwmlet.fsf@gnu.org> <875yi4p6se.fsf@jonreeve.com> <831qsrn1im.fsf@gnu.org> <6fb0e08e80e292e6c7a0@heytings.org> <83edwqkqix.fsf@gnu.org> <0b2303e39221dde4f40f@heytings.org> <83y1uyj9el.fsf@gnu.org> <0b2303e39214d25ba3d3@heytings.org> Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="1097"; mail-complaints-to="usenet@ciao.gmane.io" Cc: jonathan@jonreeve.com, 57531@debbugs.gnu.org To: Gregory Heytings Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Mon Sep 05 15:23:58 2022 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1oVC4z-00007d-Rj for geb-bug-gnu-emacs@m.gmane-mx.org; Mon, 05 Sep 2022 15:23:58 +0200 Original-Received: from localhost ([::1]:37702 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1oVC4y-0005f9-Kf for geb-bug-gnu-emacs@m.gmane-mx.org; Mon, 05 Sep 2022 09:23:56 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:34832) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oVBtT-0001l8-OU for bug-gnu-emacs@gnu.org; Mon, 05 Sep 2022 09:12:03 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:58111) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1oVBtT-0004Og-87 for bug-gnu-emacs@gnu.org; Mon, 05 Sep 2022 09:12:03 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1oVBtS-0000CN-Fm for bug-gnu-emacs@gnu.org; Mon, 05 Sep 2022 09:12:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Mon, 05 Sep 2022 13:12:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 57531 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: moreinfo Original-Received: via spool by 57531-submit@debbugs.gnu.org id=B57531.1662383505736 (code B ref 57531); Mon, 05 Sep 2022 13:12:02 +0000 Original-Received: (at 57531) by debbugs.gnu.org; 5 Sep 2022 13:11:45 +0000 Original-Received: from localhost ([127.0.0.1]:46810 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oVBt7-0000Bf-7M for submit@debbugs.gnu.org; Mon, 05 Sep 2022 09:11:45 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:58858) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oVBsz-0000At-E8 for 57531@debbugs.gnu.org; Mon, 05 Sep 2022 09:11:37 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:43530) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oVBst-0004He-1e; Mon, 05 Sep 2022 09:11:27 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org; s=fencepost-gnu-org; h=References:Subject:In-Reply-To:To:From:Date: mime-version; bh=jKb88bXBZd3hqm6FUMH8lWmBLY++ck3eZGFrSVQJkss=; b=lMuvB+eUwFqB rnvpPj7k9l0HcsFpVQdtuncYvhn4UCdovdkErYXBMGHjRhJoIWzeojiuqQ8NXSbocPQKpcfGkzECn 9amITyMBYckTdRNMPwfDHhDJybelgPkFx2eNwXgu4FPrUcnu6v5J4+IqKU1bc58NIQp1l9fmVd2ZH JjWZwz+C6yu2Nm+U2oDatvmLN20YBGJ3LPqbwDl4tLVehL2znD0YADT3ISvxtuqD3hBktbuqJXdfD yAoQMuQKBy24emM6GD4nk3FT4hNDr4ZJjR/9wGga7ikMswghdM+PaQ4ZM6jqc7bFroyU08IYPPMHc 23GupGAmVX3xT4cMGZKs9g==; Original-Received: from [87.69.77.57] (port=4683 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oVBss-0002YM-Bc; Mon, 05 Sep 2022 09:11:26 -0400 In-Reply-To: <0b2303e39214d25ba3d3@heytings.org> (message from Gregory Heytings on Mon, 05 Sep 2022 12:59:21 +0000) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:241563 Archived-At: > Date: Mon, 05 Sep 2022 12:59:21 +0000 > From: Gregory Heytings > cc: jonathan@jonreeve.com, 57531@debbugs.gnu.org > > >> I think expecting systems to be well-configured and to contain accurate > >> information about that exotic locale is a bit too optimistic. > > > > What would you suggest that Emacs does instead? > > I don't know, because anything that it could do would be backward > incompatible. The only change I could think of that is almost backward-compatible (except for this single locale) is the one I posted, if we modify it to also make the 'lang-info' pseudo-encoding override the locale.alias file. > What is clear is that, on reasonably modern systems, legacy > locales are not used anymore, and their use is discouraged (e.g. the > Debian installer does not present you with any legacy encoding, they > remain available but to activate them you need to edit the /etc/locale.gen > file manually). So perhaps Emacs could always assume UTF-8, and use > another encoding only when there are good reasons to do so (e.g. when > opening a file with a legacy encoding). The presence of the equivalence > eo / Latin-3 in locale.alias is IMO not a good enough reason. I have no idea what this kind of change could do. Maybe nothing, maybe breakage across the board. Keep in mind that the default encoding is used for stuff other than decoding text in files Emacs visits, and also for some important tasks during startup. I also think our encoding detection doesn't always succeed to discern between UTF-8 and single-byte Latin-N encodings.