From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#57531: 28.1; Character encoding missing for "eo" Date: Sun, 04 Sep 2022 08:37:21 +0300 Message-ID: <831qsrn1im.fsf@gnu.org> References: <87h71r0w5z.fsf@jonreeve.com> <83h71qqq5e.fsf@gnu.org> <878rn1p7oz.fsf@jonreeve.com> <8335d8o6ow.fsf@gnu.org> <877d2kpfe1.fsf@jonreeve.com> <83o7vwmlet.fsf@gnu.org> <875yi4p6se.fsf@jonreeve.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="26491"; mail-complaints-to="usenet@ciao.gmane.io" Cc: 57531@debbugs.gnu.org To: Jonathan Reeve Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Sun Sep 04 07:38:18 2022 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1oUiKo-0006kD-81 for geb-bug-gnu-emacs@m.gmane-mx.org; Sun, 04 Sep 2022 07:38:18 +0200 Original-Received: from localhost ([::1]:34058 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1oUiKm-0002wu-R6 for geb-bug-gnu-emacs@m.gmane-mx.org; Sun, 04 Sep 2022 01:38:16 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:40594) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oUiKY-0002wi-AF for bug-gnu-emacs@gnu.org; Sun, 04 Sep 2022 01:38:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:54587) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1oUiKY-0005xb-11 for bug-gnu-emacs@gnu.org; Sun, 04 Sep 2022 01:38:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1oUiKX-0007KJ-QQ for bug-gnu-emacs@gnu.org; Sun, 04 Sep 2022 01:38:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sun, 04 Sep 2022 05:38:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 57531 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: moreinfo Original-Received: via spool by 57531-submit@debbugs.gnu.org id=B57531.166226987328148 (code B ref 57531); Sun, 04 Sep 2022 05:38:01 +0000 Original-Received: (at 57531) by debbugs.gnu.org; 4 Sep 2022 05:37:53 +0000 Original-Received: from localhost ([127.0.0.1]:43286 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oUiKP-0007Jw-3S for submit@debbugs.gnu.org; Sun, 04 Sep 2022 01:37:53 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:47938) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oUiKN-0007Ji-AC for 57531@debbugs.gnu.org; Sun, 04 Sep 2022 01:37:51 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:38968) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oUiKH-0005wc-Qy; Sun, 04 Sep 2022 01:37:45 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org; s=fencepost-gnu-org; h=MIME-version:References:Subject:In-Reply-To:To:From: Date; bh=9rHGBOLU41XuPItaU35VmfD5ZAwRsRNFddr+GfbphCY=; b=pg9UZZmrB0gHTLglhnn0 T2ay0LZ5V0/ZJKG4On6Pi7r+hd2oUDKKEL9yAuCJMY0RDeIL6Jkt5bZNh2PDgDpdWGylUCMXKaT31 otETPhq9fovNqB44PdcumWDRbroronHpF4gEFazt9m7U9FtEh/I57olCJ+J018+4i2+rOmIkgujUd JeeYdVqNRy581k/R2rIkzM6dgwhFuIYXN4g3rnP12315sO5sIp/miwIKDGnzfC+dLJ4k5+5mdj3gs oKBadxzf6XBP00CQcXyt82lqGn1fVv2WqX20uBwIOVW9yPqHEZterwaT9rE8vb2E8G2ILp5NjWkdo d+tSl07947tcSA==; Original-Received: from [87.69.77.57] (port=4184 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1oUiKH-0002nP-4g; Sun, 04 Sep 2022 01:37:45 -0400 In-Reply-To: <875yi4p6se.fsf@jonreeve.com> (message from Jonathan Reeve on Sat, 03 Sep 2022 20:00:41 +0000) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:241465 Archived-At: > Date: Sat, 03 Sep 2022 20:00:41 +0000 > From: Jonathan Reeve > Cc: 57531@debbugs.gnu.org > > > I believe this is due to the fact that the text was saved in UTF-8, > > and Emacs was trying to decode it as if it were in Latin-3. > > That’s the problem. Emacs should decode UTF-8 as UTF-8, not Latin-3. It tries, but it isn't always 100% successful. > > Using the prefer-coding-system customization should fix that. > > The user shouldn’t have to customize an encoding system to have a UTF-8 locale be interpreted as UTF-8. A UTF-8 locale should be encoded as such, without needing to be told otherwise. Ideally, I agree. But in practice, we've found that goal unreachable in some cases. > > I disagree. I think your system doesn’t tell Emacs enough to guess > > correctly. > > It does, though. The locale data is already there, which is why I only have this problem in Emacs, and nowhere else on my whole system. The problem is in this line from `locale-language-names'. Here’s what it says: > > `("eo" . "Esperanto")' > > Here’s what it should say: > > `("eo" "Esperanto" utf-8)' That's only correct for glibc systems, though, as I already explained. I found no authoritative place on the Internet which would mandate that the Esperanto locale should use or prefer UTF-8 as its encoding. Once again, glibc is just one C library on just one OS. If you can show me some authoritative source of information about this locale which says it should use UTF-8, that could be a reason good enough to make such an incompatible change. And we need good reasons for such incompatible changes, because some users out there could have configurations or applications that depend on previous behavior. > The system says that it’s a UTF-8 locale. How does it say so? > > Please try the solution I proposed, and if it doesn’t work, let’s see > > what else is needed. If you keep insisting on defaulting Esperanto to > > UTF-8, I see now way to make any progress here. > > You’re not proposing a solution, you’re proposing a workaround. Would you please try it nonetheless? > There’s no reason at all why Esperanto should be encoded in latin-3. It never has been, as far as I can tell, and it never well be, in latin-3, with the eo locale. If you can find any good reason why it should be in latin-3, I’m all ears, but as far as I can tell, this is a mistake. No, it isn't a mistake. Latin-3 was introduced to cover Esperanto (and some other languages). That's why the Emacs Esperanto locale was configured to use it. It wasn't a random choice. > Please keep in mind that /I’m trying to help improve emacs,/ Please keep in mind that so am I. For many years, as a matter of fact.