From: Eli Zaretskii <eliz@gnu.org>
To: Stefan Monnier <monnier@iro.umontreal.ca>,
	Kenichi Handa <handa@m17n.org>
Cc: 2497@emacsbugs.donarmstrong.com, uwe.siart@tum.de
Subject: bug#2497: 23.0.91; Fails to read UTF-8 on Win2k
Date: Sat, 28 Feb 2009 12:49:58 +0200
Message-ID: <uab86q1ih.fsf@gnu.org>
In-Reply-To: <jwv4oyf18le.fsf-monnier+emacsbugreports@gnu.org>

> From: Stefan Monnier <monnier@iro.umontreal.ca>
> Cc: 2497@emacsbugs.donarmstrong.com,  uwe.siart@tum.de
> Date: Fri, 27 Feb 2009 23:40:01 -0500
> 
> >> It works with "C-x RET c utf-8 RET" immediately prior to "C-x C-f".
> >> > If it does, then the problem is with guessing the encoding, not with
> >> > decoding it.
> >> That's also my impression.
> >> > Also, what is the default value of buffer-file-coding-system, and was
> >> > it the same in 23.0.90?
> >> iso-latin-1-dos in 23.0.90 and in 23.0.91.
> > Then you shouldn't expect Emacs to guess UTF-8 encoding correctly in
> > every single instance.  Distinguishing between UTF-8 and Latin-1 is
> 
> The guessing shouldn't give priority to buffer-file-coding-system.
> Instead, we have set-coding-system-priority for that.

Please give me some credit: I said ``the _default_value_ of
buffer-file-coding-system''.  That default speaks volumes about the
coding-system priorities.

> And IIUC utf-8 should always have a pretty high priority

With today's CVS on a Windows XP machine I get this:

  M-: (coding-system-priority-list) RET
  =>  (iso-latin-1 utf-8 iso-2022-7bit iso-2022-7bit-lock iso-2022-8bit-ss2 emacs-mule raw-text iso-2022-jp in-is13194-devanagari chinese-iso-8bit utf-8-auto utf-8-with-signature utf-16 utf-16be-with-signature utf-16le-with-signature utf-16be utf-16le japanese-shift-jis undecided)

So UTF-8 is indeed ``pretty high'', but lower than the locale's
default.
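
To illustrate why the order of that list matters, here is a minimal
Python sketch of priority-ordered guessing.  It is a simplified model,
not Emacs's actual detection code: the function name guess_encoding and
the "first codec that decodes cleanly wins" rule are assumptions made
for illustration only.

```python
from typing import Sequence


def guess_encoding(data: bytes, priority: Sequence[str]) -> str:
    """Return the first codec in `priority` that decodes `data` without error.

    Hypothetical helper: a toy model of priority-based detection,
    not the algorithm Emacs uses.
    """
    for name in priority:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue
        return name
    return "undecided"


# UTF-8 bytes for a word with non-ASCII letters.
sample = "Grüße".encode("utf-8")

# Latin-1 accepts every byte value, so it wins whenever it is tried first.
latin1_first = guess_encoding(sample, ["latin-1", "utf-8"])
utf8_first = guess_encoding(sample, ["utf-8", "latin-1"])

print(latin1_first)
print(utf8_first)
```

In this toy model, a codec that accepts every byte sequence shadows
everything listed after it, so only the ordering decides the result.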

> So this still looks like a real bug.

Perhaps it is, but I didn't know that Emacs 23 could reliably
distinguish between Latin-1 and UTF-8, even when UTF-8 sequences are
present in the text.  Can we do that reliably?  Perhaps Handa-san can
shed some light on this.
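
As a rough illustration of the reliability question, the Python sketch
below uses a common heuristic: treat data as UTF-8 only if it contains
non-ASCII bytes and decodes strictly as UTF-8.  The helper name
looks_like_utf8 is hypothetical, and nothing here describes what Emacs
does internally.  The last sample shows why the heuristic cannot be
fully reliable: some legitimate Latin-1 text is also valid UTF-8.

```python
def looks_like_utf8(data: bytes) -> bool:
    """Heuristic: non-ASCII data that decodes strictly as UTF-8.

    Hypothetical helper for illustration; pure-ASCII input returns
    False because it carries no evidence either way.
    """
    if data.isascii():
        return False
    try:
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return False
    return True


utf8_bytes = "Grüße".encode("utf-8")      # multibyte UTF-8 sequences
latin1_bytes = "Grüße".encode("latin-1")  # single bytes 0xFC and 0xDF
ambiguous = "Ã¼".encode("latin-1")        # bytes C3 BC, also UTF-8 for "ü"

print(looks_like_utf8(utf8_bytes))
print(looks_like_utf8(latin1_bytes))
print(looks_like_utf8(ambiguous))
```

Typical Latin-1 text with accented letters fails strict UTF-8 decoding,
so the check works well in practice, but the ambiguous sample passes it
even though it was produced as Latin-1.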
