unofficial mirror of emacs-devel@gnu.org 
From: Ralf Angeli <angeli@iwi.uni-sb.de>
Subject: Re: [angeli@iwi.uni-sb.de: Coding problem with Euro sign]
Date: Thu, 15 Dec 2005 17:20:09 +0100
Message-ID: <dns53n$dk4$1@sea.gmane.org>
In-Reply-To: dnqh85$okp$1@sea.gmane.org

* Kevin Rodgers (2005-12-15) writes:

> Ralf Angeli wrote:
>> * Kevin Rodgers (2005-12-14) writes:
>>>I think the OP is confused: 
>> 
>> Was confused.  That was cleared up on emacs-pretest-bug.
>
> Good!  I hope you didn't take offense at my remark.

Oh well ... something like that was to be expected as my knowledge
about coding systems is only improving slowly. (c:

>>>And the OP should try visiting the file with the cp1252 coding system.
>> 
>> Well, the question now is if it is possible for Emacs to figure out
>> the coding system by itself with the example at hand.
>
> You could try something like this:
>
> (setq auto-coding-regexp-alist
>        (cons '("[\040-\177][\200-\237]" . cp1252)
>              auto-coding-regexp-alist))
>
> I don't think that's a general purpose solution since (1)
> auto-coding-regexp-alist actually has precedence over `-*-coding:-*-'
> file variables and (2) other encodings probably use those o200 - o237
> bytes (certainly other Microsoft Windows code pages do).

This doesn't seem to work here.  After evaluating the above form and
opening the file, I still see the byte codes of the 8-bit characters
(e.g. \200 for the Euro sign).
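
For reference, the byte pattern in the quoted form can be reproduced
outside Emacs.  The following is a minimal Python sketch, not Emacs
code; the name PATTERN and the sample bytes are made up for
illustration.  It only shows which byte sequences the regexp is meant
to catch (octal 040-177 is 0x20-0x7F, octal 200-237 is 0x80-0x9F) and
says nothing about why the form has no effect here.

```python
import re

# Octal \040-\177 is 0x20-0x7F (printable ASCII plus DEL);
# octal \200-\237 is 0x80-0x9F, where cp1252 puts the Euro sign (0x80).
PATTERN = re.compile(rb"[\x20-\x7f][\x80-\x9f]")

# Hypothetical file content: "5 " followed by the single byte 0x80.
sample = b"5 \x80"

match = PATTERN.search(sample)
print(match is not None)                       # True
print(oct(sample[-1]))                         # 0o200
print(sample.decode("cp1252") == "5 \u20ac")   # True: 0x80 is the Euro sign
```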

And a customization is actually not what I am interested in; I'd like
Emacs to figure this out by itself, out of the box.

I am not sure how common a case like the one at hand is, but it is
certainly not academic.  And if one is working with different
operating systems or exchanging files with people working on
different operating systems, the failure to detect the correct coding
system could lead to people regarding Emacs as a truly inferior piece of
software.  I can already hear them: "What?  It displays the Euro sign
as \200?  Even Notepad gets this right!"  On these grounds it may
become a bit hard to convince people that Emacs is the one true
editor.

Anyway, I tested a bit and under Windows (surprise) every application
I tried (e.g. Notepad and OpenOffice) managed to display the file
correctly.  On GNU/Linux no application got it right.  I checked with
less, more, vim, nano, pico, and OpenOffice.  Either "garbage" was
displayed or (in the case of OpenOffice) a dialog asked the user to
specify the encoding.  So it's not like Emacs isn't in good company.
Nevertheless it would be nice if Emacs got it right.  Unfortunately I
lack the knowledge to judge whether this is possible at all without
resorting to all sorts of unreliable heuristics that are costly to
implement.
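
To make the difficulty concrete, here is a rough sketch of the kind of
heuristic in question, written in Python rather than Emacs Lisp.  The
function name guess_8bit_encoding and the decision rule are
assumptions for illustration, not anything Emacs implements: if the
data contains bytes in 0x80-0x9F and decodes cleanly as cp1252, guess
cp1252; otherwise fall back to Latin-1.

```python
def guess_8bit_encoding(data: bytes) -> str:
    """Guess between cp1252 and latin-1 for single-byte 8-bit text.

    Illustrative heuristic only: bytes 0x80-0x9F are C1 control codes in
    ISO-8859-1 and rarely occur in text, while cp1252 assigns printable
    characters (such as the Euro sign at 0x80) to most of them.
    """
    has_c1_range = any(0x80 <= b <= 0x9F for b in data)
    if not has_c1_range:
        return "latin-1"
    try:
        data.decode("cp1252")
    except UnicodeDecodeError:
        # cp1252 leaves a few bytes (e.g. 0x81) undefined.
        return "latin-1"
    return "cp1252"


print(guess_8bit_encoding(b"100 \x80"))   # cp1252
print(guess_8bit_encoding(b"caf\xe9"))    # latin-1
print(guess_8bit_encoding(b"bad \x81"))   # latin-1
```

A rule like this guesses wrong for any other code page that also uses
bytes in 0x80-0x9F, which is exactly the kind of unreliability I mean.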

-- 
Ralf

