all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Stefan Monnier <monnier@iro.umontreal.ca>
To: Emmanuel Bigler <Emmanuel.Bigler@ens2m.fr>
Cc: schwab@linux-m68k.org, 7962@debbugs.gnu.org, svenjoac@gmx.de
Subject: bug#7962: 23.2; capitalize letters ISO-8859-1 with diacritic signs in emacs 23.2.1
Date: Fri, 04 Feb 2011 16:34:44 -0500	[thread overview]
Message-ID: <jwvr5bnv6m2.fsf-monnier+emacs@gnu.org> (raw)
In-Reply-To: <4D4C32A3.3030906@ens2m.fr> (Emmanuel Bigler's message of "Fri, 04 Feb 2011 18:08:51 +0100")

> I think I'm starting to understand what is going on.

I think you're still confused, tho.

For some reason, you haven't replied to any of my email, even tho it's
blatantly obvious that your "default enable-multibyte-characters" is the
main culprit (and the "default" part is important here, it means that
it comes from something you've explicitly changed in your Emacs config).

> I had created a long time ago an unibyte file containing the 1-byte
> characters I want to test within emacs.

You mean an iso-8859-1 file, then.  A unibyte file only contains bytes,
no chars.

> I started /usr/local/bin/emacs -Q mytestchars-224-255-iso-8859.txt
> under emacs  23.2.93.1 (i686-pc-linux-gnu)

> The file displays perfectly correctly. (describe-char (point)) gives me
> exactly what I want, i.e. an extended asci decimal code between 224 and 255.

The code is not very helpful here, since depending on whether the
current buffer is unibyte or multibyte, the 224 or 255 doesn't mean the
same thing.  So the second line "preferred charset:" is more important,
since it should either say "eight-bit" (i.e. a raw byte with no
associated meaning of it representing some kind of character) or
"iso-8859-1".

> Almost all operations (except capitalize, see below) work exactly as I wish

So that leads me to think the buffer is in unibyte mode.
If you started with "emacs -Q", the only explanation is that you have
EMACS_UNIBYTE set in your environment variables.  If that's the case,
then please get rid of it.

> At the beginning of this discussion, Sven explained that capitalize would
> only work on 2-byte characters.

unibyte-vs-multibyte is not the same as "1-byte char"-vs-"2-byte chars".
It's an issue that's internal to Emacs and that's largely irrelevant to
how Emacs stores chars (e.g. as an array of 32bit integers, or as
a sequence of bytes, with escape sequences to represent more than 256
different values).

> Which I tested of course, and of course it works, but I simply wish
> I could continue to capitalize M-c  unibyte words like  in the good
> old iso-8859 days !!

Most likely you won't tell the difference: the multibyte mode works just
as well for iso-8859 files.  "multibyte-mode" means "we're manipulating
chars", whereas unibyte mode means "we're manipulating bytes", where
bytes are simply numbers between 0 and 255.  Now you tell me: what does
it mean to capitalize the number 224?


        Stefan





  parent reply	other threads:[~2011-02-04 21:34 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-02-02 10:35 bug#7962: 23.2; capitalize letters ISO-8859-1 with diacritic signs in emacs 23.2.1 Emmanuel Bigler
2011-02-02 19:47 ` Eli Zaretskii
2011-02-02 19:53 ` Stefan Monnier
2011-02-03 13:34   ` Emmanuel Bigler
2011-02-03 17:56     ` Eli Zaretskii
2011-02-03 18:01     ` Emmanuel Bigler
2011-02-03 18:21     ` Lennart Borgman
2011-02-03 18:49       ` Emmanuel Bigler
2011-02-03 21:19         ` Eli Zaretskii
2011-02-03 21:39           ` Lennart Borgman
2011-02-03 22:17             ` Sven Joachim
2011-02-03 22:22               ` Lennart Borgman
2011-02-03 22:35                 ` Sven Joachim
2011-02-04 10:44                 ` Andreas Schwab
2011-02-04 12:07                   ` Lennart Borgman
2011-02-04 12:41                     ` Andreas Schwab
2011-02-04 13:02                       ` Lennart Borgman
2011-02-04 13:06                         ` Andreas Schwab
2011-02-04 13:11                           ` Lennart Borgman
2011-02-04 13:21                             ` Lennart Borgman
2011-02-04 13:31                             ` Andreas Schwab
2011-02-04 13:44                               ` Lennart Borgman
2011-02-04 13:58                                 ` Andreas Schwab
2011-02-04 14:00                                   ` Lennart Borgman
2011-02-04 14:22                                     ` Andreas Schwab
2011-02-04 14:54                             ` Eli Zaretskii
2011-02-04 15:00                               ` Lennart Borgman
2011-02-04 15:20                                 ` Jason Rumney
2011-02-04 15:36                                 ` Eli Zaretskii
2011-02-04 16:36                                   ` Lennart Borgman
2011-02-04 17:08                               ` Emmanuel Bigler
2011-02-04 18:37                                 ` Eli Zaretskii
2011-02-04 21:34                                 ` Stefan Monnier [this message]
2011-02-05  9:27                                   ` bug#7962: 23.2; capitalize letters ISO-8859-1 [SOLVED, MANY THANKS TO ALL] Emmanuel Bigler
2011-02-05 16:04                                     ` Stefan Monnier
2011-02-04  8:00           ` bug#7962: 23.2; capitalize letters ISO-8859-1 with diacritic signs in emacs 23.2.1 Emmanuel Bigler
2011-02-04  8:09             ` Emmanuel Bigler
2011-02-04  8:22               ` bug#7962: 23.2; capitalize / ISO 8859 / UNIBYTE / utf-8 backward compatibility Emmanuel Bigler
2011-02-04  8:48                 ` Eli Zaretskii
2011-02-03 18:49     ` bug#7962: 23.2; capitalize letters ISO-8859-1 with diacritic signs in emacs 23.2.1 Stefan Monnier

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=jwvr5bnv6m2.fsf-monnier+emacs@gnu.org \
    --to=monnier@iro.umontreal.ca \
    --cc=7962@debbugs.gnu.org \
    --cc=Emmanuel.Bigler@ens2m.fr \
    --cc=schwab@linux-m68k.org \
    --cc=svenjoac@gmx.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.