all messages for Emacs-related lists mirrored at yhetil.org
From: Benjamin Riefenstahl <Benjamin.Riefenstahl@epost.de>
Cc: emacs-devel@gnu.org
Subject: Re: utf-8 cut/paste
Date: Wed, 26 May 2004 14:30:37 +0200
Message-ID: <m3fz9nnzf6.fsf@seneca.benny.turtle-trading.net>
In-Reply-To: <uoeocikfe.fsf@gnu.org> (Sam Steingold's message of "Tue, 25 May 2004 11:41:09 -0400")

Hi Sam,


Note that your original problem with Cyrillic is not actually related
to MULE.  MULE may make things a bit more complicated, but the
problem is that Emacs doesn't use the Unicode APIs of Windows, which
it can do fine (and probably will at some point), with or without
MULE.  At least that holds on NT/W2K/XP; I don't know whether the
Unicode clipboard works on 9x/Me.


Sam Steingold <sds@gnu.org> writes:
> each character comes equipped with its integer encoding, and 2
> characters which are identical elements of CHARACTER, but appear in
> two different encodings (e.g., #\Ц encoded in koi8 and in alt) are
> different characters in MULE.  This is so absurd that I can hardly
> believe that anyone could ever conceive of this, let alone implement
> it.

You are presupposing that you know which "2 characters [...] are
identical elements of CHARACTER, but appear in two different
encodings."  While this knowledge seems obvious in theory, in practice
it involves quite a lot of work to formalize this unification for all
relevant charsets (i.e. for the charsets that are actually in use).

Now that this work has mostly been done in Unicode, this kind of
information is actually one of the major benefits of that standard.
So today we have a well-defined reference for things like:

> #\C is a "LATIN CAPITAL LETTER C", or #\С is a "CYRILLIC CAPITAL
> LETTER ES" (even though they might look similar in your font).

But when MULE was first implemented, Unicode was in its infancy, if I
see this right.  So at that time this knowledge wasn't available in
formal terms and in the necessary breadth.  IOW, MULE (building on
ISO-2022) was a solution at the time, while Unicode was still in the
design phase with much work to go.
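
To make the unification point concrete, here is a small sketch in
Python (not Emacs Lisp, and unrelated to how MULE or Emacs work
internally).  It assumes that "alt" in the quoted text means the DOS
"alternative" encoding, i.e. code page 866; the helper name
`describe` is made up for this illustration.  It shows that the same
letter has different byte values in the two legacy encodings but maps
to a single Unicode code point, and that Unicode gives the look-alike
Latin and Cyrillic letters distinct names.

```python
import unicodedata


def describe(ch: str) -> str:
    """Return the code point and formal Unicode name of one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


# CYRILLIC CAPITAL LETTER TSE, the letter from the quoted example.
tse = "\u0426"

# Encode it in two legacy 8-bit charsets.
# Assumption: "alt" is taken to mean code page 866.
koi8_bytes = tse.encode("koi8_r")
alt_bytes = tse.encode("cp866")

# The byte values differ, but decoding either one through the Unicode
# mapping tables yields the same character.
bytes_differ = koi8_bytes != alt_bytes
same_char = koi8_bytes.decode("koi8_r") == alt_bytes.decode("cp866")

# Two letters that may look alike in a font but are distinct characters.
latin_c = "C"
cyrillic_es = "\u0421"

if __name__ == "__main__":
    print(koi8_bytes.hex(), alt_bytes.hex(), bytes_differ, same_char)
    print(describe(tse))
    print(describe(latin_c))
    print(describe(cyrillic_es))
```

The mapping tables behind `encode` and `decode` are exactly the
formalized knowledge I mean; without a common reference like Unicode,
each pair of charsets would need its own table.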


benny
