unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
From: "Eli Zaretskii" <eliz@gnu.org>
Cc: emacs-devel@gnu.org
Subject: Re: utf-8 cut/paste
Date: Wed, 26 May 2004 13:32:26 +0200	[thread overview]
Message-ID: <9743-Wed26May2004133225+0300-eliz@gnu.org> (raw)
In-Reply-To: <uoeocikfe.fsf@gnu.org> (message from Sam Steingold on Tue, 25 May 2004 11:41:09 -0400)

> From: Sam Steingold <sds@gnu.org>
> Date: Tue, 25 May 2004 11:41:09 -0400
> 
> > * Eli Zaretskii <ryvm@tah.bet> [2004-05-25 14:36:07 +0200]:
> >
> > No, it doesn't.  My comment was a minor one, to help Sam avoid
> > possible confusion in the future.
> 
> I am sorry, you lost me long ago (when MULE was merged into Emacs).

I'm not sure what that comment was supposed to tell (I didn't design
MULE, nor integrated it into Emacs).  So I will just pretend it was
never written.

I simply tried to help you understand things better, assuming that you
wanted to understand; if not, feel free to disregard what's below.

> I understand what a CHARACTER is (a type in CL).
> E.g., #\C is a "LATIN CAPITAL LETTER C", or
> #\\x7f is a "CYRILLIC CAPITAL LETTER ES" (even through they might look
> similar in your font).
> I understand that there are many (partial) function between (subsets of)
> (INTEGER 0) and CHARACTER, called "encodings".
> I don't know what a "charset" is, but I would guess that it is a subset
> of CHARACTERs on which a particular encoding is defined.

That is true, but it has no direct relevance to what I was trying to
explain.

What I was trying to explain was that, taking Cyrillic characters as
an example, any single Cyrillic character can be encoded in several
different encodings.  Examples of such encodings include KOI8-R,
ISO-8859-5, and cp1251 (a.k.a. windows-1251).

The set of Cyrillic characters is what MULE calls ``a charset''.  Any
encoding of characters from that charset is what MULE calls ``a coding
system''.

cp1251 is an encoding, not a charset.  It encodes the Cyrillic charset
(MULE calls that charset cyrillic-iso8859-5).  Similarly, cp1252
encodes the latin-iso8859-1 charset, and cp1255 encodes the
hebrew-iso8859-8 charset.

I sincerely hope that helps to make things more clear.

  parent reply	other threads:[~2004-05-26 11:32 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-05-23 18:59 utf-8 cut/paste Sam Steingold
2004-05-24  9:52 ` Benjamin Riefenstahl
2004-05-24 16:18   ` Sam Steingold
2004-05-24 19:19     ` Benjamin Riefenstahl
2004-05-24 21:00       ` Sam Steingold
2004-05-24 23:10         ` Benjamin Riefenstahl
2004-05-25 13:06           ` Sam Steingold
2004-05-25  6:02       ` Eli Zaretskii
2004-05-25 10:03         ` Benjamin Riefenstahl
2004-05-25 12:36           ` Eli Zaretskii
2004-05-25 15:41             ` Sam Steingold
2004-05-26  4:22               ` Kenichi Handa
2004-05-28 17:45                 ` Sam Steingold
2004-05-29 10:04                   ` Jason Rumney
2004-05-26  4:33               ` Miles Bader
2004-05-26 18:11                 ` Sam Steingold
2004-05-26 19:23                   ` David Kastrup
2004-05-26 11:32               ` Eli Zaretskii [this message]
2004-05-26 13:31                 ` Sam Steingold
2004-05-26 12:30               ` Benjamin Riefenstahl
2004-05-26 15:48         ` Stefan Monnier
2004-05-26 18:11           ` Eli Zaretskii
2004-05-26 20:02             ` Stefan Monnier
2004-05-27  8:10               ` Eli Zaretskii

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=9743-Wed26May2004133225+0300-eliz@gnu.org \
    --to=eliz@gnu.org \
    --cc=emacs-devel@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).