unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
* codepage.el: Codepage Mapping, Latin-7
@ 2006-03-16 14:35 Reiner Steib
  2006-03-17 10:48 ` Eli Zaretskii
  0 siblings, 1 reply; 2+ messages in thread
From: Reiner Steib @ 2006-03-16 14:35 UTC (permalink / raw)
  Cc: Eli Zaretskii

Hi,

from lisp/international/codepage.el:

--8<---------------cut here---------------start------------->8---
;; Support for the Windows 12xx series of codepages that MS has
;; butchered from the ISO-8859 specs. This does not add support for
;; the extended characters that MS has added in the 128 - 159 coding
;; range, only translates those characters that can be expressed in
;; the corresponding iso-8859 charset.

;; Codepage Mapping:
;;
;; Windows-1250: ISO-8859-2 (Central Europe) - differs in some positions
;; Windows-1251: ISO-8859-5 (Cyrillic)       - differs wildly
;; Windows-1252: ISO-8859-1 (West Europe)    - exact match
;; Windows-1253: ISO-8859-7 (Greek)          - differs in some positions
;; Windows-1254: ISO-8859-9 (Turkish)        - exact match
;; Windows-1255: ISO-8859-8 (Hebrew)         - exact match
;; Windows-1256: ISO-8859-6 (Arabic)         - half match
;; Windows-1257: ISO-8859-4 (Baltic)         - differs, future Latin-7
;; Windows-1258: VISCII (Vietnamese)         - Completely different
--8<---------------cut here---------------end--------------->8---

I think "exact match" is a little bit misleading, because
e.g. Windows-1252 is a superset of ISO-8859-1.  When reading the
comment on the "128 - 159 coding range" carefully it becomes more
clear the mapping table only refers to ISO-8859 code points.


BTW: The comment in `codepage.el' says that Latin-7 isn't official and
the it's not supported by Emacs.  This doesn't seem to be correct
anymore.

,----[ codepage.el ]
| ;; Since Latin-7 is not yet official, and Emacs does not support it,
| ;; provide translation between Windows-1257 and Latin-4 the best we
| ;; can.
`----

,----[ code-pages.el ]
| ;; used by lt_LT, lv_LV, mi_NZ
| ;;;###autoload(autoload-coding-system 'iso-8859-13 '(require 'code-pages))
| (cp-make-coding-system
|  iso-latin-7
| [...]
|  "Unicode-based encoding for Latin-7 (MIME: ISO-8859-13)"
|  ?l) ;; Lithuanian/Latvian
| (coding-system-put 'iso-latin-7 'mime-charset 'iso-8859-13)
| (define-coding-system-alias 'iso-8859-13 'iso-latin-7)
| (define-coding-system-alias 'latin-7 'iso-latin-7)
`----

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo---  |  PGP key available  |  http://rsteib.home.pages.de/

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: codepage.el: Codepage Mapping, Latin-7
  2006-03-16 14:35 codepage.el: Codepage Mapping, Latin-7 Reiner Steib
@ 2006-03-17 10:48 ` Eli Zaretskii
  0 siblings, 0 replies; 2+ messages in thread
From: Eli Zaretskii @ 2006-03-17 10:48 UTC (permalink / raw)
  Cc: emacs-devel

> From: Reiner Steib <reinersteib+gmane@imap.cc>
> Cc: Eli Zaretskii <eliz@gnu.org>
> Date: Thu, 16 Mar 2006 15:35:57 +0100
> 
> from lisp/international/codepage.el:
> 
> --8<---------------cut here---------------start------------->8---
> ;; Support for the Windows 12xx series of codepages that MS has
> ;; butchered from the ISO-8859 specs. This does not add support for
> ;; the extended characters that MS has added in the 128 - 159 coding
> ;; range, only translates those characters that can be expressed in
> ;; the corresponding iso-8859 charset.
> 
> ;; Codepage Mapping:
> ;;
> ;; Windows-1250: ISO-8859-2 (Central Europe) - differs in some positions
> ;; Windows-1251: ISO-8859-5 (Cyrillic)       - differs wildly
> ;; Windows-1252: ISO-8859-1 (West Europe)    - exact match
> ;; Windows-1253: ISO-8859-7 (Greek)          - differs in some positions
> ;; Windows-1254: ISO-8859-9 (Turkish)        - exact match
> ;; Windows-1255: ISO-8859-8 (Hebrew)         - exact match
> ;; Windows-1256: ISO-8859-6 (Arabic)         - half match
> ;; Windows-1257: ISO-8859-4 (Baltic)         - differs, future Latin-7
> ;; Windows-1258: VISCII (Vietnamese)         - Completely different
> --8<---------------cut here---------------end--------------->8---
> 
> I think "exact match" is a little bit misleading, because
> e.g. Windows-1252 is a superset of ISO-8859-1.  When reading the
> comment on the "128 - 159 coding range" carefully it becomes more
> clear the mapping table only refers to ISO-8859 code points.

Yes, the match is "exact" only in one direction: from ISO-8859
encoding to the codepage, but not the other way around.

It is meaningless to say that Windows-1252 is a ``superset'' of
ISO-8859-1, since these are encodings, not character sets.  The
underlying Emacs character set does not include characters which can
be encoded by Windows-1252, but codepage.el is only intersted in
encoding the ISO-8859-1 charset, not anything beyond that.

> BTW: The comment in `codepage.el' says that Latin-7 isn't official and
> the it's not supported by Emacs.  This doesn't seem to be correct
> anymore.

codepage.el is obsolete and is not developed anymore.  It is only used
by the MS-DOS port of Emacs; everything else uses code-pages.el.
codepage.el will die a natural death when the MS-DOS port dies,
probably when v23.1 is released.

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2006-03-17 10:48 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-03-16 14:35 codepage.el: Codepage Mapping, Latin-7 Reiner Steib
2006-03-17 10:48 ` Eli Zaretskii

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).