From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Oliver Scholz Newsgroups: gmane.emacs.devel Subject: Re: [PATCH] Unicode Lisp reader escapes Date: Thu, 18 May 2006 11:12:17 +0200 Message-ID: <87d5eb7l3i.fsf@gmx.de> References: <17491.34779.959316.484740@parhasard.net> <87iroarr9i.fsf-monnier+emacs@gnu.org> <87d5egrb4c.fsf-monnier+emacs@gnu.org> <87ves8p0us.fsf-monnier+emacs@gnu.org> <87ves8ngtb.fsf@gmx.de> <87u07qcnaa.fsf@gmx.de> <87ves5117y.fsf@gmx.de> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1147943645 12962 80.91.229.2 (18 May 2006 09:14:05 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 18 May 2006 09:14:05 +0000 (UTC) Cc: emacs-devel@gnu.org, monnier@iro.umontreal.ca, handa@m17n.org, Oliver Scholz Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu May 18 11:14:02 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1FgeZw-0006p5-G9 for ged-emacs-devel@m.gmane.org; Thu, 18 May 2006 11:13:57 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FgeZv-0008KM-Ca for ged-emacs-devel@m.gmane.org; Thu, 18 May 2006 05:13:55 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1FgeZf-0008Gd-Ri for emacs-devel@gnu.org; Thu, 18 May 2006 05:13:39 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1FgeZd-0008Bs-Ma for emacs-devel@gnu.org; Thu, 18 May 2006 05:13:38 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FgeZd-0008Bc-Hc for emacs-devel@gnu.org; Thu, 18 May 2006 05:13:37 -0400 Original-Received: from [213.165.64.20] (helo=mail.gmx.net) by monty-python.gnu.org with smtp (Exim 4.52) id 1Fgecg-0003I6-O0 for emacs-devel@gnu.org; Thu, 18 May 2006 05:16:47 -0400 Original-Received: (qmail invoked by alias); 18 May 2006 09:13:34 -0000 Original-Received: from dslb-084-058-172-184.pools.arcor-ip.net (EHLO localhost.localdomain.gmx.de) [84.58.172.184] by mail.gmx.net (mp005) with SMTP; 18 May 2006 11:13:34 +0200 X-Authenticated: #1497658 Original-To: rms@gnu.org In-Reply-To: (Richard Stallman's message of "Wed, 17 May 2006 16:09:08 -0400") User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/23.0.0 (gnu/linux) X-Y-GMX-Trusted: 0 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:54695 Archived-At: Richard Stallman writes: > > What does "fragmentation" mean? I do not recall seeing that term > > in this context. > > It's the opposite of unification. In this context it can mean two > different things: > > Thanks. > > 1. Undo the effects of `unify-8859-on-decoding' mode. That is, wenn > decoding some encodings like cyrillic or some ISO 8859 encodings, > then decode them to characters from appropriate mule charsets (e.g. > `greek-iso8859-7') rather than to characters from the charset > `mule-unicode-0100-24ff'. This is the default. > > Don't you mean "Don't perform the actions of `unify-8859-on-decoding' > mode?" Both actually. This is what Emacs does by default. But there's also a function `ucs-fragment-8859' that, when called with its second argument non-nil, reverses the effects of unification on decoding. This is how `unify-8859-on-decoding-mode' is implemented: (define-minor-mode unify-8859-on-decoding-mode "Set up translation-tables for unifying ISO 8859 characters on decoding. [...]" :group 'mule :global t :init-value nil (if unify-8859-on-decoding-mode (ucs-unify-8859 nil t) (ucs-fragment-8859 nil t))) > 2. When decoding UCS encodings like UTF-8, decode characters from > certain reperoires, e.g. "Greek", to different mule charsets like > `greek-iso8859-7'. The default is to decode them all to characters > from `mule-unicode-0100-24ff'. A user can turn this behaviour on by > customizing `utf-fragment-on-decoding'. > > That one isn't relevant to the problem we need to solve now. Sorry, I was not aware that this decision was definitive. Oliver --=20 Oliver Scholz 29 Flor=C3=A9al an 214 de la R=C3=A9volution Ostendstr. 61 Libert=C3=A9, Egalit=C3=A9, Fraternit=C3=A9! 60314 Frankfurt a. M.=20=20=20=20=20=20=20