From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: utf-8 cut/paste Date: 26 May 2004 11:48:32 -0400 Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: References: <9003-Tue25May2004080243+0300-eliz@gnu.org> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1085599433 8325 80.91.224.253 (26 May 2004 19:23:53 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 26 May 2004 19:23:53 +0000 (UTC) Cc: Benjamin Riefenstahl , sds@gnu.org, emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Wed May 26 21:23:42 2004 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1BT406-0003UF-00 for ; Wed, 26 May 2004 21:23:42 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1BT406-00074J-00 for ; Wed, 26 May 2004 21:23:42 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.34) id 1BT3wY-0004VI-Ji for emacs-devel@quimby.gnus.org; Wed, 26 May 2004 15:20:02 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.34) id 1BT3PU-00060Q-T4 for emacs-devel@gnu.org; Wed, 26 May 2004 14:45:53 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.34) id 1BT1FC-0004UY-5r for emacs-devel@gnu.org; Wed, 26 May 2004 12:27:37 -0400 Original-Received: from [132.204.24.67] (helo=mercure.iro.umontreal.ca) by monty-python.gnu.org with esmtp (Exim 4.34) id 1BT1Eb-0004NS-JS; Wed, 26 May 2004 12:26:29 -0400 Original-Received: from asado.iro.umontreal.ca (asado.iro.umontreal.ca [132.204.24.84]) by mercure.iro.umontreal.ca (Postfix) with ESMTP id 5D931B30358; Wed, 26 May 2004 11:48:35 -0400 (EDT) Original-Received: by asado.iro.umontreal.ca (Postfix, from userid 20848) id DBF7A8CA23; Wed, 26 May 2004 11:48:32 -0400 (EDT) Original-To: Eli Zaretskii In-Reply-To: <9003-Tue25May2004080243+0300-eliz@gnu.org> Original-Lines: 15 User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.3.50 X-DIRO-MailScanner-Information: Please contact the ISP for more information X-DIRO-MailScanner: Found to be clean X-DIRO-MailScanner-SpamCheck: n'est pas un polluriel, SpamAssassin (score=0, requis 5) X-MailScanner-From: monnier@iro.umontreal.ca X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.4 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:23971 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:23971 >> > I thought that if I use unicode (utf-8), all characters are already >> > in one set. >> In general theory, all the Unicode characters are in the Unicode >> (utf-8) set and all the cp1252 characters are in the cp1252 set. > Actually, cp1252 is not a charset, it's an encoding (a.k.a. ``coding > system''). The underlying Mule charset is latin-iso8859-1. AFAICT he was talking "in general", not "in Emacs". Emacs' notion of a charset is a mostly arbitrary internal detail. cp1252 could have been implemented as another charset rather than being mapped to a mix of 8859-1 and unicode chars. IIUC, emacs-unicode does away with this notion of a charset. Stefan