From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Juri Linkov Newsgroups: gmane.emacs.devel Subject: Re: [PATCH] Unicode Lisp reader escapes Date: Thu, 11 May 2006 15:03:18 +0300 Organization: JURTA Message-ID: <878xp8g2a9.fsf@jurta.org> References: <17491.34779.959316.484740@parhasard.net> <17492.29148.246942.842300@parhasard.net> <8764kkawsf.fsf@jurta.org> <87vesi6nh1.fsf@jurta.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1147352573 16308 80.91.229.2 (11 May 2006 13:02:53 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 11 May 2006 13:02:53 +0000 (UTC) Cc: kehoea@parhasard.net, emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu May 11 15:02:51 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1FeAo5-0002yW-1y for ged-emacs-devel@m.gmane.org; Thu, 11 May 2006 15:02:17 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FeAo4-00053d-JD for ged-emacs-devel@m.gmane.org; Thu, 11 May 2006 09:02:16 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1FeAlb-0003DC-GC for emacs-devel@gnu.org; Thu, 11 May 2006 08:59:43 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1FeAlZ-00039p-IK for emacs-devel@gnu.org; Thu, 11 May 2006 08:59:42 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FeAlZ-00039U-Cr for emacs-devel@gnu.org; Thu, 11 May 2006 08:59:41 -0400 Original-Received: from [217.25.160.1] (helo=relay1.binet.com.ua) by monty-python.gnu.org with esmtp (Exim 4.52) id 1FeAn2-00076J-Lm; Thu, 11 May 2006 09:01:13 -0400 Original-Received: from localhost (localhost [127.0.0.1]) by relay1.binet.com.ua (Postfix) with ESMTP id B0ADB77DE4; Thu, 11 May 2006 15:59:37 +0300 (EEST) Original-Received: from mail.binet.com.ua (i45.dialup.binet.com.ua [217.25.161.109]) by relay1.binet.com.ua (Postfix) with ESMTP id B91A677DD8; Thu, 11 May 2006 15:59:32 +0300 (EEST) Original-To: rms@gnu.org In-Reply-To: (Richard Stallman's message of "Wed, 10 May 2006 23:44:27 -0400") User-Agent: Gnus/5.110004 (No Gnus v0.4) Emacs/22.0.50 (gnu/linux) X-Virus-Scanned: by ClamAv at binet.com.ua X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:54261 Archived-At: > Support for \u and \U in Emacs Lisp would be good since other _Lisp_ languages > support \uXXXX and \UXXXXXXXX as well. But other Lisp languages support > also Lisp notation for Unicode characters. I think Emacs should support > it too. In this notation Unicode characters are written as #\u3042 or > #\u0002a6b2 with the leading hash mark. > > We do not in general try to be compatible with Common Lisp on input > syntax for characters. So forget this. The initial `#' character is a valid Emacs hash notation for writing integers in various bases. After adding `\uXXXX' it seems reasonable to add `#\uXXXX' as well. However, there is one difference: Emacs Lisp hash notation doesn't use the backslash `\' after `#', e.g. `#x42', but other Lisps use the backslash in the notation of Unicode characters, e.g. `#\u3042'. I have no opinion which notation is better. > Also it would be good to support a syntax for named Unicode characters. > Common Lisp has the syntax #\euro_sign, and Perl - \N{EURO SIGN}. > > I tend to think we should not do this now. > Does Emacs have a table of these names? The variable `describe-char-unicodedata-file' points to the file `UnicodeData.txt' not distributed currently with Emacs. This could be done in the emacs-unicode branch. I think this question should be considered after the release. -- Juri Linkov http://www.jurta.org/emacs/