From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: James Cloos Newsgroups: gmane.emacs.devel Subject: Re: modify-syntax-entry and UTF8? Date: Tue, 22 May 2007 09:25:35 -0400 Message-ID: References: <4652AE2C.5030305@cis.upenn.edu> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1179840428 17057 80.91.229.12 (22 May 2007 13:27:08 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 22 May 2007 13:27:08 +0000 (UTC) Cc: emacs-devel@gnu.org To: Geoffrey Alan Washburn Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Tue May 22 15:27:06 2007 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1HqUOH-0001zb-Df for ged-emacs-devel@m.gmane.org; Tue, 22 May 2007 15:27:05 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1HqUOI-0003ry-8w for ged-emacs-devel@m.gmane.org; Tue, 22 May 2007 09:27:06 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1HqUNa-0003S3-TR for emacs-devel@gnu.org; Tue, 22 May 2007 09:26:22 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1HqUNY-0003Oz-Ug for emacs-devel@gnu.org; Tue, 22 May 2007 09:26:22 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1HqUNY-0003Og-O0 for emacs-devel@gnu.org; Tue, 22 May 2007 09:26:20 -0400 Original-Received: from eagle.jhcloos.com ([207.210.242.212]) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1HqUNY-0001ex-8C for emacs-devel@gnu.org; Tue, 22 May 2007 09:26:20 -0400 Original-Received: by eagle.jhcloos.com (Postfix, from userid 10) id B162540050; Tue, 22 May 2007 13:26:17 +0000 (UTC) DKIM-Signature: v=0.5; a=rsa-sha256; c=relaxed/relaxed; d=jhcloos.com; s=eagle; t=1179840378; bh=zF+TdEmBG92smu9Jak98xtCtLAf7wr9zW/HukQufX Cw=; h=Received:From:To:Cc:Subject:In-Reply-To:References: User-Agent:Face:Copyright:OpenPGP:OpenPGP-Fingerprint:X-Hashcash: X-Hashcash:Date:Message-ID:Lines:MIME-Version:Content-Type: Content-Transfer-Encoding; b=jPPcAFdL5+uBnAzFlUfPTvZdSYC794SlkMKnn q3l4XusulfCLeU9MpeuOyWiYvszAFq0hkrXyzrR/zPw+Umsso00kNuB3vDa7hs65KYi MULou4GJCqOMrXEFga/Z/xUf3HWOns0ic0dhAvbd4HXWpWqs+EGssBpB1X4GTdwN8q4 = Original-Received: by lugabout.jhcloos.org (Postfix, from userid 500) id A1BDA10; Tue, 22 May 2007 13:25:59 +0000 (UTC) In-Reply-To: <4652AE2C.5030305@cis.upenn.edu> (Geoffrey Alan Washburn's message of "Tue, 22 May 2007 04:47:40 -0400") User-Agent: Gnus/5.110007 (No Gnus v0.7) Emacs/23.0.0 (gnu/linux) Face: iVBORw0KGgoAAAANSUhEUgAAABAAAAAQCAYAAAAf8/9hAAAAMnRFWHRUaXRsZQAvbWF4dG9y L2Nsb29zL2luY29taW5nLzIwMDQvMDIvUFdHL0pIQzE2LnBuZyJMlEwAAAAadEVYdFNvZnR3YXJl AEdOT01FIEljb24gRWRpdG9ynioKbAAAAHFJREFUeJzFk8EKwCAMQ1/ED9cvzw6iuM0N9WIvDTWE 1KjAxmKnLBEBjLcEIBCeI2nccy5YKrjxwbbvAvZ37zlo4GB9iRMC/cq/AppItwmMyJ5I98wd9G7P xxgrSKlYS2lMrOcV13o95aVSKA7E5ndGXG+IIzcl4BNfAAAAAElFTkSuQmCC Copyright: Copyright 2007 James Cloos OpenPGP: ED7DAEA6; url=http://jhcloos.com/public_key/0xED7DAEA6.asc OpenPGP-Fingerprint: E9E9 F828 61A4 6EA9 0F2B 63E7 997A 9F17 ED7D AEA6 X-Hashcash: 1:23:070522:geoffw@cis.upenn.edu::QSUPrzCjj6V+4/Vr:00000000000000000000000000000000000000000fOH9 X-Hashcash: 1:23:070522:emacs-devel@gnu.org::eDtyrPxZBCd5x5Nk:000000000000000000000000000000000000000003Z9HG Original-Lines: 36 X-detected-kernel: Linux 2.6, seldom 2.4 (older, 2) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:71597 Archived-At: >>>>> "Geoffrey" =3D=3D Geoffrey Alan Washburn wri= tes: Geoffrey> No, what I wrote is exactly what I meant, unless the author of Geoffrey> the TeX-input method incorrectly defined \langle and \rangle. Ah. That does put a different spin on things. And in fact, the UCS has expanded since that was written, and characters were added for exactly TeX's \langle and \rlangle (and a few others in latin-ltx.el which currently point to CJK characters instead of math char= s). latin-ltx.el should be updated to use =E2=9F=A8 U+27E8 MATHEMATICAL LEFT = ANGLE BRACKET for \langle and =E2=9F=A9 U+27E9 MATHEMATICAL RIGHT ANGLE BRACKET= for \rangle. Other examples are \llbracket and \rrbracket which should be U+27E6 and U+27E7 instead of U+301A and U+301B, \ldata and \rdata (U+27EA and U+27EB instead of U+300A and U+300B), \sbs (U+29F5 instead of U+FE68). The reason is that the CJK characters in Emacs get different codepoints depending on which language, and that can prevent matching. I'm sure you are having problems matching those characters because the versions in your .el file have different buffer and/or file codes than what you are trying to match them to. What does C-uC-x=3D output when point is on the characters in your (modify-syntax-entry) calls and when point is on one of the characters you are trying to match in the buffer you are editing? What are the mode and coding-system of the buffer you are editing? What is the coding-system of the .el file? -JimC --=20 James Cloos OpenPGP: 1024D/ED7DAEA6