From: Reiner Steib
Newsgroups: gmane.emacs.devel
Subject: Re: regex encoding
Date: Tue, 01 Aug 2006 21:46:32 +0200
To: Chip Coldwell
Cc: emacs-devel@gnu.org

On Tue, Aug 01 2006, Chip Coldwell wrote:

> If your mail reader can handle UTF-8, the characters represented by
> these iso-8859-1 codes look like this:
>
> [a-zA-ZÃ„Ã–ÃœÃ¤Ã¶ÃŸÃ¼]
| [a-zA-ZÄÖÜäößü]

If you intend to send UTF-8, your MUA should not declare it as
"Content-Type: TEXT/PLAIN; charset=ISO-8859-1". ;-)

> My question is: are emacs regex character classes limited to the
> iso-8859-1 character set, or is there some way to represent Unicode
> (such as UTF-8) characters in a character class?

AFAIK, you can write the chars in UTF-8 if you specify the encoding of
the lisp file, cf.
(info "(emacs)Specify Coding"):

--8<---------------cut here---------------start------------->8---
;; -*- coding: utf-8 -*-
(defun rs-test ()
  "Search forward for one of the characters Ä, Ö, Ü, ä, ö, ß or ü."
  (interactive)
  (re-search-forward "[ÄÖÜäößü]"))
--8<---------------cut here---------------end--------------->8---

I don't know if there's a reason why this isn't used in `ispell.el'.

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo---  |  PGP key available  |  http://rsteib.home.pages.de/
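
For anyone who wants to try this without visiting a file first, here is a
minimal sketch, not a definitive implementation. The names
`rs-demo-find-umlaut' and `rs-demo-mojibake' are hypothetical and exist only
for this illustration. The first function runs the same character class
against a string in a temporary buffer. The second re-creates the garbling
seen in the quoted text; it assumes the UTF-8 bytes were displayed as
windows-1252 rather than strict ISO-8859-1, since the garbled form contains
characters that ISO-8859-1 does not have.

```elisp
;; -*- coding: utf-8 -*-
;; Illustrative sketch only; both function names are hypothetical.

(defun rs-demo-find-umlaut (text)
  "Return the first match of the class [ÄÖÜäößü] in TEXT, or nil."
  (with-temp-buffer
    (insert text)
    (goto-char (point-min))
    ;; A non-nil third argument makes `re-search-forward' return nil
    ;; instead of signalling an error when nothing matches.
    (when (re-search-forward "[ÄÖÜäößü]" nil t)
      (match-string 0))))

(defun rs-demo-mojibake (text)
  "Encode TEXT as UTF-8, then decode those bytes as windows-1252.
Assumption: this mimics a reader that trusted a wrong charset label."
  (decode-coding-string (encode-coding-string text 'utf-8)
                        'windows-1252))
```

With the file saved as UTF-8, (rs-demo-find-umlaut "Gruß") should return "ß"
and (rs-demo-find-umlaut "hello") should return nil. Keeping the coding
cookie on the first line is what lets the literal characters in the regexp
be read correctly when the file is loaded.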