From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Lennart Borgman Newsgroups: gmane.emacs.devel Subject: Re: highlighting non-ASCII characters Date: Wed, 24 Mar 2010 17:21:40 +0100 Message-ID: References: <87vdcngws4.fsf@mail.jurta.org> <8739zryv6l.fsf_-_@lifelogs.com> <6932BBFEB09A4BA09156ED7F598569CE@us.oracle.com> <87pr2uv8e1.fsf@lifelogs.com> <87aatyuj9s.fsf@lifelogs.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1269447751 10993 80.91.229.12 (24 Mar 2010 16:22:31 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 24 Mar 2010 16:22:31 +0000 (UTC) Cc: emacs-devel@gnu.org To: Ted Zlatanov Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Wed Mar 24 17:22:26 2010 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1NuTLU-0000C1-Vn for ged-emacs-devel@m.gmane.org; Wed, 24 Mar 2010 17:22:17 +0100 Original-Received: from localhost ([127.0.0.1]:60710 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NuTLU-00039S-5w for ged-emacs-devel@m.gmane.org; Wed, 24 Mar 2010 12:22:16 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NuTLO-00038a-D9 for emacs-devel@gnu.org; Wed, 24 Mar 2010 12:22:10 -0400 Original-Received: from [140.186.70.92] (port=38441 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NuTLM-00036m-QS for emacs-devel@gnu.org; Wed, 24 Mar 2010 12:22:09 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1NuTLG-00024U-2v for emacs-devel@gnu.org; Wed, 24 Mar 2010 12:22:08 -0400 Original-Received: from mail-fx0-f225.google.com ([209.85.220.225]:57296) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1NuTLF-00024M-Ug for emacs-devel@gnu.org; Wed, 24 Mar 2010 12:22:02 -0400 Original-Received: by fxm25 with SMTP id 25so81968fxm.26 for ; Wed, 24 Mar 2010 09:22:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:cc:content-type :content-transfer-encoding; bh=xHqDYSuP9yfNJ+B3trIwR4cryXjQ8H8QuJkq9Wx5hIU=; b=vLOjL3642v1YCcfuP2NiIlCDJwm7VFzMUe0b5nLxAVafZOQXaxd45ic5LrDT25haaU LHkJjE9PRlJ0gHzwUZttdfngEEVrEyJTrvXy39pzfpgzrHU3DBbJyb4qtAY1tKSPMXdV 6cKHVx+2+Mchcs+znczEV3ubE5Qx1ECX5O0xI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type:content-transfer-encoding; b=QIJAUJDTp27S0nV+2LOSiYbsg2mV/yDyhjH7XHHkoBlWLfk2rjN7ptEPY8XB9uWhPF ClqqSVXEwWtTGDqeZABAlzRKV4N277Z4tPvdlcFZ9btiVkbw6nTUHXLXbS7MiQzJICiH 02Y1pVh8TY1S8aSNEG50vA/EZEamHRoP5tmUw= Original-Received: by 10.239.141.65 with SMTP id b1mr413819hba.28.1269447720167; Wed, 24 Mar 2010 09:22:00 -0700 (PDT) In-Reply-To: <87aatyuj9s.fsf@lifelogs.com> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:122611 Archived-At: 2010/3/24 Ted Zlatanov : > On Wed, 24 Mar 2010 14:00:47 +0900 "Stephen J. Turnbull" wrote: > > SJT> There were long threads on Python-dev about this with respect to the > SJT> PEPs implementing Unicode. =C2=A0The bottom line was basically that = the > SJT> recommendations of the Unicode Security Considerations UTR #36 shoul= d > SJT> be followed with respect to "characters that may not be what they lo= ok > SJT> like". > > This is relevant, thanks for the pointer. =C2=A0See > > http://unicode.org/reports/tr36/ > > which links to: > > http://www.unicode.org/reports/tr39/#Confusable_Detection > > which can also be used to build a table of homoglyphs (as in http://homog= lyphs.net). Maybe "Recommended Identifier Profiles for IDN" should be implemented in Emacs? (See http://www.unicode.org/reports/tr39/data/idnchars.txt) How about a bool vector (see make-bool-vector) for this?