From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.devel Subject: Re: highlighting non-ASCII characters Date: Mon, 29 Mar 2010 13:38:19 -0500 Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos Message-ID: <87eij3ht2c.fsf@lifelogs.com> References: <6932BBFEB09A4BA09156ED7F598569CE@us.oracle.com> <87pr2uv8e1.fsf@lifelogs.com> <87aatyuj9s.fsf@lifelogs.com> <87pr2rj89j.fsf@lifelogs.com> <87ljdeke5k.fsf@lifelogs.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: dough.gmane.org 1269888090 32208 80.91.229.12 (29 Mar 2010 18:41:30 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 29 Mar 2010 18:41:30 +0000 (UTC) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Mar 29 20:41:25 2010 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1NwJts-00011H-Qk for ged-emacs-devel@m.gmane.org; Mon, 29 Mar 2010 20:41:25 +0200 Original-Received: from localhost ([127.0.0.1]:60136 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NwJts-0001Rf-CC for ged-emacs-devel@m.gmane.org; Mon, 29 Mar 2010 14:41:24 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NwJr9-0007qv-HG for emacs-devel@gnu.org; Mon, 29 Mar 2010 14:38:35 -0400 Original-Received: from [140.186.70.92] (port=38917 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NwJr8-0007pq-0v for emacs-devel@gnu.org; Mon, 29 Mar 2010 14:38:34 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1NwJr7-0003nL-2x for emacs-devel@gnu.org; Mon, 29 Mar 2010 14:38:33 -0400 Original-Received: from lo.gmane.org ([80.91.229.12]:40233) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1NwJr6-0003nG-Ms for emacs-devel@gnu.org; Mon, 29 Mar 2010 14:38:33 -0400 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1NwJr5-0007z0-QU for emacs-devel@gnu.org; Mon, 29 Mar 2010 20:38:31 +0200 Original-Received: from 38.98.147.130 ([38.98.147.130]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 29 Mar 2010 20:38:31 +0200 Original-Received: from tzz by 38.98.147.130 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 29 Mar 2010 20:38:31 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 30 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 38.98.147.130 X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6; d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:K3ToegvPnVJXQTViok5MYILBRk0= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:122866 Archived-At: On Fri, 26 Mar 2010 23:50:26 +0100 Lennart Borgman wrote: LB> Thanks, but what would the difference be to try confusable detection LB> instead of chars outside IDN? They are two different character sets. The IDN chars are fewer and have a specific purpose for IDN; confusable characters are any Unicode characters mapped to homoglyphs. LB> I believe marking confusable chars would also mean marking those LB> confusable chars that have been decided on in IDN. Yes, probably. But that's accidental. I still think the character classes [:idn:] (revised name from before) and [:confusable:] (or [:homoglyph:]) would make sense as a first step, then we can decide how to highlight them. LB> Perhaps they could also be handled in another way (for example helping LB> switching/rotating the confusable chars, all or one-by-one). Would LB> that be useful? I don't know what you mean by switching/rotating, sorry. LB> Still I think it could be a good idea to restrict identifier names to LB> chars within IDN. We're talking about generic text in Emacs, so I think IDN should be limited to places in Emacs where domain names show up (e.g. URLs). Ted