From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.devel Subject: Re: idn.el and confusables.txt Date: Mon, 16 May 2011 13:31:44 -0500 Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos Message-ID: <87wrhq5wfj.fsf@lifelogs.com> References: <874o5rqr5z.fsf@lifelogs.com> <87mxjjpal4.fsf@lifelogs.com> <87vcy6nzan.fsf@lifelogs.com> <87tydl4sjj.fsf_-_@lifelogs.com> <87r58pghh7.fsf_-_@lifelogs.com> <83iptdg0yr.fsf@gnu.org> <87y629ien3.fsf@lifelogs.com> <83aaepfiuk.fsf@gnu.org> <87aaepi9k2.fsf@lifelogs.com> <834o4xfd34.fsf@gnu.org> <8739khi54z.fsf@lifelogs.com> <83y629dmmt.fsf@gnu.org> <8739kg7o63.fsf@lifelogs.com> <87hb8w5few.fsf@lifelogs.com> <83ei3yhlbl.fsf@gnu.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1305570735 24127 80.91.229.12 (16 May 2011 18:32:15 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 16 May 2011 18:32:15 +0000 (UTC) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon May 16 20:32:04 2011 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([140.186.70.17]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1QM2aG-0001VQ-Nx for ged-emacs-devel@m.gmane.org; Mon, 16 May 2011 20:32:00 +0200 Original-Received: from localhost ([::1]:37274 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1QM2aG-0006Bf-3J for ged-emacs-devel@m.gmane.org; Mon, 16 May 2011 14:32:00 -0400 Original-Received: from eggs.gnu.org ([140.186.70.92]:37448) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1QM2aD-0006BX-MO for emacs-devel@gnu.org; Mon, 16 May 2011 14:31:58 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1QM2aC-0004nA-DA for emacs-devel@gnu.org; Mon, 16 May 2011 14:31:57 -0400 Original-Received: from lo.gmane.org ([80.91.229.12]:35881) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1QM2aB-0004n6-VT for emacs-devel@gnu.org; Mon, 16 May 2011 14:31:56 -0400 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1QM2aB-0001Tj-8e for emacs-devel@gnu.org; Mon, 16 May 2011 20:31:55 +0200 Original-Received: from 38.98.147.130 ([38.98.147.130]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 16 May 2011 20:31:55 +0200 Original-Received: from tzz by 38.98.147.130 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 16 May 2011 20:31:55 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 46 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 38.98.147.130 X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6; d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" User-Agent: Gnus/5.110018 (No Gnus v0.18) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:tiu/VMW48jmCiJ8+tUrJtfuFVWE= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-Received-From: 80.91.229.12 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:139433 Archived-At: On Mon, 16 May 2011 15:38:38 +0300 Eli Zaretskii wrote: >> From: Ted Zlatanov >> Date: Sun, 15 May 2011 07:14:47 -0500 >> EZ> If we were to implement the full IDNA protocol, would the above be EZ> enough? Or will we need additional information? >> >> Oh, all this has been for confusables (TR39) only. IDNA and uni-idn.el >> will have their own needs! EZ> Granted, but I was asking whether IDNA implementation will need EZ> anything from confusables.txt beyond what we discussed. That is, will EZ> a mapping of a character to a string be all IDNA needs to use the EZ> information in confusables.txt? I can't think of any other uses, so yes. Lennart may want to comment. >> IDNA is better described in http://unicode.org/reports/tr46/ and the >> links at the end of that document (a whole bunch of RFCs). I'm not >> interested in implementing the IDNA code beyond supporting the current >> character set detection because I don't think IDNA is popular enough, >> but maybe Lennart and others want to do it. >> >> For further possible markchars.el functionality, take a look at >> http://www.unicode.org/reports/tr36/ (Unicode Security Considerations). >> It talks about the confusables issues, IDNA issues, and bidi issues >> among others. It's a really good explanation of what security-related >> functionality is needed from the confusables char-table and potentially >> other places in Emacs. EZ> Yes, I'm familiar with these specs, but I didn't try to design an EZ> implementation of IDNA, so the answers to the above questions are not EZ> clear to me. If someone can explain how would an IDNA implementation EZ> use confusables.txt, we could make sure that the tables we produce EZ> from it will be good for IDNA as well, if and when someone will want EZ> to implement it. I may be misunderstanding, but I think you're sort of implying that IDNA (in domain names and URLs) is the main use for confusables.txt. I think file names, shell interactions, and general text (especially source code) are also important uses of that data because confusables in those settings can introduce security issues. Sorry if this is clear to you and everyone else. Ted