From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Drew Adams" Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs Subject: RE: [sdl.web@gmail.com: 23.0.0; (thing-at-point 'url) returns invalid urls] Date: Sat, 1 Sep 2007 09:39:02 -0700 Message-ID: References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1188664836 28275 80.91.229.12 (1 Sep 2007 16:40:36 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 1 Sep 2007 16:40:36 +0000 (UTC) To: "Emacs-Pretest-Bug" Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sat Sep 01 18:40:32 2007 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1IRW1O-0005p1-KT for ged-emacs-devel@m.gmane.org; Sat, 01 Sep 2007 18:40:30 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1IRW1N-0000ae-K3 for ged-emacs-devel@m.gmane.org; Sat, 01 Sep 2007 12:40:29 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1IRW1J-0000Vl-ON for emacs-devel@gnu.org; Sat, 01 Sep 2007 12:40:25 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1IRW1I-0000Tx-M4 for emacs-devel@gnu.org; Sat, 01 Sep 2007 12:40:24 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1IRW1I-0000Ta-60 for emacs-devel@gnu.org; Sat, 01 Sep 2007 12:40:24 -0400 Original-Received: from fencepost.gnu.org ([140.186.70.10]) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1IRW1H-0005jo-Ha for emacs-devel@gnu.org; Sat, 01 Sep 2007 12:40:23 -0400 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by fencepost.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1IRW15-0004f9-RZ for emacs-pretest-bug@gnu.org; Sat, 01 Sep 2007 12:40:11 -0400 Original-Received: from Debian-exim by monty-python.gnu.org with spam-scanned (Exim 4.60) (envelope-from ) id 1IRW1E-0005iI-R0 for emacs-pretest-bug@gnu.org; Sat, 01 Sep 2007 12:40:23 -0400 Original-Received: from rgminet01.oracle.com ([148.87.113.118]) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1IRW1E-0005hu-Fr for emacs-pretest-bug@gnu.org; Sat, 01 Sep 2007 12:40:20 -0400 Original-Received: from agmgw2.us.oracle.com (agmgw2.us.oracle.com [152.68.180.213]) by rgminet01.oracle.com (Switch-3.2.4/Switch-3.1.6) with ESMTP id l81GeHAR021531 for ; Sat, 1 Sep 2007 10:40:17 -0600 Original-Received: from acsmt350.oracle.com (acsmt350.oracle.com [141.146.40.150]) by agmgw2.us.oracle.com (Switch-3.2.0/Switch-3.2.0) with ESMTP id l81GcFdY021428 for ; Sat, 1 Sep 2007 10:40:17 -0600 Original-Received: from dhcp-amer-whq-csvpn-gw3-141-144-80-148.vpn.oracle.com by acsmt351.oracle.com with ESMTP id 3177295221188664745; Sat, 01 Sep 2007 09:39:05 -0700 X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0) Importance: Normal In-Reply-To: X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3138 X-Brightmail-Tracker: AAAAAQAAAAI= X-Brightmail-Tracker: AAAAAQAAAAI= X-Whitelist: TRUE X-Whitelist: TRUE X-Detected-Kernel: Linux 2.4-2.6 X-Detected-Kernel: Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:77547 gmane.emacs.pretest.bugs:19683 Archived-At: > Stefan gets my idea right. It is definitely not about aliveness. Fine. So what is the heuristic to use to recognize something that "is likely to be meant as a URL"? Presence of a URL scheme (e.g. http://, ftp://)? Presence of a URL scheme or "www." (e.g. www.whatever.anything)? There are already regexps defined to recognize URLs, with and without schemes. How should they be used or modified? And what to return when probably-intended-URL recognition fails? nil? Whatever is currently at point, without prepending http://? Should http:// ever be prepended (e.g. if "www." satisfies the test for likely URL, as in www.google.com)? Let's stop being so vague and go beyond saying things like (1) just DTRT and (2) we'll have a heuristic that recognizes TRT. Which value do you want returned for which text at point? And what heuristic do you propose to use to recognize a likely URL intention? The only difficult problem seen so far is knowing what is being requested. It's not an alligator. It's bigger than a breadbox. It doesn't contain chlorophyll. OK, so what is it?