From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Thamer Mahmoud Newsgroups: gmane.emacs.help Subject: Re: How to get title of web page by url? Date: Wed, 28 Jul 2010 21:14:17 +0300 Message-ID: <87k4of324m.fsf@zemblan.newkuwait.org> References: <87vd802nx4.fsf@zemblan.newkuwait.org> <87ocdr39i7.fsf@zemblan.newkuwait.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: dough.gmane.org 1280340918 21150 80.91.229.12 (28 Jul 2010 18:15:18 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 28 Jul 2010 18:15:18 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Jul 28 20:15:16 2010 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1OeB9u-0001UT-2o for geh-help-gnu-emacs@m.gmane.org; Wed, 28 Jul 2010 20:15:14 +0200 Original-Received: from localhost ([127.0.0.1]:36707 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1OeB9t-0004eU-6m for geh-help-gnu-emacs@m.gmane.org; Wed, 28 Jul 2010 14:15:13 -0400 Original-Received: from [140.186.70.92] (port=34093 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1OeB9I-0004ce-9v for help-gnu-emacs@gnu.org; Wed, 28 Jul 2010 14:14:37 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1OeB9H-0006no-57 for help-gnu-emacs@gnu.org; Wed, 28 Jul 2010 14:14:36 -0400 Original-Received: from lo.gmane.org ([80.91.229.12]:48134) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1OeB9G-0006nS-Tr for help-gnu-emacs@gnu.org; Wed, 28 Jul 2010 14:14:35 -0400 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1OeB9E-00016O-2o for help-gnu-emacs@gnu.org; Wed, 28 Jul 2010 20:14:32 +0200 Original-Received: from 89.203.6.209 ([89.203.6.209]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 28 Jul 2010 20:14:32 +0200 Original-Received: from thamer.mahmoud by 89.203.6.209 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 28 Jul 2010 20:14:32 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 21 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 89.203.6.209 User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.2 (gnu/linux) Cancel-Lock: sha1:vxtyo5OZZfIEVEnj/QISiazUtBU= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:74328 Archived-At: > (defun www-get-page-title (url) > (let ((title)) > (with-current-buffer (url-retrieve-synchronously url) > (goto-char (point-min)) > (re-search-forward "\\([^<]*\\)" nil t 1) > (setq title (match-string 1)) > (goto-char (point-min)) > (re-search-forward "charset=\\([-0-9a-zA-Z]*\\)" nil t 1) > (decode-coding-string title (intern (match-string 1)))))) Just did a test on a wikipedia page, and looks like `decode-coding-string' doesn't handle upper-case charsets, like UTF-8, only utf-8. So the last line should be: (decode-coding-string title (intern (downcase (match-string 1))))))) -- Thamer