From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Andreas =?utf-8?q?R=C3=B6hler?= Newsgroups: gmane.emacs.help Subject: Re: url-retrieve and utf-8 Date: Tue, 5 Feb 2008 11:50:17 +0100 Message-ID: <200802051150.18822.andreas.roehler@online.de> References: <200802041702.27763.andreas.roehler@online.de> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1202208676 25331 80.91.229.12 (5 Feb 2008 10:51:16 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 5 Feb 2008 10:51:16 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Feb 05 11:51:37 2008 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1JMLOe-0003cz-74 for geh-help-gnu-emacs@m.gmane.org; Tue, 05 Feb 2008 11:51:24 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JMLOC-0001Vx-2Z for geh-help-gnu-emacs@m.gmane.org; Tue, 05 Feb 2008 05:50:56 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1JMLLT-0007ty-OM for help-gnu-emacs@gnu.org; Tue, 05 Feb 2008 05:48:07 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1JMLLT-0007sp-5i for help-gnu-emacs@gnu.org; Tue, 05 Feb 2008 05:48:07 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JMLLS-0007sg-OI for help-gnu-emacs@gnu.org; Tue, 05 Feb 2008 05:48:06 -0500 Original-Received: from moutng.kundenserver.de ([212.227.126.174]) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1JMLLS-00075L-8U for help-gnu-emacs@gnu.org; Tue, 05 Feb 2008 05:48:06 -0500 Original-Received: from noname (p54BE8C5D.dip0.t-ipconnect.de [84.190.140.93]) by mrelayeu.kundenserver.de (node=mrelayeu6) with ESMTP (Nemesis) id 0ML29c-1JMLLO3JKD-00072q; Tue, 05 Feb 2008 11:48:02 +0100 User-Agent: KMail/1.9.5 In-Reply-To: Content-Disposition: inline X-Provags-ID: V01U2FsdGVkX1+l/34Z0Jm5Iz787dkLG1rqsnPXJhLHqFiAfgW pK8oxt7Fk3DmbwNHa7yWdZiV5iZ3Yw0dTDubm3JehFAvZU+SKN Co59o1L08iEyksBVj4EFA== X-detected-kernel: by monty-python.gnu.org: Linux 2.6? (barebone, rare!) X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:51220 Archived-At: Am Dienstag, 5. Februar 2008 04:55 schrieb William Xu: > Andreas R=C3=B6hler writes: > > =3D=3D=3D=3D=3D=3D> Space, tab, newline and formfeed <=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D > > > > are classified as > > whitespace in almost all major modes. > > If [:space:] is for the above purpose, how come it eats some non-ascii > chacracters(here, it is japanese) ? As it depends from definition of whitespace-syntax, that may happen, however indicates some error in that definition. To exclude errors here, you could use (skip-chars-backward " ") i.e. not relying at the class-matter at all AFAIU. > > Maybe due to the buffer not correctly encoded? Back to my original > question, does url-retrieve respect "Context-type, charset", headers in > a html page? Good question. Unfortunately I ignore these things. From this point of ignorance I would expect, `url-retrieve' delivering the stuff as-is, which must be parsed afterwards. Right or wrong? Should I discover more, I'll let you know. Andreas R=C3=B6hler