From: Andreas Röhler
Newsgroups: gmane.emacs.help
Subject: Re: url-retrieve and utf-8
Date: Mon, 4 Feb 2008 17:02:27 +0100
Message-ID: <200802041702.27763.andreas.roehler@online.de>
To: help-gnu-emacs@gnu.org
Cc: William Xu

On Monday, 4 February 2008 at 13:43, William Xu wrote:
> William Xu writes:
> > At present, I tried to call:
> >
> >   (decode-coding-string (buffer-string) 'utf-8)
> >
> > But the result is only partially correct. For example, when there is a
> > mix of ASCII and Japanese characters, it only returns the ASCII part.
>
> For this, it is because I have called (skip-chars-backward
> "[[:space:]]") before decode-coding-string, and apparently
> skip-chars-backward seems to have mistaken some non-ASCII characters
> for whitespace.

AFAICS that's not a mistake, that's how it is implemented.
See Elisp info node 34.3.1.2, Character Classes:

`[:space:]'
     This matches any character that has whitespace syntax (*note ....

Here is a table of syntax classes, the characters that stand for
them, their meanings, and examples of their use.

 -- Syntax class: whitespace character
     "Whitespace characters" (designated by ` ' or `-') separate
     symbols and words from each other.  Typically, whitespace
     characters have no other syntactic significance, and multiple
     whitespace characters are syntactically equivalent to a single
     one.
     ======> Space, tab, newline and formfeed <======
     are classified as whitespace in almost all major modes.

;;;;;;;

[:blank:] should DTRT.

Andreas Röhler
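
P.S. A minimal sketch of the difference between the two classes, evaluated in a temporary buffer. `my-skip-back' is a hypothetical helper introduced only for illustration; it does not appear in the thread. Note that the skip-chars functions take the inside of a bracket expression, so the class is written as "[:blank:]" without outer brackets.

```elisp
;; Hypothetical helper, for illustration only (not from the thread).
(defun my-skip-back (text class)
  "Insert TEXT into a temp buffer and skip CLASS backward from the end.
Return the resulting value of point."
  (with-temp-buffer
    (insert text)
    (skip-chars-backward class)
    (point)))

;; Trailing space and tab are skipped; point lands right after "abc".
(my-skip-back "abc \t" "[:blank:]")    ; => 4

;; [:blank:] does not match newline, so point stays at the end.
(my-skip-back "abc \t\n" "[:blank:]")  ; => 7

;; [:space:] consults the buffer's syntax table instead, so the result
;; depends on which characters have whitespace syntax in that buffer.
(my-skip-back "abc \t\n" "[:space:]")
```

Because [:space:] depends on the syntax table in effect, the same call can behave differently from one major mode to another, whereas [:blank:] does not look at the syntax table at all.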