From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: William Xu Newsgroups: gmane.emacs.help Subject: url-retrieve and utf-8 Date: Mon, 04 Feb 2008 17:28:34 +0900 Organization: the Church of Emacs Message-ID: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1202113773 11015 80.91.229.12 (4 Feb 2008 08:29:33 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 4 Feb 2008 08:29:33 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon Feb 04 09:29:55 2008 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1JLwi8-0003Zv-Mp for geh-help-gnu-emacs@m.gmane.org; Mon, 04 Feb 2008 09:29:52 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JLwhg-0000ux-JI for geh-help-gnu-emacs@m.gmane.org; Mon, 04 Feb 2008 03:29:24 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1JLwhL-0000sR-3r for help-gnu-emacs@gnu.org; Mon, 04 Feb 2008 03:29:03 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1JLwhG-0000ik-EG for help-gnu-emacs@gnu.org; Mon, 04 Feb 2008 03:29:02 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JLwhG-0000iU-2n for help-gnu-emacs@gnu.org; Mon, 04 Feb 2008 03:28:58 -0500 Original-Received: from main.gmane.org ([80.91.229.2] helo=ciao.gmane.org) by monty-python.gnu.org with esmtps (TLS-1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1JLwhG-0005i8-Ko for help-gnu-emacs@gnu.org; Mon, 04 Feb 2008 03:28:58 -0500 Original-Received: from list by ciao.gmane.org with local (Exim 4.43) id 1JLwhA-0004Mm-VV for help-gnu-emacs@gnu.org; Mon, 04 Feb 2008 08:28:52 +0000 Original-Received: from gw.community-engine.co.jp ([210.255.51.230]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 04 Feb 2008 08:28:52 +0000 Original-Received: from william.xwl by gw.community-engine.co.jp with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 04 Feb 2008 08:28:52 +0000 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 15 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: gw.community-engine.co.jp User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.50 (darwin) Cancel-Lock: sha1:85H1w7SIwdLikKKAAZw2zjvsOUQ= X-detected-kernel: by monty-python.gnu.org: Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:51192 Archived-At: How to correctly handle utf-8 encoded html pages fetched by url-retrieve? Or is there a way to specify a coding system for read/write in the buffer returned by url-retrieve? At present, I tried to call: (decode-coding-string (buffer-string) 'utf-8) But the result is only partially correct. For example, when there are a mix of ascii and japanese characters, it only returns the ascii part. -- William http://williamxu.net9.org