From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Micah Cowan Newsgroups: gmane.emacs.help Subject: Re: chinese encoded in UTF-8 and XML Date: 25 Sep 2003 19:52:32 -0700 Organization: CalWeb Internet Services, Inc. Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Message-ID: References: NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=big5 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1064544474 21432 80.91.224.253 (26 Sep 2003 02:47:54 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Fri, 26 Sep 2003 02:47:54 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Sep 26 04:47:53 2003 Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1A2ie8-0006Hd-00 for ; Fri, 26 Sep 2003 04:47:53 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.22) id 1A2icL-0003DO-DV for geh-help-gnu-emacs@m.gmane.org; Thu, 25 Sep 2003 22:46:01 -0400 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!news-spur1.maxwell.syr.edu!news.maxwell.syr.edu!newshosting.com!news-xfer1.atl.newshosting.com!posted1.newshosting.com!not-for-mail Original-Newsgroups: gnu.emacs.help,comp.text.xml User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.1 Original-Lines: 30 Original-NNTP-Posting-Host: 22a38379.news.newshosting.com Original-X-Trace: DXC=N7; 0:YZo83nah]44k8PLKk7gYX5fdemc``>B^[YMIhmfGhdERAfY; >gmHV`@[463WoB; AU List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.help:12811 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:12811 Knackeback writes: > Hi, I wrote a XML file with GNU emacs 21.2.2 and with > chinese character content encoded in UTF-8. > I wrote something like: > > > > ¼» > ÄÎ > > > and then I used "C-x RET f" and then I choosed utf-8. > Then I typed "C-x C-s" to save my file. > I hope this is the right way in emacs to store the content > as UTF-8 encoded text ?! > Now I tried to parse the file with xmllint. xmllint is a > small xml-parser program which comes with libxml2. > The parser complains that the second "chinese line" is not proper > UTF-8. > > ==> FWICT, Emacs doesn't have a chinese input method which supports unicode output... :-( ...I've had similar troubles with Japanese. I've also noted that, e.g. for greek, there are input methods which explicitly support unicode, and others which do not. -Micah