From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: horatio@gmail.com Newsgroups: gmane.emacs.help Subject: Re: UTF-8 character question Date: Mon, 12 May 2008 01:07:26 -0700 (PDT) Organization: http://groups.google.com Message-ID: <7574be57-a5b5-4c3a-9ad7-eecfaf4c67c2@i36g2000prf.googlegroups.com> References: <6183b9c0-e6c8-41f3-a44f-52fca5d170d9@u12g2000prd.googlegroups.com> <2c3e51c1-12aa-49a4-b316-32955545650f@l28g2000prd.googlegroups.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=GB2312 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1210588142 24462 80.91.229.12 (12 May 2008 10:29:02 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 12 May 2008 10:29:02 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon May 12 12:29:39 2008 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1JvVHk-0004vs-Ry for geh-help-gnu-emacs@m.gmane.org; Mon, 12 May 2008 12:29:37 +0200 Original-Received: from localhost ([127.0.0.1]:50117 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JvVH2-0006nt-2P for geh-help-gnu-emacs@m.gmane.org; Mon, 12 May 2008 06:28:52 -0400 Original-Path: news.stanford.edu!newsfeed.stanford.edu!postnews.google.com!i36g2000prf.googlegroups.com!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 52 Original-NNTP-Posting-Host: 24.6.171.121 Original-X-Trace: posting.google.com 1210579646 27514 127.0.0.1 (12 May 2008 08:07:26 GMT) Original-X-Complaints-To: groups-abuse@google.com Original-NNTP-Posting-Date: Mon, 12 May 2008 08:07:26 +0000 (UTC) Complaints-To: groups-abuse@google.com Injection-Info: i36g2000prf.googlegroups.com; posting-host=24.6.171.121; posting-account=jNSocwoAAABrA9g2AGnMTxGj8RSLX4fO User-Agent: G2/1.0 X-HTTP-UserAgent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.14) Gecko/20080404 Firefox/2.0.0.14,gzip(gfe),gzip(gfe) Original-Xref: news.stanford.edu gnu.emacs.help:158553 X-Mailman-Approved-At: Mon, 12 May 2008 06:20:04 -0400 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:53923 Archived-At: On May 12, 12:51 am, hora...@gmail.com wrote: > On May 12, 12:14 am, Harald Hanche-Olsen wrote: > > > > > + hora...@gmail.com: > > > > I downloaded Emacs 22.2.1 for Windows, and I was pleased to find that > > > Chinese characters work "out of the box" on my computer. However, I > > > have a weird visualization problem for some characters. One example > > > is =C4=E3=C4=E3. These two characters appear the same in Firefox, in = Notepad, > > > in the file system (ie Explorer), and in various other places. > > > However, in Emacs, the character on the left appears as an empty > > > square, but the character on the right shows up as the Chinese > > > character for "you". > > > I am confused. They not only /look/ the same, they /are/ the same > > character (U+4F60). Maybe your news posting software knows what emacs > > doesn't, and has changed one of those so they are equal? > > > I'm afraid you will have to describe the difference between the two > > characters somehow. > > I used Firefox to post, and yes, it replaced one of the characters for > me. I don't know how to figure out what the encoding is for the > character Emacs is correctly displaying, but the character U+4F60 does > not display correctly in my version of Emacs. Instead, it shows up as > the empty square. There's another version of the same character that > does show up correctly in Emacs, but unfortunately it's not the one > used elsewhere. Fascinating. I just found something else out. When I save the file, and then reload it, the character that was successfully displayed earlier is now displayed as an empty box. Maybe there is only one =C4=E3 character, and sometimes Emacs can show it, and sometimes it can't. Furthermore, when I use Options->Mule->List Character Sets, some of the supported character sets are entirely empty boxes. The strange thing about that is there are definitely some characters that it shows fine, with none of these issues. It's pretty strange that for some characters, it can show the Chinese characters, and for others it can't. My guess is there's some basic option or package that I'm missing that will make the problem go away. Can you (or anyone else) copy and paste that character into an Emacs buffer? If it works, can you think of anything in your setup that I might not have done? I'll take a look myself in the meantime. Thanks for the help.