From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Xah Lee Newsgroups: gmane.emacs.help Subject: Re: where does emacs23 store the info that was UnicodeData.txt? Date: Wed, 12 Aug 2009 22:20:34 -0700 (PDT) Organization: http://groups.google.com Message-ID: References: <2dbf8b79-30c4-4abb-8506-e8527d829777@13g2000prl.googlegroups.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1250147070 27286 80.91.229.12 (13 Aug 2009 07:04:30 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 13 Aug 2009 07:04:30 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Thu Aug 13 09:04:23 2009 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1MbUMI-0007xb-Od for geh-help-gnu-emacs@m.gmane.org; Thu, 13 Aug 2009 09:04:23 +0200 Original-Received: from localhost ([127.0.0.1]:54066 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MbUMG-0001X4-Ad for geh-help-gnu-emacs@m.gmane.org; Thu, 13 Aug 2009 03:04:20 -0400 Original-Path: news.stanford.edu!newsfeed.stanford.edu!postnews.google.com!p36g2000prn.googlegroups.com!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 63 Original-NNTP-Posting-Host: 76.102.12.87 Original-X-Trace: posting.google.com 1250140835 6774 127.0.0.1 (13 Aug 2009 05:20:35 GMT) Original-X-Complaints-To: groups-abuse@google.com Original-NNTP-Posting-Date: Thu, 13 Aug 2009 05:20:35 +0000 (UTC) Complaints-To: groups-abuse@google.com Injection-Info: p36g2000prn.googlegroups.com; posting-host=76.102.12.87; posting-account=bRPKjQoAAACxZsR8_VPXCX27T2YcsyMA User-Agent: G2/1.0 X-HTTP-UserAgent: Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US) AppleWebKit/530.5 (KHTML, like Gecko) Chrome/2.0.172.39 Safari/530.5, gzip(gfe), gzip(gfe) Original-Xref: news.stanford.edu gnu.emacs.help:171936 X-Mailman-Approved-At: Thu, 13 Aug 2009 03:03:13 -0400 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:67116 Archived-At: On Aug 12, 9:20 pm, "B. T. Raven" wrote: > Xah Lee wrote: > > in emacs 22, one can put in this unicode data file to emacs so that > > what-cursor-position with C-u can give full info about a char: > > > ; load unicode data; used by what-cursor-position for showing full > > unicode info > > (setq describe-char-unicodedata-file > > (concat (file-name-directory > > (or load-file-name buffer-file-name)) > > "UnicodeData.txt" )) > > > by looking at the output, it appears to me emacs 23 now has the full > > info on unicode char without needing that file. > > > Is that true? > > > where does emacs get the data that was in UnicodeData.txt? > > > thanks. > > > Xah > > =E2=88=91http://xahlee.org/ > > > =E2=98=84 > > The internal representation in ver. 23 is now Unicode, as far as I > understand. On ver 22, =E2=98=84 would be reported as some "Emacs" byte s= equence > but on ver 23 it will give the actual U+XXXX code point for that glyph > or character or whatever you call it. yes, but am wondering where does emacs get the unicode char property? e.g., do describe-char, and you get: --------------------- character: =E2=98=84 (9732, #o23004, #x2604) preferred charset: unicode-bmp (Unicode Basic Multilingual Plane (U +0000..U+FFFF)) code point: 0x2604 syntax: w which means: word category: .:Base buffer code: #xE2 #x98 #x84 file code: #xE2 #x98 #x84 (encoded by coding system utf-8-dos) display: no font available Character code properties: customize what to show name: COMET general-category: So (Symbol, Other) --------------------- Notice the last 3 lines. These info wasn't in emacs 22, unless you install the UnicodeData.txt file. But since in my emacs 23, i took out the UnicodeData.txt, it still know the char is named COMET, as well as info like =E2=80=9Cgeneral-category: So (Symbol, Other)=E2=80=9D. So, am wo= ndering where it got it? Xah =E2=88=91 http://xahlee.org/ =E2=98=84