From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.help Subject: Re: Chracters not unified with Unicode -- any example? Date: Tue, 10 Jun 2014 20:40:04 +0300 Message-ID: <838up41x57.fsf@gnu.org> References: NNTP-Posting-Host: plane.gmane.org X-Trace: ger.gmane.org 1402422355 15695 80.91.229.3 (10 Jun 2014 17:45:55 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 10 Jun 2014 17:45:55 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Jun 10 19:45:48 2014 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WuQ7H-0003ew-ET for geh-help-gnu-emacs@m.gmane.org; Tue, 10 Jun 2014 19:45:47 +0200 Original-Received: from localhost ([::1]:41542 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WuQ7G-0007qP-T1 for geh-help-gnu-emacs@m.gmane.org; Tue, 10 Jun 2014 13:45:46 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:58944) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WuQ6h-0007p8-Cj for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 13:45:22 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WuQ6a-0006iy-Fv for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 13:45:11 -0400 Original-Received: from mtaout27.012.net.il ([80.179.55.183]:56301) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WuQ6a-0006hv-8X for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 13:45:04 -0400 Original-Received: from conversion-daemon.mtaout27.012.net.il by mtaout27.012.net.il (HyperSendmail v2007.08) id <0N6Y00L00RCZLP00@mtaout27.012.net.il> for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 20:37:00 +0300 (IDT) Original-Received: from HOME-C4E4A596F7 ([87.69.4.28]) by mtaout27.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0N6Y00IUHRLNN140@mtaout27.012.net.il> for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 20:37:00 +0300 (IDT) In-reply-to: X-012-Sender: halo1@inter.net.il X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 80.179.55.183 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:98125 Archived-At: > From: Mikhail Gusarov > Date: Tue, 10 Jun 2014 13:55:13 +0200 > > Are there defined Emacs characters in range 0x110000-0x3fff7f > (beyond Unicode codepoints, but before eight-bit raw bytes)? Yes, quite a few. > I looked at various map files in etc and hadn't found any non-Unicode > codepoints there. Documentation refers to this range as "characters that are > not unified with Unicode", but does not give any examples. You will find them in lisp/international/mule-conf.el. Look for any define-charset form which has a :unify-map property. The :code-offset property gives the beginning of the codepoint block for each of these charsets, which tells you where in the 0x110000-0x3fff7f range they are mapped. This is an obscure issue, which is of interest to a select few (maybe just one) of the Emacs hackers, that's why it is never described more than you found in the documentation.