From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.help Subject: Re: Chracters not unified with Unicode -- any example? Date: Tue, 10 Jun 2014 15:20:06 -0400 Message-ID: References: <838up41x57.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1402428063 24009 80.91.229.3 (10 Jun 2014 19:21:03 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 10 Jun 2014 19:21:03 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Jun 10 21:20:55 2014 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WuRbL-0006x8-98 for geh-help-gnu-emacs@m.gmane.org; Tue, 10 Jun 2014 21:20:55 +0200 Original-Received: from localhost ([::1]:41896 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WuRbK-0003yr-PP for geh-help-gnu-emacs@m.gmane.org; Tue, 10 Jun 2014 15:20:54 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:54954) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WuRau-0003yl-Er for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 15:20:36 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WuRam-0007sE-SA for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 15:20:28 -0400 Original-Received: from plane.gmane.org ([80.91.229.3]:38201) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WuRam-0007s4-LL for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 15:20:20 -0400 Original-Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1WuRak-0006Sb-Tk for help-gnu-emacs@gnu.org; Tue, 10 Jun 2014 21:20:18 +0200 Original-Received: from 75.119.224.253 ([75.119.224.253]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 10 Jun 2014 21:20:18 +0200 Original-Received: from monnier by 75.119.224.253 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 10 Jun 2014 21:20:18 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 18 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 75.119.224.253 User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.4.50 (gnu/linux) Cancel-Lock: sha1:ZX5g6Eue1Kvt1s7WHWN+38UE/mY= X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 80.91.229.3 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:98127 Archived-At: > You will find them in lisp/international/mule-conf.el. Look for any > define-charset form which has a :unify-map property. The :code-offset > property gives the beginning of the codepoint block for each of these > charsets, which tells you where in the 0x110000-0x3fff7f range they > are mapped. > This is an obscure issue, which is of interest to a select few (maybe > just one) of the Emacs hackers, that's why it is never described more > than you found in the documentation. Indeed, if you need to know the details, ask Kenichi Handa. IIUC Emacs uses some parts of this area to map some (parts of) asian charsets such as GBnnnn which contain some chars which aren't (yet) in Unicode. Stefan