From mboxrd@z Thu Jan 1 00:00:00 1970
From: Eli Zaretskii
Newsgroups: gmane.emacs.bugs
Subject: bug#20499: C-x 8 shorthands for curved quotes, Euro, etc.
Date: Thu, 07 May 2015 17:44:25 +0300
Message-ID: <83a8xgqwfq.fsf@gnu.org>
References: <1430701990-31993-1-git-send-email-eggert@cs.ucla.edu>
 <5547BD19.1010608@cs.ucla.edu> <55485D27.2010901@cs.ucla.edu>
 <554B19FC.70602@cs.ucla.edu> <878ud0k8qh.fsf_-_@violet.siamics.net>
In-reply-to: <878ud0k8qh.fsf_-_@violet.siamics.net>
Reply-To: Eli Zaretskii
To: Ivan Shmakov
Cc: 20499@debbugs.gnu.org

> From: Ivan Shmakov
> Date: Thu, 07 May 2015 10:00:38 +0000
>
> > Although I suppose it comes from a decomposition table, I don't know
> > what the table was designed for, and it's not clear to me how it's
> > relevant.
>
> I hope someone more knowledgeable could comment on this.

I'm not sure I'm your man, or what needs to be commented on, but I
will try nonetheless ;-)

The 'decomposition property of a character (like every other property
accessed by get-char-code-property) comes directly from the Unicode
database.  In this case, you will see that some characters in
UnicodeData.txt have this field non-empty:

  1E99;LATIN SMALL LETTER Y WITH RING ABOVE;Ll;0;L;0079 030A;;;;N;;;;;
                                                   ^^^^^^^^^

This gives the so-called "canonical decomposition" of the character;
in this case, we are told that U+1E99's decomposition is a sequence of
U+0079 (lower-case y) followed by U+030A (combining ring above).

Some characters have "compatibility decompositions" instead, like
this:

  1E9A;LATIN SMALL LETTER A WITH RIGHT HALF RING;Ll;0;L;<compat> 0061 02BE;;;;N;;;;;
                                                        ^^^^^^^^^^^^^^^^^^

These are useful for collation-driven sorting and for loose
comparisons a la string-collate-lessp.

For more details about this, see http://unicode.org/reports/tr44/, the
Unicode Technical Report that describes the Unicode Character Database.
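
To make the field layout concrete, here is a minimal Python sketch that
splits the decomposition field out of the two UnicodeData.txt records
discussed above.  It only illustrates the file format and is not how
Emacs builds its character property tables.  The helper name
parse_decomposition and the RECORDS list are made up for this example;
the second record carries the <compat> tag as it appears in
UnicodeData.txt.

```python
# Sketch only: parse_decomposition is a hypothetical helper, not an Emacs
# or Unicode API.  In a UnicodeData.txt record the fields are separated
# by ";", and field 5 (counting from 0) is the decomposition mapping.

def parse_decomposition(record):
    """Return (code point, tag or None, list of code points) for one record."""
    fields = record.split(";")
    codepoint = int(fields[0], 16)
    parts = fields[5].split()
    tag = None
    # A leading <tag> marks a compatibility decomposition;
    # a mapping without a tag is a canonical decomposition.
    if parts and parts[0].startswith("<") and parts[0].endswith(">"):
        tag = parts[0][1:-1]
        parts = parts[1:]
    return codepoint, tag, [int(p, 16) for p in parts]


# The two records quoted in this message.
RECORDS = [
    "1E99;LATIN SMALL LETTER Y WITH RING ABOVE;Ll;0;L;0079 030A;;;;N;;;;;",
    "1E9A;LATIN SMALL LETTER A WITH RIGHT HALF RING;Ll;0;L;<compat> 0061 02BE;;;;N;;;;;",
]

for record in RECORDS:
    cp, tag, mapping = parse_decomposition(record)
    kind = "canonical" if tag is None else "compatibility <%s>" % tag
    targets = " ".join("U+%04X" % c for c in mapping)
    print("U+%04X: %s -> %s" % (cp, kind, targets))
```

A record whose decomposition field is empty comes back with no tag and
an empty list, so callers can tell "no decomposition" apart from the
canonical and compatibility cases.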