From mboxrd@z Thu Jan 1 00:00:00 1970
From: Alan Third
Newsgroups: gmane.emacs.bugs
Subject: bug#23934: 25.1.50; Incorrect rendering of cyrillic text on OS X
Date: Sun, 10 Jul 2016 18:09:02 +0100
Message-ID: <20160710170902.GA51571@breton.holly.idiocy.org>
References: <83zippeaxz.fsf@gnu.org> <20160710162802.GB51429@breton.holly.idiocy.org> <83twfxe8pi.fsf@gnu.org>
In-Reply-To: <83twfxe8pi.fsf@gnu.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
To: Eli Zaretskii
Cc: 23934@debbugs.gnu.org

On Sun, Jul 10, 2016 at 07:52:57PM +0300, Eli Zaretskii wrote:
> > Date: Sun, 10 Jul 2016 17:28:02 +0100
> > From: Alan Third
> > Cc: 23934@debbugs.gnu.org
> >
> > Yes, Emacs seems to think that the composed character takes up no
> > space. This may be no help whatsoever, but I've attached another image
> > showing a cyrillic a with acute alongside the latin version. The latin
The latin > > one is not, afaik, composed, because I don't know how to do that. > > The Latin character is composed, because Emacs knows how to do that. > You can verify with "C-u C-x =". > > Can you tell what does window-text-pixel-size return for the а́ part of > the text, both with Menlo and with Monaco? Using (window-text-pixel-size nil 3 5): Menlo: (0 . 14) Monaco: (7 . 16) > > I don't know how to check whether the font supports it. > > Here's what describe-char says: > > > > position: 148 of 234 (63%), column: 2 > > character: а (displayed as а) (codepoint 1072, #o2060, #x430) > > preferred charset: cyrillic-iso8859-5 (Right-Hand Part of ISO/IEC 8859/5 (Latin/Cyrillic): ISO-IR-144) > > code point in charset: 0x50 > > script: cyrillic > > syntax: w which means: word > > category: .:Base, L:Left-to-right (strong), Y:2-byte Cyrillic, c:Chinese, h:Korean, j:Japanese, y:Cyrillic > > to input: type "C-x 8 RET 430" or "C-x 8 RET CYRILLIC SMALL LETTER A" > > buffer code: #xD0 #xB0 > > file code: #xD0 #xB0 (encoded by coding system utf-8-unix) > > display: composed to form "а́" (see below) > > > > Composed with the following character(s) "́" using this font: > > mac-ct:-*-Menlo-normal-normal-normal-*-12-*-*-*-m-0-iso10646-1 > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > The above line says that Menlo does have the glyph for the accent. > Emacs will not compose characters from different fonts. > > > by these glyphs: > > [0 1 1072 870 7 0 7 7 1 nil] > > [0 1 769 646 7 2 6 10 -7 [-8 -1 -8]] > > Can you show the corresponding data when you use the Monaco font? 

position: 3 of 7 (29%), column: 2
character: а (displayed as а) (codepoint 1072, #o2060, #x430)
preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x0430
script: cyrillic
syntax: w which means: word
category: .:Base, L:Left-to-right (strong), Y:2-byte Cyrillic, c:Chinese, h:Korean, j:Japanese, y:Cyrillic
to input: type "C-x 8 RET 430" or "C-x 8 RET CYRILLIC SMALL LETTER A"
buffer code: #xD0 #xB0
file code: #xD0 #xB0 (encoded by coding system utf-8-unix)
display: composed to form "а́" (see below)

Composed with the following character(s) "́" using this font:
  mac-ct:-*-Monaco-normal-normal-normal-*-12-*-*-*-m-0-iso10646-1
by these glyphs:
  [0 1 1072 449 7 0 7 7 1 nil]
  [0 1 769 375 0 -6 -1 10 -7 nil]

Character code properties: customize what to show
  name: CYRILLIC SMALL LETTER A
  general-category: Ll (Letter, Lowercase)
  decomposition: (1072) ('а')

-- 
Alan Third
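
For anyone comparing the two sets of glyph vectors by eye, here is a rough
sketch in Python (run outside Emacs) that labels the fields of each vector
and sums the WIDTH column per font. The field order is an assumption taken
from my reading of the composition-get-gstring docstring
([FROM-IDX TO-IDX C CODE WIDTH LBEARING RBEARING ASCENT DESCENT ADJUSTMENT],
with ADJUSTMENT either nil or [X-OFF Y-OFF WADJUST]); nothing in this thread
confirms it. The names parse_glyph and summarize are made up for this
illustration.

```python
import re

# Field order ASSUMED from the composition-get-gstring docstring,
# not confirmed anywhere in this thread.
FIELDS = ("from_idx", "to_idx", "char", "code", "width",
          "lbearing", "rbearing", "ascent", "descent")
ADJUSTMENT_FIELDS = ("x_off", "y_off", "wadjust")

# Glyph vectors copied verbatim from the two describe-char outputs.
GLYPHS = {
    "Menlo": ["[0 1 1072 870 7 0 7 7 1 nil]",
              "[0 1 769 646 7 2 6 10 -7 [-8 -1 -8]]"],
    "Monaco": ["[0 1 1072 449 7 0 7 7 1 nil]",
               "[0 1 769 375 0 -6 -1 10 -7 nil]"],
}


def parse_glyph(text):
    """Turn one printed glyph vector into a dict of labelled fields."""
    inner = text.strip()[1:-1]          # drop the outer [ ]
    adjustment = None
    nested = re.search(r"\[(.*?)\]", inner)
    if nested:                          # trailing [X-OFF Y-OFF WADJUST]
        values = [int(v) for v in nested.group(1).split()]
        adjustment = dict(zip(ADJUSTMENT_FIELDS, values))
        inner = inner[:nested.start()]
    tokens = inner.split()
    if tokens and tokens[-1] == "nil":  # no adjustment vector
        tokens.pop()
    glyph = dict(zip(FIELDS, (int(t) for t in tokens)))
    glyph["adjustment"] = adjustment
    return glyph


def summarize(font):
    """Sum the WIDTH fields and collect the adjustment vectors for FONT."""
    glyphs = [parse_glyph(g) for g in GLYPHS[font]]
    return {"width_sum": sum(g["width"] for g in glyphs),
            "adjustments": [g["adjustment"] for g in glyphs]}


if __name__ == "__main__":
    for font in GLYPHS:
        print(font, summarize(font))
```

Under that assumed layout the WIDTH fields add up to 14 for Menlo and 7 for
Monaco, while window-text-pixel-size reported widths of 0 and 7. So a plain
sum of WIDTH agrees with Monaco but not with Menlo, and the visible
difference is that only Menlo's accent glyph carries an adjustment vector
([-8 -1 -8]). Whether that adjustment is what produces the zero width is a
guess on my part, not something this data settles.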