From: Kenichi Handa
Newsgroups: gmane.emacs.bugs
Subject: bug#11860: 24.1; Arabic - Harakat (diacritics, short vowels) don't appear
Date: Sun, 19 Aug 2012 22:37:29 +0900
Message-ID: <87628fhujq.fsf@gnu.org>
References: <349071341393469@web30d.yandex.ru>
To: Jason Rumney
Cc: 11860@debbugs.gnu.org, smias@yandex.ru
In-Reply-To: <87393j7fdv.fsf@gnu.org> (message from Jason Rumney on Sun, 19 Aug 2012 11:02:52 +0800)

In article <87393j7fdv.fsf@gnu.org>, Jason Rumney writes:

>>> > So, apparently Emacs on Windows and GNU/Linux uses the
>>> > different metrics of glyphs.

> Right, but adding the offsets to the corresponding metrics, we get the
> same result with both the Windows and GNU/Linux cases,

??  I don't understand what you mean.

> except for the
> total height of the font, which I think is because Windows counts
> inter-line spacing, while on GNU/Linux, that is separate.

I'm not sure, but currently the y-axis metrics are not the problem.

> > For Hebrew too, on Windows, I see the same problem as what
> > Steffan reported:

> If you are seeing something different than Eli for Hebrew with the same
> font, then I suspect the cause is linked with the version of Uniscribe
> that is installed. Maybe diacritic handling for Hebrew and Arabic is a
> more recent addition to Uniscribe than the basic support for those
> languages.

Perhaps.  I tested it on Windows 7, and the tested version of Emacs was
24.0.?, not the latest one.  I'm now downloading the latest Windows
binary of Emacs.

>>> > For instance, in the above case, we may have to render glyphs in
>>> > this order (diacritical mark first):
>>> >
>>> > [0 1 1593 760 0 3 6 12 4 [1 -2 0]]
>>> > [0 1 1593 969 8 1 8 12 4 nil]

> I'm curious as to how we ended up with the same C entry in those
> vectors. Could this be causing us problems later on?

I don't think so.  As far as I remember, the C entries in a glyph
string are not used after shaping.

> The glyph index
> is correct (comparing to the GNU/Linux version), but I wonder if
> Uniscribe is referring back to the character at some point and tripping
> up because it has been changed.

I have no idea about that.

> > The comment refers to "clusters".  I don't know exactly what it
> > means in Uniscribe, but I guess it relates to grapheme clusters,
> > and if so, this part seems to relate to the ordering of glyphs
> > in this kind of grapheme cluster:
> >
> > [0 1 1593 969 8 1 8 12 4 nil]
> > [0 1 1593 760 0 3 6 12 4 [1 -2 0]]

> That seems to be correct.

Why?
As the xadvance of the first glyph is 8 and the xoffset of the second
glyph is 1, the second glyph is never drawn in the same column as the
first glyph.

> Maybe this is the code that is changing the
> character code to 1593. I seem to recall that something like this was
> required for Indic languages to let Emacs know which characters had been
> linked back into one glyph.

Is that Windows-specific?

---
Kenichi Handa
handa@gnu.org
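
P.S. The xadvance/xoffset arithmetic above can be checked with a rough sketch like the one below. It assumes the glyph vectors follow the layout [FROM-IDX TO-IDX C CODE WIDTH LBEARING RBEARING ASCENT DESCENT ADJUSTMENT], with ADJUSTMENT being [X-OFF Y-OFF WADJUST] or nil, and it uses a deliberately simplified left-to-right pen model (advance by WIDTH, shift by X-OFF). The names ink_spans and overlaps are made up for illustration; this is not how Emacs or Uniscribe actually draws.

```python
def ink_spans(glyphs):
    """Return the horizontal ink extent (left, right) of each glyph.

    Simplified model (an assumption, not Emacs's drawing code): the pen
    starts at 0, each glyph is drawn at pen + X-OFF, and the pen then
    advances by the glyph's WIDTH.
    """
    pen = 0
    spans = []
    for g in glyphs:
        width, lbearing, rbearing, adjustment = g[4], g[5], g[6], g[9]
        x_off = adjustment[0] if adjustment else 0
        origin = pen + x_off
        spans.append((origin + lbearing, origin + rbearing))
        pen += width  # WADJUST equals WIDTH (0) for the mark in this data
    return spans


def overlaps(a, b):
    """True if two (left, right) extents share any horizontal range."""
    return a[0] < b[1] and b[0] < a[1]


# The two glyphs quoted in this thread (nil written as None).
BASE = [0, 1, 1593, 969, 8, 1, 8, 12, 4, None]
MARK = [0, 1, 1593, 760, 0, 3, 6, 12, 4, [1, -2, 0]]

base_first = ink_spans([BASE, MARK])   # order currently produced on Windows
mark_first = ink_spans([MARK, BASE])   # order suggested earlier in the thread

print(base_first, overlaps(*base_first))
print(mark_first, overlaps(*mark_first))
```

Under this model the base-first order puts the mark entirely to the right of the base glyph, while the mark-first order places it within the base glyph's extent.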