From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: Composing Hebrew diacriticals Date: Mon, 17 May 2010 13:35:42 +0900 Message-ID: References: <83mxwlw2c0.fsf@gnu.org> <83pr12pfw6.fsf@gnu.org> <83fx1xowfj.fsf@gnu.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1274070966 14998 80.91.229.12 (17 May 2010 04:36:06 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 17 May 2010 04:36:06 +0000 (UTC) Cc: emacs-devel@gnu.org To: Yair F Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon May 17 06:36:04 2010 connect(): No such file or directory Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1ODs3a-0001J7-T4 for ged-emacs-devel@m.gmane.org; Mon, 17 May 2010 06:35:59 +0200 Original-Received: from localhost ([127.0.0.1]:55868 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1ODs3a-00031F-CN for ged-emacs-devel@m.gmane.org; Mon, 17 May 2010 00:35:58 -0400 Original-Received: from [140.186.70.92] (port=41415 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1ODs3U-000310-Ca for emacs-devel@gnu.org; Mon, 17 May 2010 00:35:53 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1ODs3S-0001np-Fo for emacs-devel@gnu.org; Mon, 17 May 2010 00:35:52 -0400 Original-Received: from mx1.aist.go.jp ([150.29.246.133]:60128) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1ODs3S-0001nU-0R for emacs-devel@gnu.org; Mon, 17 May 2010 00:35:50 -0400 Original-Received: from rqsmtp2.aist.go.jp (rqsmtp2.aist.go.jp [150.29.254.123]) by mx1.aist.go.jp with ESMTP id o4H4Zhde013453; Mon, 17 May 2010 13:35:43 +0900 (JST) env-from (handa@m17n.org) Original-Received: from smtp2.aist.go.jp by rqsmtp2.aist.go.jp with ESMTP id o4H4Zh00008709; Mon, 17 May 2010 13:35:43 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp2.aist.go.jp with ESMTP id o4H4Zg7V003986; Mon, 17 May 2010 13:35:42 +0900 (JST) env-from (handa@m17n.org) Original-Received: from handa by etlken with local (Exim 4.69) (envelope-from ) id 1ODs3K-0006f0-57; Mon, 17 May 2010 13:35:42 +0900 In-Reply-To: (message from Yair F on Sun, 16 May 2010 00:35:11 +0300) X-detected-operating-system: by eggs.gnu.org: Solaris 9 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:124847 Archived-At: In article , = Yair F writes: >>> 1. Letter Shin (U+5E9) is not composed. It is followed by Shin-dot >>> (U+5C2) which breaks the composition. > > > > In my environment, the sequence U+5E9 U+5C2 is surely > > composed. =A0Please show me the output of this command. > > > > % MDEBUG_FLT=3D3 emacs --eval '(message "\u05E9\u05C2")' [...] > Mine is > [FLT] (hebr-ff (dejavu sans) > [FLT] (SOURCE 05E9) > [FLT] (STAGE 0 "H" (05E9) > [FLT] (SUBPART 0 > [FLT] (COND > [FLT] (REGEX "^." "H" 1 > [FLT] (COPY 0x5E9))))) > [FLT] (RESULT (053D 1152 0 0))) Are you using the same setting as mine which I wrote as below in the previous mail? > I have not yet committed proper codes for Hebrew > composition. I'm now testing with this simple version. >=20 > (let ((pattern "[\u05D0-\u05F2][\u0591-\u05BF\u05C1-\u05C5\u05C7]+")) > (set-char-table-range > composition-function-table '(#x591 . #x5C7) > (list (vector pattern 1 'font-shape-gstring) > ["[\u0591-\u05C7]" 0 font-shape-gstring])) > (set-char-table-range > composition-function-table #x5C0 nil) > (set-char-table-range > composition-function-table #x5C6 nil)) >>> 2. Letter Lamed (U+5DC) Followed by Holam (U+5B9) The point is >>> composed to the right of the letter instead of left. > > > > It seems that "dejavu sans" font doesn't have 'ccmp' nor > > 'mark' feature applicable to that sequence. > In that case, may I suggest switching to a different font? It's possible to write such a code (which will slow down the display), but if you don't have a better font, we anyway have to use the available font. By the way, do you have a better font than 'dejavu sans' for Hebrew? >>> 3. Letter Yod (U+5D9) composed with Hiriq (U+5B4) is composed into >>> presentation form (U+FB1D). This should only happen with specific >>> control (Either CGJ or ZWJ I'll check). > > > > I tried to show that sequence with gedit and and firefox. > > Both show it with U+FB1D. > Kate show this as well, Firefox doesn't. Nevertheless it's still a bug. Then what is the correct rendering of the sequence "\u05D9\u05B4"? Applying only `mark' feature? With dejavu sans font, U+FB1D has lower dot above baseline, but just applying `mark' feature results in displaying dot below baseline. See the attached image. The left one is the composition with `ccmp' GSUB feature (i.e. into U+FB1D) for the sequence "\u05D9\u200D\u05B4", and the right one is the composition with `mark' GPOS feature for the sequence "\u05D9\u05B4". Is this the correct rendering? --- Kenichi Handa handa@m17n.org