From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bidi,gmane.emacs.devel Subject: Re: Arabic support Date: Thu, 02 Sep 2010 07:53:15 -0400 Message-ID: References: <83bp8oml9c.fsf@gnu.org> Reply-To: Eli Zaretskii NNTP-Posting-Host: lo.gmane.org X-Trace: dough.gmane.org 1283428403 20828 80.91.229.12 (2 Sep 2010 11:53:23 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Thu, 2 Sep 2010 11:53:23 +0000 (UTC) Cc: emacs-bidi@gnu.org, emacs-devel@gnu.org, jasonr@gnu.org To: Kenichi Handa Original-X-From: emacs-bidi-bounces+gnu-emacs-bidi=m.gmane.org@gnu.org Thu Sep 02 13:53:22 2010 Return-path: Envelope-to: gnu-emacs-bidi@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Or8M5-0003yp-T4 for gnu-emacs-bidi@m.gmane.org; Thu, 02 Sep 2010 13:53:22 +0200 Original-Received: from localhost ([127.0.0.1]:42574 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Or8M5-0007zR-Cq for gnu-emacs-bidi@m.gmane.org; Thu, 02 Sep 2010 07:53:21 -0400 Original-Received: from [199.232.76.173] (port=58829 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Or8M2-0007xy-V4 for emacs-bidi@gnu.org; Thu, 02 Sep 2010 07:53:18 -0400 Original-Received: from Debian-exim by monty-python.gnu.org with spam-scanned (Exim 4.60) (envelope-from ) id 1Or8M1-0002Y1-RQ for emacs-bidi@gnu.org; Thu, 02 Sep 2010 07:53:18 -0400 Original-Received: from fencepost.gnu.org ([140.186.70.10]:43801) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1Or8M1-0002Xx-KP; Thu, 02 Sep 2010 07:53:17 -0400 Original-Received: from eliz by fencepost.gnu.org with local (Exim 4.69) (envelope-from ) id 1Or8Lz-0004if-He; Thu, 02 Sep 2010 07:53:15 -0400 In-reply-to: (message from Kenichi Handa on Wed, 01 Sep 2010 16:08:50 +0900) X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: emacs-bidi@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Discussion of Emacs support for multi-directional text." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-bidi-bounces+gnu-emacs-bidi=m.gmane.org@gnu.org Errors-To: emacs-bidi-bounces+gnu-emacs-bidi=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bidi:796 gmane.emacs.devel:129582 Archived-At: > From: Kenichi Handa > Cc: jasonr@gnu.org, emacs-bidi@gnu.org, emacs-devel@gnu.org > Date: Wed, 01 Sep 2010 16:08:50 +0900 > > LGSTRING may contain multiple grapheme clusters. In the > case of arabic, we make LGSTRING for one Arabic word then > shape it (otherwise, the shaper can't know where in a word a > consonant appears). Where can I find the code which decides how to break text into LGSTRINGs? I'd like to see such code for both Arabic and Hebrew, unless it's the same code. For example, can characters like digits or other neutrals be included in the same LGSTRING with Arabic and Hebrew? Or will an LGSTRING always include characters from one script only? I'm asking because it's possible that we will need to modify w32uniscribe.c to reorder R2L characters before we pass them to the Uniscribe ScriptShape API, to let it see the characters in the logical order it expects them. That's if it turns out that Uniscribe cannot otherwise shape them correctly. TIA