From: Eli Zaretskii
Newsgroups: gmane.emacs.bugs
Subject: bug#20140: 24.4; M17n shaper output rejected
Date: Wed, 16 Feb 2022 17:13:56 +0200
Message-ID: <83y22a26a3.fsf@gnu.org>
In-Reply-To: <20220215012734.41fb4aaf@JRWUBU2> (message from Richard Wordingham on Tue, 15 Feb 2022 01:27:34 +0000)
To: Richard Wordingham
Cc: 20140-done@debbugs.gnu.org, larsi@gnus.org

> Date: Tue, 15 Feb 2022 01:27:34 +0000
> From: Richard Wordingham
> Cc: 20140@debbugs.gnu.org, larsi@gnus.org
>
> On Mon, 14 Feb 2022 22:14:27 +0000
> Richard Wordingham wrote:
>
> > On Mon, 14 Feb 2022 15:19:36 +0200
> > Eli Zaretskii wrote:
> >
> > > > Date: Sun, 13 Feb 2022 20:53:10 +0000
> > > > From: Richard Wordingham
> > > > Cc: larsi@gnus.org, 20140@debbugs.gnu.org
> > > >
> > > > You should also add CGJ and ZWNJ, and some people may appreciate
> > > > ZWJ - the Khottabun font has ligatures involving ZWJ, though it
> > > > may just be an experimental feature - and ultimately WJ, for when
> > > > someone writes a Tai Tham word breaker.
> > >
> > > How should I add CGJ and ZWNJ? What are the rules?
> > >
> > > > Oh, and Thai and Lao mai t(r)i and mai chat(t)awa and U+0324
> > > > COMBINING DIAERESIS BELOW turn up occasionally - U+0324 is
> > > > supported in Thep's Khottabun font, and my Da Lekh series
> > > > supports Thai mai tri and mai chattawa. These characters seem to
> > > > work with HarfBuzz.
> > >
> > > Not sure I understand: what patterns/rules should be added for
> > > these?
> >
> > Add them all to "M" in the definition of tai-tham-composable-pattern.
> > Strictly, U+0324 should also be added to "S", but I'd be surprised to
> > see it in a genuine spelling.
>
> In view of Wyn Owen's report (A Description and Linguistic Analysis of
> the Tai Khuen Writing System, JSEALS 10.1 (2017)
> https://evols.library.manoa.hawaii.edu/bitstream/10524/52403/1/09_Owen2017description.pdf)
> on Tai Khuen spelling, one should also add U+0E49 THAI CHARACTER MAI
> THO to "M". And, of course, as all 5 non-Tai Tham tone marks used with
> the Tai Tham script have canonical combining class greater than 9, they
> should be added to "S" - i.e. add U+0E49 to U+0E4B and U+0EC9 and
> U+0ECB to "S".

Thanks, done that as well, and installed the changes for Emacs 29.

And with that, I'm closing this bug report. Thanks a lot for your
code and helpful discussions.
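
For anyone who wants to double-check the combining-class argument quoted
above, here is a minimal sketch in Python (not Emacs Lisp, and not part of
the installed change). It only looks up the canonical combining class of
the five Thai and Lao tone marks listed in the quoted message, using the
standard unicodedata module. The names TONE_MARKS and combining_classes
are made up for this illustration, and the values reported depend on the
Unicode data bundled with the Python in use.

```python
import unicodedata

# The five non-Tai Tham tone marks named in the quoted message:
# U+0E49..U+0E4B (Thai) and U+0EC9, U+0ECB (Lao).
# TONE_MARKS is a hypothetical name used only for this sketch.
TONE_MARKS = [0x0E49, 0x0E4A, 0x0E4B, 0x0EC9, 0x0ECB]


def combining_classes(code_points):
    """Map each code point to its canonical combining class (ccc)."""
    return {cp: unicodedata.combining(chr(cp)) for cp in code_points}


classes = combining_classes(TONE_MARKS)
for cp, ccc in classes.items():
    # Print the code point, its Unicode name, and its ccc value.
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}: ccc={ccc}")

# True when every listed mark has a combining class greater than 9.
all_above_nine = all(ccc > 9 for ccc in classes.values())
print("all ccc > 9:", all_above_nine)
```

This says nothing about how tai-tham-composable-pattern itself is built;
it only checks the Unicode property that the suggestion relies on.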