From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Madhu Newsgroups: gmane.emacs.devel Subject: Re: Not able to display \u110BD and \u110CD in Emacs Date: Sat, 07 May 2022 20:27:26 +0530 (IST) Message-ID: <20220507.202726.836184849101723671.enometh@meer.net> References: <83h761zv0w.fsf@gnu.org> <20220507.194920.1642525629735859845.enometh@meer.net> <83pmkpxw15.fsf@gnu.org> Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="13598"; mail-complaints-to="usenet@ciao.gmane.io" Cc: emacs-devel@gnu.org To: eliz@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Sat May 07 16:58:10 2022 Return-path: Envelope-to: ged-emacs-devel@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1nnLso-0003Nb-4j for ged-emacs-devel@m.gmane-mx.org; Sat, 07 May 2022 16:58:10 +0200 Original-Received: from localhost ([::1]:38782 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1nnLsm-0002wy-Rr for ged-emacs-devel@m.gmane-mx.org; Sat, 07 May 2022 10:58:08 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:57204) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nnLs3-0001Yb-I5 for emacs-devel@gnu.org; Sat, 07 May 2022 10:57:23 -0400 Original-Received: from smtp6.ctinetworks.com ([205.166.61.199]:34426) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nnLs1-0001OV-NJ; Sat, 07 May 2022 10:57:23 -0400 Original-Received: from localhost (unknown [117.254.34.14]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) (Authenticated sender: enometh@meer.net) by smtp6.ctinetworks.com (Postfix) with ESMTPSA id 8521C85CFA; Sat, 7 May 2022 10:57:13 -0400 (EDT) In-Reply-To: <83pmkpxw15.fsf@gnu.org> X-Mailer: Mew version 6.8 on Emacs 29.0.50 X-ctinetworks-Information: Please contact the ISP for more information X-ctinetworks-MailScanner-ID: 8521C85CFA.AAE10 X-ctinetworks-VirusCheck: Found to be clean X-ctinetworks-Watermark: 1652799440.20384@3qz3SikDUkimYiWFtgS/Pg Received-SPF: pass client-ip=205.166.61.199; envelope-from=enometh@meer.net; helo=smtp6.ctinetworks.com X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane-mx.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.io gmane.emacs.devel:289400 Archived-At: * Eli Zaretskii <83pmkpxw15.fsf@gnu.org> Wrote on Sat, 07 May 2022 17:22:14 +0300 >> Date: Sat, 07 May 2022 19:49:20 +0530 (IST) >> Cc: emacs-devel@gnu.org >> From: Madhu >> * Eli Zaretskii <83h761zv0w.fsf@gnu.org> >> Wrote on Sat, 07 May 2022 10:01:03 +0300 >> >> From: Madhu >> >> Date: Thu, 05 May 2022 21:23:50 +0530 >> >> e.g. (#x969 #x951 #x952), when composed, the devanagari numeral 3 >> >> should have an anudatta ("line below") and svarita ("vertical line >> >> above") >> > >> > Where are these rules documented? I don't see them in the latest >> > Unicode Standard 14.0. >> >> I have no idea. I assumed the rulesfollow from (some defined) rules >> for composition, but I stopped dealing with unicode for sanskrit ( >> vedic accents) back in 2002 when I saw the direction it was going in. > > It is strange that I seem to be unable to find such rules anywhere on > the Internet, not just in the Unicode Standard text. The Wikner I posted upthread mentions the tradition concisely in on p.3/6 under (a)Rgveda: http://www.evertype.com/standards/iso10646/pdf/vedic/Vedic_accents_doc.pdf (a) R.gveda has udattaa unmarked the svarita (jatya or suddha) is marked with a vertical line above the syllable (e.g. XX), but the kampa is indicated by a numeral 1 following the vowel if it is short, or the numeral 3 if it is long, and in both cases there is a vertical line above the numeral and a horizontal line below it in the case of the long kampa vowel the syllable usually also has a horizontal line below it (e.g. XX ). The canonical work (this version not searchable unfortunately) https://archive.org/detauls/vedicgrammar00macduoft/vedicgrammar00macduoft - pdf p.468, p.450 talks about the accents 1 and 3 numerals used in this way. I've seen a number of pdfs introducing vedic accents in the wild that document this. Maybe I'll try to make a bibliography