From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Robert Alessi Newsgroups: gmane.emacs.bugs Subject: bug#36717: 25.3; greek.el: deprecated vowel+oxia combinations should be replaced with vowel+tonos counterparts Date: Fri, 19 Jul 2019 15:47:12 +0200 Message-ID: <20190719134712.GD9882@robertalessi.net> References: <20190718173252.GA3093@robertalessi.net> <20190718184700.GA4886@robertalessi.net> <20190718203203.GG4886@robertalessi.net> <83tvbiv7dp.fsf@gnu.org> <20190719085824.GA3263@robertalessi.net> <83lfwuv05b.fsf@gnu.org> <20190719095407.GB5734@robertalessi.net> <83d0i6uqso.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="225959"; mail-complaints-to="usenet@blaine.gmane.org" Cc: rpluim@gmail.com, 36717@debbugs.gnu.org To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Fri Jul 19 15:48:07 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1hoTF0-000wea-J3 for geb-bug-gnu-emacs@m.gmane.org; Fri, 19 Jul 2019 15:48:06 +0200 Original-Received: from localhost ([::1]:45710 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hoTEz-0007yJ-JQ for geb-bug-gnu-emacs@m.gmane.org; Fri, 19 Jul 2019 09:48:05 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:41564) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hoTEx-0007y9-2b for bug-gnu-emacs@gnu.org; Fri, 19 Jul 2019 09:48:04 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hoTEw-0006JF-43 for bug-gnu-emacs@gnu.org; Fri, 19 Jul 2019 09:48:03 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:46602) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1hoTEw-0006J6-0f for bug-gnu-emacs@gnu.org; Fri, 19 Jul 2019 09:48:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1hoTEv-00038J-U3 for bug-gnu-emacs@gnu.org; Fri, 19 Jul 2019 09:48:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Robert Alessi Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 19 Jul 2019 13:48:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 36717 X-GNU-PR-Package: emacs Original-Received: via spool by 36717-submit@debbugs.gnu.org id=B36717.156354404011993 (code B ref 36717); Fri, 19 Jul 2019 13:48:01 +0000 Original-Received: (at 36717) by debbugs.gnu.org; 19 Jul 2019 13:47:20 +0000 Original-Received: from localhost ([127.0.0.1]:55423 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hoTEG-00037N-5h for submit@debbugs.gnu.org; Fri, 19 Jul 2019 09:47:20 -0400 Original-Received: from mx1.riseup.net ([198.252.153.129]:55328) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hoTEC-00037E-RI for 36717@debbugs.gnu.org; Fri, 19 Jul 2019 09:47:18 -0400 Original-Received: from capuchin.riseup.net (capuchin-pn.riseup.net [10.0.1.176]) (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (Client CN "*.riseup.net", Issuer "COMODO RSA Domain Validation Secure Server CA" (verified OK)) by mx1.riseup.net (Postfix) with ESMTPS id 193431B9419; Fri, 19 Jul 2019 06:47:16 -0700 (PDT) X-Riseup-User-ID: 3E7101EB63F9D03F56C9BCDF1047A0992D9441956BF6FECFC22BE075D66AA549 Original-Received: from [127.0.0.1] (localhost [127.0.0.1]) by capuchin.riseup.net (Postfix) with ESMTPSA id A8CE112098B; Fri, 19 Jul 2019 06:47:15 -0700 (PDT) Content-Disposition: inline In-Reply-To: <83d0i6uqso.fsf@gnu.org> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:163412 Archived-At: On Fri, Jul 19, 2019 at 03:55:51PM +0300, Eli Zaretskii wrote: > > For example, if one makes no distinction between the two, then it > > becomes harder to analyse large corpuses with a computer. > > What do you mean by "makes no distinction"? Those are different > codepoints, regardless of how they look on display. So we definitely > _can_ distinguish between them. I meant ``mixes up'': suppose you are dealing with large corpuses with passages both in modern and ancient Greek and you only use one of the two existing variants, either tonos or oxia: then if you wish to analyse your texts with a computer, you are in trouble. More generally, any normalization leads to confusion. This page will give you some examples: https://jktauber.com/articles/python-unicode-ancient-greek/ My point is that one should never use tonos variants in ancient Greek. And so never use oxia variants in modern Greek.