From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Amin Bandali Newsgroups: gmane.emacs.bugs Subject: bug#41005: problem with rendering Persian text in Emacs 27 Date: Sat, 25 Jul 2020 11:53:19 -0400 Organization: GNU Message-ID: <87mu3nd21c.fsf@gnu.org> References: <35A46479-A62C-42FF-995B-B295FE3408C0@gnu.org> <08A9D65F-0C9C-4EE2-B3B9-2AA25BFFAD54@gnu.org> <878sh35j6f.fsf@gmail.com> <83y2p3as6c.fsf@gnu.org> <87pnae4nhx.fsf@gmail.com> <83ftbaaql5.fsf@gnu.org> <87eeqt3nww.fsf@gmail.com> <83bllxbuxq.fsf@gnu.org> <87mu3txcpz.fsf@gnu.org> <871rl4u54a.fsf@gnu.org> <83zh7sg1v9.fsf@gnu.org> <87o8o8jo1o.fsf@gnu.org> <83k0yvfx7o.fsf@gnu.org> <87wo2tzh59.fsf@gnu.org> <83pn8le95z.fsf@gnu.org> <877dusfcq0.fsf@gnu.org> <83y2n8ccou.fsf@gnu.org> Mime-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha256; protocol="application/pgp-signature" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="5115"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) Cc: rpluim@gmail.com, valizadeh.ho@gmail.com, 41005@debbugs.gnu.org, pipcet@gmail.com To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Sat Jul 25 17:54:11 2020 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1jzMUz-0001CR-Ng for geb-bug-gnu-emacs@m.gmane-mx.org; Sat, 25 Jul 2020 17:54:09 +0200 Original-Received: from localhost ([::1]:39908 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jzMUy-0003XQ-RB for geb-bug-gnu-emacs@m.gmane-mx.org; Sat, 25 Jul 2020 11:54:08 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:51748) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 
4.90_1) (envelope-from ) id 1jzMUs-0003XH-Gm for bug-gnu-emacs@gnu.org; Sat, 25 Jul 2020 11:54:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:39329) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1jzMUs-0007vx-6V for bug-gnu-emacs@gnu.org; Sat, 25 Jul 2020 11:54:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1jzMUs-0004RH-3p for bug-gnu-emacs@gnu.org; Sat, 25 Jul 2020 11:54:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Amin Bandali Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sat, 25 Jul 2020 15:54:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 41005 X-GNU-PR-Package: emacs Original-Received: via spool by 41005-submit@debbugs.gnu.org id=B41005.159569241417029 (code B ref 41005); Sat, 25 Jul 2020 15:54:02 +0000 Original-Received: (at 41005) by debbugs.gnu.org; 25 Jul 2020 15:53:34 +0000 Original-Received: from localhost ([127.0.0.1]:50875 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1jzMUQ-0004Qb-7e for submit@debbugs.gnu.org; Sat, 25 Jul 2020 11:53:34 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:58196) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1jzMUL-0004QK-25 for 41005@debbugs.gnu.org; Sat, 25 Jul 2020 11:53:32 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:45774) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jzMUF-0007sO-8h; Sat, 25 Jul 2020 11:53:23 -0400 Original-Received: from [2607:fea8:3fe0:17d::7] (port=52632 helo=localhost) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1jzMUD-0004GN-Hq; Sat, 25 Jul 2020 11:53:22 -0400 In-Reply-To: <83y2n8ccou.fsf@gnu.org> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: 
bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:183527 Archived-At:

--=-=-=
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable

Eli Zaretskii writes:

[...]
>
> Please help me understand what exactly do you mean by "garbled". The
> text you show is still quite long, and I cannot easily locate it in
> that page (I don't see any "About Wikipedia" in the English version of
> that page), nor do I understand what you mean by "garbled".
>

Choosing the Persian Wikipedia was probably not the best idea, given how
busy their page is and how much material is on there, which does not
help at all when trying to find an excerpt of text in a language one
doesn't know.

Also, sorry for not being clear about what I mean by "garbled" text.
Please see . In the Persian alphabet, letters take different forms
depending on where in a word they appear. The "overview table" in the
above page includes examples of the possible contextual forms for each
letter. The issue described in this bug report is basically about Emacs
using the wrong contextual form of letters when rendering Persian text.

>
> Would it be possible instead to paste only a very small portion of the
> text, and tell exactly which part(s) of that short text are garbled,
> and how they are garbled? Can you, for example, post a screenshot
> showing exactly which part of the Wikipedia page should be copied, and
> another screenshot of the garbled text in Emacs showing which part(s)
> are displayed incorrectly?
> And please keep the pasted text as short
> as possible, because locating the garbled part(s) in text I cannot
> read which is displayed in a different font from what's in the
> screenshot can be a very frustrating and error-prone experience.
>
> Also, do you copy this from EWW or from some other Web browser?
>

Certainly. Instead of Persian Wikipedia, let's use the Persian
translation of the GNU homepage: ; specifically, the first part of the
first sentence of the first paragraph (up to and including the
semicolon):

گنو یک سیستم‌عامل بر مبنای نرم‌افزار آزاد است؛

I have created a very short video screencast of me walking through
reproducing the issue, by opening in Debian Buster's firefox-esr
(68.10.0esr (64-bit)), copying the above excerpt from the page, and
pasting into an Emacs *scratch* using C-y:

https://p.bndl.org/emacs-persian-wrong-contextual-forms.webm

The two open Emacs instances were both launched with "emacs -Q" and are
identical, except that in the second one I ran
M-x column-number-mode RET before pasting the Persian text.

I also meant to include the following in my video in case they might be
useful, but forgot to.

,----[ M-x emacs-version RET ]
| GNU Emacs 28.0.50 (build 8, x86_64-pc-linux-gnu, X toolkit, cairo
| version 1.16.0, Xaw3d scroll bars) of 2020-07-20
`----

,----[ C-h v system-configuration-options RET ]
| "--with-modules --without-gconf --without-gsettings
| --with-x-toolkit=lucid --with-xft --with-xaw3d --without-gpm
| --with-imagemagick --with-harfbuzz --prefix=/data/bandali/usr/local"
`----

>
> (I tried to explain all of this in my previous message, but
> unfortunately this reproduction recipe again presents the same
> difficulties as I tried to avoid by explaining how to provide
> information that would be easy to follow up.)
>
> Please help me understand the problem, without that I see no way of
> making any progress here.
>
> Thanks.
>

I really am trying. :-)

Thank you for bearing with me here, and for trying to help find the
issue, Eli; I appreciate it. Hope this helps.

Thanks.

--=-=-=
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iQJEBAEBCAAuFiEEObM8jZRIDS3cwqSYi0Sgzce5VvIFAl8cVW8QHGJhbmRhbGlA
Z251Lm9yZwAKCRCLRKDNx7lW8tpdD/9mAegOOfEOG7+cceSEC4WVptVFydg6Vp3a
dBKvKLjfAJUh0nESkUpjN5j9jb+6HX84ey4EHLG5Pv9BwQW23x7Kwf4JtoLOsG2P
UaMXOgLsEJY/QvzgUxmAnBjZWsXZLQHY387c5K7wEBbWJzS3/jbLkW4uaurBWxqp
hcq2n1Ve8PnA+p8j+lccSRjFrA3nmJq9EQ2zpumtahc3N29/xlM9PH+MpgFQTIuz
DwNzraDWX/DN5uheB9clyX0bVpR5sHck+2XOuayDtKjbqsgezCMlFlrhDQ7wptVm
nuDW2OT3ycD+S7akFjaFxAksTO0ZyZ0vvqW0LOLqH7wVix7jEOqBkDjWCljGk+Sh
v714t9fC0Qk9cwKJBpB16Oxv3xP6a8gZsAAvfat7yRB5gUB3OiBR6/dfuIgVdGyx
3TvneEOSNC5HU7qofAM7OJWWkEtYyANIPrbL6AOdBLRUsV/3qWub3LgtVDTGDWgc
8Yrur6+fpBT6z1hGdhcWGAKPu/MUeMtxnQ2XEviM9rlg1KVnJkVgpOQAQ4oVw1+D
Fj2hGQqdpsWLKn9m36tcgUCcvxYI5XPv1njGXPfzOaqYOE8ZaKj8da4hsI2PyHvr
S8b1MXSvc2VRzT9hZQsmUpSrKJ4SkdKEchsE/LHHCoO6lO5F4kfHnEfqL/g2hv6R
GZ8F0iuNqw==
=wtXd
-----END PGP SIGNATURE-----
--=-=-=--