From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: scole_mail Newsgroups: gmane.emacs.help Subject: utf-8 thai character display issue? Date: Fri, 05 Jun 2020 11:05:18 -0700 Message-ID: <873679v15t.fsf@dstar.dstar.net> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="47347"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.3 (berkeley-unix) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Fri Jun 05 21:51:56 2020 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1jhINf-000CEP-HA for geh-help-gnu-emacs@m.gmane-mx.org; Fri, 05 Jun 2020 21:51:55 +0200 Original-Received: from localhost ([::1]:49578 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jhINe-0002ll-Cq for geh-help-gnu-emacs@m.gmane-mx.org; Fri, 05 Jun 2020 15:51:54 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:51338) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jhGin-0006pz-N5 for help-gnu-emacs@gnu.org; Fri, 05 Jun 2020 14:05:37 -0400 Original-Received: from mout.gmx.net ([212.227.17.21]:34907) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jhGij-0001cX-Eb for help-gnu-emacs@gnu.org; Fri, 05 Jun 2020 14:05:37 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1591380330; bh=0vK0hsaWawnJiSpI003wJtSB52jvoj+gWlKcjoYlke0=; h=X-UI-Sender-Class:From:To:Subject:Date; b=Av6T1gb0d47GL2mIGRLjuZzJ3g+kx2pwoh6RO5MyAr9FF5Pw6ob0Y5L9QaqYabN9o ObNJ+QJSjqjhpTlboxDaddbjKthASCiHXwxVnsg45pEtFEgs+8d+7rGSlvBNNkO/9x wK2LAS14FBOi8fL1QRkGV1ZbQVWBmHKnefPq1Goo= X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c Original-Received: from dstar.dstar.net.gmx.com ([76.178.148.160]) by mail.gmx.com (mrgmx104 [212.227.17.174]) with ESMTPSA (Nemesis) id 1MLi8m-1jPjpp1GbI-00HbyL for ; Fri, 05 Jun 2020 20:05:30 +0200 X-Provags-ID: V03:K1:az5y0lAiD27lpR/kgX4v39UFwSnNtze7uK7VqwvOfvJMbmA1Yor ZVfJAX4AoN28gHHEQPdMVCs1dKjO3LjHfJq2RHAShXTjN1K8H6fbZHS4GrrdkAJY+zma2iE Wg1NCBvJW0GumPp/alJmOYd3SSoEOtk1uLgMSJSa+gMuZKqZZ8KpwXHgeKc+f37n5vDibod PeLSv4E3tKQTNfOI9ZD9Q== X-UI-Out-Filterresults: notjunk:1;V03:K0:ngUm0nX6j+Q=:apq1jWbZUjqi7EaPTTVREc R8RUkjenCJkZAEluzdcAheYFMCh1M8BzV94xwDIlD5Wggjc/e/gh0juoy1h+HV+wPm4Qkh+T2 cTVUoV+GZbcWOiCa2fR2t8Wp359AVapFYjVyZ6dzi+BPgYlh1LzEObjzH3KQ+pR2U9DLErU2j FpwNgJzSj09lcY05Nb1VFIvGmYnorG9GYd7WL19EfYHjhbFWXpR9yh8vxuQuxJ+iQa9r2cuGh bN16OeZhl61PCkC6dv/aFpnWP6OjZXwCpyLLokINjFepzuXT//4K6zAdEjm2m+3WDKkpirZ1n jBg6kS1XlbbF/ughs17ZtVv79kTHE6nNX+1deSuJrE+oIlCLvy/K9+ow7XqCu5OPUYa882uTm acadB4ANiJ9i35AewsdPONBm9QsuYvpWDKeHiMEbMG8lCscf70C4nFa3exJdMTkzuRcULLxiR Xf8jzfX0DA7cApWfL0Cs9IanMYxcpESDr7f/A99hbeOGeOusvftehNyJ3E7yVTCoyWWeenjWW WJ1oIAiZxVbcQoVwKkGC2pU96V5FgUGPVP3QwOQ1pgtqvIQA1jqpyuaIWhoy4l6PHTs0+SPC7 GrfBTQ2IbUDuWy3cRT5pKEK0aOs0bYs+OIX7KtOsSZA7EdixW5Hub25CovmdC9zaSp0RieqmL B0ibEHZevTVLaKFvGXjBeZE9Ix3+b1Gn92G0mcQQepOIdUb/zBgTS48cnQnZSA6qAw5vZOdQX v0ibt1Dnp8d7NOJoYozl6QsZ2WZ5ZbdxX+TtPutlDmQGhyKXNIoV2/qhZh1w7jv+O6nb1FJ4 Received-SPF: pass client-ip=212.227.17.21; envelope-from=scole_mail@gmx.com; helo=mout.gmx.net X-detected-operating-system: by eggs.gnu.org: First seen = 2020/06/05 14:05:30 X-ACL-Warn: Detected OS = Linux 2.2.x-3.x [generic] X-Spam_score_int: -25 X-Spam_score: -2.6 X-Spam_bar: -- X-Spam_report: (-2.6 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001 autolearn=_AUTOLEARN X-Spam_action: no action X-Mailman-Approved-At: Fri, 05 Jun 2020 15:50:05 -0400 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.io gmane.emacs.help:123247 Archived-At: I am sometimes having trouble with displaying and entering a specific thai character =E0=B8=B3 "THAI CHARACTER SARA AM". If that character follows some other thai characters or I try to type it in directly following some other thai character it won't display properly in the buffer. It looks like there is no character there. I know actually it is in the buffer because I can get the properties of it even though it is not being displayed. Here are 2 example words that have issues on my system =E0=B8=81=E0=B8=B3=E0=B8=A5=E0=B8=B1=E0=B8=87 (ing) =E0=B8=99=E0=B9=89=E0=B8=B3 (water) The 2nd word is always not displayed correctly. The first word is sometimes displayed correctly depending on what other thai text files I may have visited early. I have tried using and switching various thai fonts and sizes, but it seems like once the issue appears changing or resizing the fonts never helps. My emacs version is GNU Emacs 26.3 (build 1, x86_64--netbsd, GTK+ Version 3.24.14) of 2020-05-31 I also use this version of emacs built on macos and it doesn't have the problem. Here is the char info for it when it didn't display: C-x C-u =3D gives this: position: 285 of 365 (78%), restriction: <36-366>, column: 10 character: =E0=B8=B3 (displayed as =E0=B8=B3) (codepoint 3635, = #o7063, #xe33) charset: unicode (Unicode (ISO10646)) code point in charset: 0x0E33 script: thai syntax: w which means: word category: .:Base, 1:base vowel, L:Left-to-right (strong), t:Th= ai to input: type "e" with thai-kesmanee input method buffer code: #xE0 #xB8 #xB3 file code: #xE0 #xB8 #xB3 (encoded by coding system utf-8-emacs) display: by this font (glyph code) xft:-PfEd-Garuda-normal-normal-normal-*-13-*-*-*-*-0-iso10646-1 (#x104) Character code properties: customize what to show name: THAI CHARACTER SARA AM old-name: THAI VOWEL SIGN SARA AM general-category: Lo (Letter, Other) decomposition: (compat 3661 3634) (compat '=E0=B9=8D' '=E0=B8=B2') There are text properties here: fontified t [back] Here's for one time it was being display correctly: position: 50 of 83 (59%), column: 7 character: =E0=B8=B3 (displayed as =E0=B8=B3) (codepoint 3635, = #o7063, #xe33) charset: unicode (Unicode (ISO10646)) code point in charset: 0x0E33 script: thai syntax: w which means: word category: .:Base, 1:base vowel, L:Left-to-right (strong), t:Th= ai to input: type "C-x 8 RET e33" or "C-x 8 RET THAI CHARACTER SA= RA AM" buffer code: #xE0 #xB8 #xB3 file code: #xE0 #xB8 #xB3 (encoded by coding system utf-8-unix) display: composed to form "=E0=B8=B3" (see below) Composed using this font: xft:-PfEd-Garuda-normal-normal-normal-*-13-*-*-*-*-0-iso10646-1 by these glyphs: [1 1 3634 259 6 0 5 8 0 nil] Character code properties: customize what to show name: THAI CHARACTER SARA AM old-name: THAI VOWEL SIGN SARA AM general-category: Lo (Letter, Other) decomposition: (compat 3661 3634) (compat '=E0=B9=8D' '=E0=B8=B2') There are text properties here: field output fontified t front-sticky (field inhibit-line-move-field-capture) inhibit-line-move-field-capture t rear-nonsticky t Thank you for any suggestions