From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: Dmitry Alexandrov Newsgroups: gmane.emacs.help Subject: Re: (Mis?)using quote as apostrophe (was: Hunspell and contractions with apostrophes) Date: Wed, 27 May 2020 09:05:24 +0300 Message-ID: References: <87y2pelh8t.fsf@ericabrahamsen.net> Mime-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha256; protocol="application/pgp-signature" Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="88645"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) Cc: Eric Abrahamsen , help-gnu-emacs To: Yuri Khan Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Wed May 27 08:09:10 2020 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1jdpFV-000N2I-V3 for geh-help-gnu-emacs@m.gmane-mx.org; Wed, 27 May 2020 08:09:09 +0200 Original-Received: from localhost ([::1]:46584 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jdpFU-0004cf-Lc for geh-help-gnu-emacs@m.gmane-mx.org; Wed, 27 May 2020 02:09:09 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:52766) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jdpC1-0002jp-Iz for help-gnu-emacs@gnu.org; Wed, 27 May 2020 02:05:33 -0400 Original-Received: from relay-1.mailobj.net ([213.182.54.6]:35496) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jdpC0-0007QS-Bi for help-gnu-emacs@gnu.org; Wed, 27 May 2020 02:05:33 -0400 Original-Received: from v-1c.localdomain (unknown [192.168.90.161]) by relay-1.mailobj.net (Postfix) with SMTP id B92361281; Wed, 27 May 2020 08:05:29 +0200 (CEST) Original-Received: by mail-1.net-c.com [213.182.54.15] with ESMTP Wed, 27 May 2020 08:05:29 +0200 (CEST) X-EA-Auth: sZb7blwmIBzTGT/t8Z4f6a7oGyg33OpQwrXaSAfIZ2Ko5XlzfyGwBEFKLcfxKca/WNOh/YVEyWkw299C4s+t7ghmWC7lspIS In-Reply-To: (Yuri Khan's message of "Wed, 27 May 2020 11:22:42 +0700") OpenPGP: id=525F7E60AD812C2361752BB4C8B0F8548EE7F3E7; url=https://openpgpkey.gnui.org/.well-known/openpgpkey/gnui.org/hu/hr4k5tkxm6shwdc18su4bkm34w3dctjd Received-SPF: pass client-ip=213.182.54.6; envelope-from=dag@gnui.org; helo=relay-1.mailobj.net X-detected-operating-system: by eggs.gnu.org: First seen = 2020/05/27 02:05:29 X-ACL-Warn: Detected OS = Linux 3.1-3.10 X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, RCVD_IN_MSPIKE_H4=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001 autolearn=_AUTOLEARN X-Spam_action: no action X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.io gmane.emacs.help:123138 Archived-At: --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Yuri Khan wrote: > On Wed, 27 May 2020 at 07:23, Dmitry Alexandrov wrote: > >> First and foremost, your =E2=80=99 is *not* an apostrophe, it=CA=BCs a r= ight quote. Apostrophe is =CA=BC. >> I you insist on using right single quote as apostrophe, though, I have n= o idea, how to make ispell.el pass it to hunspell(1) as a part of a word. = Neither why ever do that. > > Because the Unicode standard says so. > > , entry for U+2019: > > 2019 =E2=80=99 RIGHT SINGLE QUOTATION MARK > =3D single comma quotation mark > =E2=80=A2 this is the preferred character to use for apostrophe > =E2=86=92 0027 ' apostrophe > =E2=86=92 02BC =CA=BC modifier letter apostrophe > =E2=86=92 275C =E2=9D=9C heavy single comma quotation mark orn= ament > > (Yes, it=E2=80=99s unfortunate for word boundary algorithms.) Heh. I=CA=BCm afraid, it=CA=BCs not merely unfortunate, it=CA=BCs totally = in spite of the whole spirit of Unicode, and simply silly. Just as if it s= aid that =E2=80=9D (RIGHT DOUBLE QUOTATION MARK) were the preferred charact= er for inches sign. > Also, : > > 02BC =CA=BC MODIFIER LETTER APOSTROPHE > =3D apostrophe > =E2=80=A2 glottal stop, glottalization, ejective > =E2=80=A2 many languages use this as a letter of their alphabe= ts > =E2=80=A2 used as a tone marker in Bodo, Dogri, and Maithili > =E2=80=A2 2019 =E2=80=99 is the preferred character for a punc= tuation apostrophe A! _Punctuation_ apostrophe. Whatever is it, it=CA=BCs not what we are ta= king about here. hunspell does not check punctuation. --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iIMEARYIACsWIQRSX35grYEsI2F1K7TIsPhUjufz5wUCXs4DJA0cZGFnQGdudWku b3JnAAoJEMiw+FSO5/PnOdkBAJybuobgRpfx//ulTm2aApgWFi4XvYH7UJWBuoHT RLGpAP9QHXSCmWLE4Agpk135CTj3jktLvWc/bnz2RE082eojBQ== =YbJs -----END PGP SIGNATURE----- --=-=-=--