From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: =?UTF-8?Q?Micha=C5=82?= Nazarewicz Newsgroups: gmane.emacs.bugs Subject: bug#20871: 25.0.50; fill-single-char-nobreak-p does not recognize a single-letter word when it is preceded by an open paren Date: Mon, 19 Aug 2019 16:36:15 +0100 Message-ID: References: <9A9C6F59-CB27-42D1-911E-F027B443B9BE@acm.org> <8336i1p8zd.fsf@gnu.org> <83mug8nut2.fsf@gnu.org> <83y2zpkxmt.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="79547"; mail-complaints-to="usenet@blaine.gmane.org" Cc: Mattias =?UTF-8?Q?Engdeg=C3=A5rd?= , mbork@mbork.pl, 20871@debbugs.gnu.org To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Mon Aug 19 17:37:13 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1hzjia-000KWi-Og for geb-bug-gnu-emacs@m.gmane.org; Mon, 19 Aug 2019 17:37:12 +0200 Original-Received: from localhost ([::1]:54594 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1hzjiZ-0005ow-7K for geb-bug-gnu-emacs@m.gmane.org; Mon, 19 Aug 2019 11:37:11 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:57055) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1hzjiR-0005nC-5r for bug-gnu-emacs@gnu.org; Mon, 19 Aug 2019 11:37:04 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hzjiP-0005hn-Vx for bug-gnu-emacs@gnu.org; Mon, 19 Aug 2019 11:37:03 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:51624) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1hzjiP-0005hj-Sh for bug-gnu-emacs@gnu.org; Mon, 19 Aug 2019 11:37:01 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1hzjiP-00022L-MG for bug-gnu-emacs@gnu.org; Mon, 19 Aug 2019 11:37:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: =?UTF-8?Q?Micha=C5=82?= Nazarewicz Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Mon, 19 Aug 2019 15:37:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 20871 X-GNU-PR-Package: emacs Original-Received: via spool by 20871-submit@debbugs.gnu.org id=B20871.15662289947793 (code B ref 20871); Mon, 19 Aug 2019 15:37:01 +0000 Original-Received: (at 20871) by debbugs.gnu.org; 19 Aug 2019 15:36:34 +0000 Original-Received: from localhost ([127.0.0.1]:60445 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hzjhy-00021X-Dg for submit@debbugs.gnu.org; Mon, 19 Aug 2019 11:36:34 -0400 Original-Received: from mail-wr1-f42.google.com ([209.85.221.42]:46738) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hzjhw-00021E-OW for 20871@debbugs.gnu.org; Mon, 19 Aug 2019 11:36:33 -0400 Original-Received: by mail-wr1-f42.google.com with SMTP id z1so9175858wru.13 for <20871@debbugs.gnu.org>; Mon, 19 Aug 2019 08:36:32 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=6kN50EJ3AET88Uj0VYm7IGtDjYhOBDGhG9qTb3M0cwI=; b=neyfn0mmw2LZdT4LOAqFBXOP/rFWvYiJUyISHXyf20+y1XuN7aB6SoMk4JW9vk3gCb rmbzAJht+cqmbcAwbNfJ1Whw9e8gWFB4lY+x+thKpPQrz3uciV6Im6EVajBZvhXrByH2 O2fU2219WRrUFQ1LOPRffFM9gEKMnf74eTcAHiAT++Ff4/Qutjcb8+HQaVCqbzIZZujz CWfpE/UsW2uFCkrouPEXNENUw5zlxbF/3pTwuuIUyXjfPmDsJkV46BkEgKg1cTeJrSkY tcqi4CNVut76q74lruwx8KLxs8+vjz7ilvsC4On0BGQGHiiwzk6P7zRq7xGakh4UI8Y4 jWjA== X-Gm-Message-State: APjAAAXSOpvgALz1OWc9zkJZ3wrLMcAy3NQkxDHucH9ZobsVWM5sD54z GG0uKtVSodUFnEeYfpKpYL075sKfYDHcu1JMHqDYH29R X-Google-Smtp-Source: APXvYqw0p2zoH93DUOsqVXmhNNaoWa5BEXo1FjbhcPwPKgFyYA+gyS5AUAr+HF1MWIJ2gLozcB93q2YwdtHU6u0Ze5A= X-Received: by 2002:a05:6000:104c:: with SMTP id c12mr25211377wrx.328.1566228987066; Mon, 19 Aug 2019 08:36:27 -0700 (PDT) In-Reply-To: <83y2zpkxmt.fsf@gnu.org> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:165386 Archived-At: > > From: Micha=C5=82 Nazarewicz > > Date: Mon, 19 Aug 2019 15:07:56 +0100 > > As discussed previously, =E2=80=98fill-single-char-nobreak-p=E2=80=99 a= nd > > =E2=80=98fill-polish-nobreak-p=E2=80=99 and serve pretty much the same = purpose. When > > I wrote the former I had Polish typography in mind and obviously the > > latter is meant to handle the same case. As such, having those two > > functions don=E2=80=99t provide much option to the user. On Mon, 19 Aug 2019 at 16:01, Eli Zaretskii wrote: > If both functions attempt to produce the same behavior, then yes, we > need only one. But then wouldn't we need a second one, to produce the > behavior expected, say, in US English? The expected behaviour for US English is achieved by not using the function at all. Unless you mean that someone want to follow the rule in English even though it=E2=80=99s not an established thing in that language. In that cas= e they can just use the existing functions and they will work for English. In that case I could see a potential reason to have multiple functions: - =E2=80=98fill-polish-nobreak-p=E2=80=99 =E2=80=93 don=E2=80=99t break aft= er a, e, i, o, u, w or z; - =E2=80=98fill-czech-nobreak-p=E2=80=99 =E2=80=93 don=E2=80=99t break afte= r a, i, k, o, s, u, v or z; and =E2=80=93 =E2=80=98fill-single-char-nobreak-p=E2=80=99 =E2=80=93 don=E2=80= =99t break after any single letter word. This can also be achieved by a single function and a variable listing all the characters. Note also that there is a different rule which applies to all languages which deals with breaking line between number and a unit, e.g. =E2=80=9860 s=E2=80=99, =E2=80=98100 m=E2=80=99 etc. I=E2=80= =99m not sure how this fits with current discussion since neither tildify nor the *-nobreak-p functions deal with that case. > > > I'm also okay with extending tildify.el to support more than just > > > Czech rules, but that's a separate issue. > > > > The differences between Czech and Polish can largely be ignored. > > I didn't mean Polish, I meant in general languages where the > conventions are different. Surely, there are some, and tildify > explicitly assumes that. I don=E2=80=99t think there are. It is possible that I=E2=80=99m incorrect= but all the materials I=E2=80=99ve found talked about Polish and Czech only. Polis= h Wikipedia entry=C2=B9 explicitly states that this rule is only for those tw= o languages. =C2=B9 https://pl.wikipedia.org/wiki/Sierotka_(typografia) (note that =E2=80=98sierotka=E2=80=99 literally translates to =E2=80=98orphan=E2=80= =99 but is a different thing than =E2=80=98orphan=E2=80=99 in English typography). -- Best regards =E3=83=9F=E3=83=8F=E3=82=A6 =E2=80=9C=F0=9D=93=B6=F0=9D=93=B2=F0=9D=93=B7= =F0=9D=93=AA86=E2=80=9D =E3=83=8A=E3=82=B6=E3=83=AC=E3=83=B4=E3=82=A4=E3=83= =84 =C2=ABIf at first you don=E2=80=99t succeed, give up skydiving=C2=BB