From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#23647: 25.1.50; In man pages, links on hyphenated words don't work Date: Sat, 04 Jun 2016 18:35:46 +0300 Message-ID: <83lh2lufu5.fsf@gnu.org> References: <87d1o52ntu.fsf@gmx.net> <83eg8lx6wq.fsf@gnu.org> <878tys31i6.fsf@gmx.net> <83vb1wwg0t.fsf@gnu.org> <87a8j7tzto.fsf@gmx.net> Reply-To: Eli Zaretskii NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1465054587 13825 80.91.229.3 (4 Jun 2016 15:36:27 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 4 Jun 2016 15:36:27 +0000 (UTC) Cc: 23647@debbugs.gnu.org To: Stephen Berman Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Sat Jun 04 17:36:16 2016 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1b9DcV-0004lH-K7 for geb-bug-gnu-emacs@m.gmane.org; Sat, 04 Jun 2016 17:36:15 +0200 Original-Received: from localhost ([::1]:33056 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1b9DcU-0007nG-R3 for geb-bug-gnu-emacs@m.gmane.org; Sat, 04 Jun 2016 11:36:14 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:54840) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1b9DcO-0007mx-70 for bug-gnu-emacs@gnu.org; Sat, 04 Jun 2016 11:36:09 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1b9DcI-0005XH-5r for bug-gnu-emacs@gnu.org; Sat, 04 Jun 2016 11:36:07 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:42175) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1b9DcI-0005XD-2R for bug-gnu-emacs@gnu.org; Sat, 04 Jun 2016 11:36:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1b9DcH-00061N-RG for bug-gnu-emacs@gnu.org; Sat, 04 Jun 2016 11:36:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sat, 04 Jun 2016 15:36:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 23647 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: patch Original-Received: via spool by 23647-submit@debbugs.gnu.org id=B23647.146505452923101 (code B ref 23647); Sat, 04 Jun 2016 15:36:01 +0000 Original-Received: (at 23647) by debbugs.gnu.org; 4 Jun 2016 15:35:29 +0000 Original-Received: from localhost ([127.0.0.1]:54512 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1b9Dbl-00060X-HN for submit@debbugs.gnu.org; Sat, 04 Jun 2016 11:35:29 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:35583) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1b9Dbk-00060L-I3 for 23647@debbugs.gnu.org; Sat, 04 Jun 2016 11:35:28 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1b9Dba-0005Sx-AK for 23647@debbugs.gnu.org; Sat, 04 Jun 2016 11:35:23 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:39917) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1b9Dba-0005Sq-73; Sat, 04 Jun 2016 11:35:18 -0400 Original-Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:2805 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1b9DbY-00078z-GY; Sat, 04 Jun 2016 11:35:16 -0400 In-reply-to: <87a8j7tzto.fsf@gmx.net> (message from Stephen Berman on Mon, 30 May 2016 15:55:47 +0200) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:119057 Archived-At: > From: Stephen Berman > Cc: 23647@debbugs.gnu.org > Date: Mon, 30 May 2016 15:55:47 +0200 > > > I'm not enough of a roff expert to tell, but how about asking on the > > Groff list? > > I did that and got this feedback from Steffen Nurpmeso: > > > I have been convinced that soft hyphen is a control character and > > not something visual, it should be used as a «break-indicator» > > rather than as a hyphenation character, interpretation of which is > > left as an excercise for the processing software. I have no idea > > still but would guess groff uses "hyphen minus" U+002D or hyphen > > U+2010 if Unicode is possible. > > In a followup to another response he added: > > > For display purposes however i think U+00AD can't be used > > directly, but will be replaced by the renderer to either nothing, > > if no wrap is to be applied at the character position, or > > something appropriate, like ASCII hyphen-minus or some extended > > Unicode "Pd" letter, of which there are some (e.g., U+058A > > ARMENIAN HYPHEN, U+1400 CANADIAN SYLLABICS HYPHEN, and more). > > And he also made this suggestion: > > > Eli Zaretskii is so active on the > > Unicode list, why don't you use the Pd character class for > > detecting «hyphen»? I guess this should cover all such things > > already as of today, thanks to Werner Lemberg?! > > So how should we proceed from here? We could add U+2010 to the regexp > in my patch, which would then be this: "[-‐­]" (hyphen-minus (ASCII 45), > hyphen (U+2010), soft hyphen (U+00AD) -- it seems harmless to retain the > latter, given that man.el already uses it elsewhere), but if these are > all included in the Unicode Pd character class along with other possible > hyphen characters, maybe a different approach is required. I know > nothing about the Pd character class and how to detect it with Elisp; I > also don't know if doing that would lead to further changes in man.el, > making this a larger undertaking. What do you suggest? I'd go with just those 3, I think the others will not be produced by Groff. Thanks.