From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Sebastian Urban Newsgroups: gmane.emacs.bugs Subject: bug#36359: 'sentence-end-base' 3 additional symbols Date: Tue, 9 Jul 2019 20:29:14 +0200 Message-ID: <1ab0a90a-41bb-6198-d0d5-64bdf09a10b3@gmail.com> References: <87y318f1u4.fsf@mouse.gnus.org> <30a6aa9c-c2db-5142-ddad-cc742b27e0ce@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="264164"; mail-complaints-to="usenet@blaine.gmane.org" User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:60.0) Gecko/20100101 Thunderbird/60.7.2 Cc: 36359@debbugs.gnu.org To: Lars Ingebrigtsen Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Jul 09 20:30:28 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1hkusk-0016aX-Vn for geb-bug-gnu-emacs@m.gmane.org; Tue, 09 Jul 2019 20:30:27 +0200 Original-Received: from localhost ([::1]:52724 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hkusj-0002Ep-Kb for geb-bug-gnu-emacs@m.gmane.org; Tue, 09 Jul 2019 14:30:25 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:34360) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hkusO-0002C3-8H for bug-gnu-emacs@gnu.org; Tue, 09 Jul 2019 14:30:05 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hkusN-00086W-5P for bug-gnu-emacs@gnu.org; Tue, 09 Jul 2019 14:30:04 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:53599) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1hkusM-00084Y-PQ for bug-gnu-emacs@gnu.org; Tue, 09 Jul 2019 14:30:03 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1hkusM-0007HB-IC for bug-gnu-emacs@gnu.org; Tue, 09 Jul 2019 14:30:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Sebastian Urban Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 09 Jul 2019 18:30:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 36359 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: fixed Original-Received: via spool by 36359-submit@debbugs.gnu.org id=B36359.156269696827901 (code B ref 36359); Tue, 09 Jul 2019 18:30:02 +0000 Original-Received: (at 36359) by debbugs.gnu.org; 9 Jul 2019 18:29:28 +0000 Original-Received: from localhost ([127.0.0.1]:34187 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hkurl-0007Fv-JV for submit@debbugs.gnu.org; Tue, 09 Jul 2019 14:29:28 -0400 Original-Received: from mail-wm1-f49.google.com ([209.85.128.49]:51341) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hkurj-0007Fh-Uc for 36359@debbugs.gnu.org; Tue, 09 Jul 2019 14:29:24 -0400 Original-Received: by mail-wm1-f49.google.com with SMTP id 207so4075939wma.1 for <36359@debbugs.gnu.org>; Tue, 09 Jul 2019 11:29:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:subject:to:cc:references:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=+/mvZDxBI+UO46YWYsD7XoaBLBHpomBUO39Nz6utxFY=; b=e8BJcmz2TjoVbLlDYn8hxLbgr0xmM6XhP4nNoLgdDjW5YnPZ5J+4TSW8Bo684ihMTj 4nBJXM12fMRlQ+OF96LrYHcHG2aFeA+GMo5MddtVImsW696TnEm+c8FhGpGHVxtGdPpb A8kuG/TjZesd334PZIYqr4wWS37m1w64QJBtzDQi0ICOYAmkZkQWMWuVM4pLUdjP32a/ n0XtkCmYfBUtH7PGXLl1+ZRvP5vZ/LNAHpeEnjmw5Iz6Zty+NbwxnJTJWW8CgmwNuRmW h12ylC5GPoeuV6zUQ6X8WaM+p/KZZj8ST0p4GsGdCEPK44LHkdL1Q7NIz9nsYAEGZccE 7B5A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:subject:to:cc:references:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=+/mvZDxBI+UO46YWYsD7XoaBLBHpomBUO39Nz6utxFY=; b=M46U0Qi/1Jcekdyp4XXqVb29hFpGXYz8M7+wGCZpesgNX3TyFjeWVkudunAyQ+mHPY TwyCBTm7vhnWJGqVhSsb9mf3zAUHXQ5pY6buNjeE6v1iIUivjPFrP9IM7Q5a69UyYqbs e+aIqdE/7PY9lfx3ODmLknWW8jTQjRvsDZ+OJNlkolX43HQnLaQvMrvFUmhzdQ5Y2hDf yQ4f1ZrpfAFa+A4QvGJjV3xNnnBrBpYqZ+Q8ix2z94jTxHU2TYN5RDI9bY6wXJlZyNXd MpAtq/joW3XV4j/wY5XoWPsHGyQUJaV4baHnctWbGRFVNvBooOBWKuZAtMordbuGiHzt IT1Q== X-Gm-Message-State: APjAAAUhLO57JDIC2DXLGXVNQDWKCkmFf81w9FL+MlL4nOJ8QKzYObQp Rs2GnirRohhc2UTnGwoHYzcWY9GW X-Google-Smtp-Source: APXvYqx0eNIuzkoh+8Fb1JnzdLYa0Z7Tp30UeIaKOO8m5veOgoHVSvwVvyIPCNVDOnH3DUS6hodgUQ== X-Received: by 2002:a1c:b707:: with SMTP id h7mr1004466wmf.45.1562696957637; Tue, 09 Jul 2019 11:29:17 -0700 (PDT) Original-Received: from ?IPv6:2a00:f41:1877:3e0f:70b0:a78c:729d:986a? ([2a00:f41:1877:3e0f:70b0:a78c:729d:986a]) by smtp.gmail.com with ESMTPSA id s3sm4531924wmh.27.2019.07.09.11.29.16 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 09 Jul 2019 11:29:17 -0700 (PDT) In-Reply-To: Content-Language: en-GB X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:162539 Archived-At: >> (...) you can get '»' by typing '>>'. > > But you end up with » in the buffer, so I don't quite follow how > having > in sentence-end-base is useful... You will get » but in generated .PDF, in .TEX it'll be >>. Just like '' in .TEX and ” in .PDF. > So unless anybody objects, I'm adding › and » to the regexp. Thanks, but I'm worried a bit about spaces they put before closing quotes. In the example quotation from your message, at the end, there is "DOT SPACE 'RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK'" - regexp won't recognize this. Perhaps update to this will do: "[.?!…‽] ?[]\"'”’»›)}]*" ^^-these were added But then I don't know how people who use these quotes, actually use them, i.e. with or without space? Because for example: gutenberg.org -> bookshelves -> Français -> any category/book -> Plain Text (UTF-8), doesn't use space, as far as I know.