unofficial mirror of bug-gnu-emacs@gnu.org 
* bug#36359: 'sentence-end-base' 3 additional symbols
@ 2019-06-24 16:13 Sebastian Urban
  2019-07-09  1:12 ` Lars Ingebrigtsen
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian Urban @ 2019-06-24 16:13 UTC
  To: 36359

I just wanted to suggest to perhaps add this:

- '>' - GREATER-THAN SIGN
   (codepoint 62, #o76, #x3e),
- '»' - RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
   (codepoint 187, #o273, #xbb),
- '›' - SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
   (codepoint 8250, #o20072, #x203a),

to the value of 'sentence-end-base', like this for example:

-"[.?!…‽][]\"'”’)}]*"
+"[.?!…‽][]\"'”’»›)}>]*"

And perhaps update the example in section 15.8 (Info), "Regular
Expression Example", of the Emacs manual.
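
For readers who want to try the proposed value without patching Emacs, here is a minimal sketch. It assumes the code is evaluated in an init file or in *scratch*; the helper name `my-sentence-end-base-matches-p' is hypothetical and only serves to check the regexp in isolation. The `sentence-end' function combines this base with whitespace rules when Emacs actually looks for sentence ends.

```elisp
;; Sketch only: set the value proposed above (the "+" line).
(setq sentence-end-base "[.?!…‽][]\"'”’»›)}>]*")

(defun my-sentence-end-base-matches-p (string) ; hypothetical helper
  "Return non-nil if `sentence-end-base' matches all of STRING."
  ;; \\` and \\' anchor the match to the start and end of STRING.
  (string-match-p (concat "\\`\\(?:" sentence-end-base "\\)\\'") string))

(my-sentence-end-base-matches-p ".»")   ; => 0 (non-nil)
(my-sentence-end-base-matches-p "?›")   ; => 0
(my-sentence-end-base-matches-p "!>")   ; => 0
(my-sentence-end-base-matches-p ".«")   ; => nil, an opening quote is not in the set
```

The leading "]" in the second bracket expression is a literal "]", so the new characters can go anywhere after it.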


S. U.


In GNU Emacs 26.2 (build 1, i686-w64-mingw32)
  of 2019-04-13 built on CIRROCUMULUS
Repository revision: fd1b34bfba8f3f6298df47c8e10b61530426f749
Windowing system distributor 'Microsoft Corp.', version 6.1.7601






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-06-24 16:13 bug#36359: 'sentence-end-base' 3 additional symbols Sebastian Urban
@ 2019-07-09  1:12 ` Lars Ingebrigtsen
  2019-07-09  9:07   ` Sebastian Urban
  0 siblings, 1 reply; 9+ messages in thread
From: Lars Ingebrigtsen @ 2019-07-09  1:12 UTC
  To: Sebastian Urban; +Cc: 36359

Sebastian Urban <mrsebastianurban@gmail.com> writes:

> I just wanted to suggest to perhaps add this:
>
> - '>' - GREATER-THAN SIGN
>   (codepoint 62, #o76, #x3e),
> - '»' - RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
>   (codepoint 187, #o273, #xbb),
> - '›' - SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
>   (codepoint 8250, #o20072, #x203a),
>
> to the value of 'sentence-end-base', like this for example:
>
> -"[.?!…‽][]\"'”’)}]*"
> +"[.?!…‽][]\"'”’»›)}>]*"

I can see » being useful here, but do people use > in these
circumstances?

And › I've never seen before -- what language is that used in?

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-07-09  1:12 ` Lars Ingebrigtsen
@ 2019-07-09  9:07   ` Sebastian Urban
  2019-07-09 12:17     ` Lars Ingebrigtsen
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian Urban @ 2019-07-09  9:07 UTC
  To: Lars Ingebrigtsen; +Cc: 36359

> I can see » being useful here, (...)

So, I'll take it as done.

> (...) but do people use > in these circumstances?

Well, I was thinking about writing them in LaTeX documents, where (if
font encoding is OT4 or T1) you can get '»' by typing '>>'.  If this
is not enough, then skip it, I'll set it manually.

> And › I've never seen before -- what language is that used in?

And this is (I think) used for inner quotes, just like '’',
i.e. « ... ‹ ... › ... ».

Here is a short thread about it:
https://forum.wordreference.com/threads/fr-citations-imbriqu%C3%A9es-quotation-within-a-quotation-typography.1061025/






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-07-09  9:07   ` Sebastian Urban
@ 2019-07-09 12:17     ` Lars Ingebrigtsen
  2019-07-09 18:29       ` Sebastian Urban
  0 siblings, 1 reply; 9+ messages in thread
From: Lars Ingebrigtsen @ 2019-07-09 12:17 UTC
  To: Sebastian Urban; +Cc: 36359

Sebastian Urban <mrsebastianurban@gmail.com> writes:

>> (...) but do people use > in these circumstances?
>
> Well, I was thinking about writing them in LaTeX documents, where (if
> font encoding is OT4 or T1) you can get '»' by typing '>>'.  If this
> is not enough, then skip it, I'll set it manually.

But you end up with » in the buffer, so I don't quite follow how
having > in sentence-end-base is useful...

>> And › I've never seen before -- what language is that used in?
>
> And this is (I think) used for inner quotes, just like '’',
> i.e. « ... ‹ ... › ... ».

Right:

« La Constitution du 3 septembre 1791 proclame la nécessité d'‹ une instruction
publique, commune à tous les citoyens, gratuite à l'égard des parties
d'enseignement indispensables pour tous les hommes ›. »

That example is also interesting because it has the full stop before the
», while I was wondering whether the French did that (or put it after
the »), so I guess that answers that.

So unless anybody objects, I'm adding › and » to the regexp.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-07-09 12:17     ` Lars Ingebrigtsen
@ 2019-07-09 18:29       ` Sebastian Urban
  2019-07-09 19:40         ` Lars Ingebrigtsen
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian Urban @ 2019-07-09 18:29 UTC
  To: Lars Ingebrigtsen; +Cc: 36359

>> (...) you can get '»' by typing '>>'.
>
> But you end up with » in the buffer, so I don't quite follow how
> having > in sentence-end-base is useful...

You will get » only in the generated PDF; in the .tex source it will
be >>.  It is the same as with '', which becomes ” in the PDF.
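
One way to accept '>' in LaTeX sources only, leaving the global default alone, is a buffer-local setting from a mode hook. This is a sketch under assumptions: it relies on the built-in tex-mode running `latex-mode-hook' (AUCTeX users would likely use `LaTeX-mode-hook' instead), and the function name `my-latex-sentence-end-base' is hypothetical.

```elisp
;; Sketch only: let ">" close a sentence in LaTeX buffers, where ">>"
;; in the source stands for "»" in the output.
(defun my-latex-sentence-end-base ()    ; hypothetical helper name
  "Make `sentence-end-base' accept `>' in the current buffer only."
  (setq-local sentence-end-base "[.?!…‽][]\"'”’»›)}>]*"))

;; Assumption: built-in tex-mode; adjust the hook name for AUCTeX.
(add-hook 'latex-mode-hook #'my-latex-sentence-end-base)
```

Because the value is buffer-local, sentence commands in other modes keep the default.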

> So unless anybody objects, I'm adding › and » to the regexp.

Thanks, but I'm a bit worried about the space they put before closing
quotes.  In the example quotation from your message there is, at the
end, "DOT SPACE 'RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK'", and the
regexp won't recognize this.  Perhaps updating it to this will do:

    "[.?!…‽] ?[]\"'”’»›)}]*"
            ^^-- these two characters (space and ?) were added

But then I don't know how people who use these quotes, actually use
them, i.e. with or without space?  Because for example: gutenberg.org
-> bookshelves -> Français -> any category/book -> Plain Text (UTF-8),
doesn't use space, as far as I know.
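
The difference between the two variants can be checked directly. The sketch below compares the regexp without the optional space against the one suggested above. It assumes an ordinary ASCII space before the closing quote; a no-break space would not be matched by either pattern.

```elisp
;; Sketch only: compare the two candidate values on ". »" and ".»".
(let ((without-space "[.?!…‽][]\"'”’»›)}]*")
      (with-space    "[.?!…‽] ?[]\"'”’»›)}]*"))
  (list
   ;; \\` and \\' force the regexp to match the whole string.
   (string-match-p (concat "\\`\\(?:" without-space "\\)\\'") ". »")
   (string-match-p (concat "\\`\\(?:" with-space "\\)\\'") ". »")
   (string-match-p (concat "\\`\\(?:" with-space "\\)\\'") ".»")))
;; => (nil 0 0)
```

So the " ?" variant accepts both spellings, while the variant without it stops at the full stop when a space follows.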






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-07-09 18:29       ` Sebastian Urban
@ 2019-07-09 19:40         ` Lars Ingebrigtsen
  2019-07-10  7:18           ` Sebastian Urban
  0 siblings, 1 reply; 9+ messages in thread
From: Lars Ingebrigtsen @ 2019-07-09 19:40 UTC
  To: Sebastian Urban; +Cc: 36359

Sebastian Urban <mrsebastianurban@gmail.com> writes:

> But then I don't know how people who use these quotes, actually use
> them, i.e. with or without space?  Because for example: gutenberg.org
> -> bookshelves -> Français -> any category/book -> Plain Text (UTF-8),
> doesn't use space, as far as I know.

Yeah, I thought it looked pretty strange with the spaces, too, so unless
any French people speak up and want to have that added, I think we can
just leave it as it is.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-07-09 19:40         ` Lars Ingebrigtsen
@ 2019-07-10  7:18           ` Sebastian Urban
  2019-08-01 13:27             ` Sebastian Urban
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian Urban @ 2019-07-10  7:18 UTC
  To: Lars Ingebrigtsen; +Cc: 36359

> (...) unless any French people speak up and want to have that added,
> I think we can just leave it as it is.

I agree.

So to sum things up, » and › were added, while > I'll set on my own,
until one day someone else will add another argument.

If that's right, I'll consider this thread closed.






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-07-10  7:18           ` Sebastian Urban
@ 2019-08-01 13:27             ` Sebastian Urban
  2019-08-01 13:29               ` Lars Ingebrigtsen
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian Urban @ 2019-08-01 13:27 UTC
  To: Sebastian Urban, Lars Ingebrigtsen; +Cc: 36359

> So to sum things up, » and › were added, while > I'll set
> on my own, until one day someone else will add another
> argument.

Just to clear up any doubts, can I get a confirmation on
this, because I'm not sure whether I should consider it done
or not.






* bug#36359: 'sentence-end-base' 3 additional symbols
  2019-08-01 13:27             ` Sebastian Urban
@ 2019-08-01 13:29               ` Lars Ingebrigtsen
  0 siblings, 0 replies; 9+ messages in thread
From: Lars Ingebrigtsen @ 2019-08-01 13:29 UTC
  To: Sebastian Urban; +Cc: 36359

Sebastian Urban <mrsebastianurban@gmail.com> writes:

>> So to sum things up, » and › were added, while > I'll set
>> on my own, until one day someone else will add another
>> argument.
>
> Just to clear up any doubts, can I get a confirmation on
> this, because I'm not sure whether I should consider it done
> or not.

Your summation was correct.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no





