all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Drew Adams <drew.adams@oracle.com>
To: "Clément Pit--Claudel" <clement.pit@gmail.com>, emacs-devel@gnu.org
Subject: RE: ASCII-only startup message?
Date: Sat, 26 Dec 2015 18:51:35 -0800 (PST)	[thread overview]
Message-ID: <ba22711d-ce01-4f07-b502-734a2739175e@default> (raw)
In-Reply-To: <567F38EF.1030908@gmail.com>

> > I have never seen any doc or typography guideline that favors
> > a quotation mark over an apostrophe for English contractions,
> > possessives, or non-word plurals.  Quite the contrary.  These
> > use cases are precisely the raison d'être for the apostrophe.
> 
> I don't know much about this topic, so this may not be the type of documents
> you're looking for. Are you aware of the following passage, on page 274 of
> the latest Unicode standard (page 19 of
> http://www.unicode.org/versions/Unicode8.0.0/ch06.pdf)? I believe that David
> Kastrup already quoted it.
> 
> > Apostrophes
> >
> > U+0027 apostrophe is the most commonly used character for apostrophe. For
> > historical reasons, U+0027 is a particularly overloaded character. In
> > ASCII, it is used to represent a punctuation mark (such as right single
> > quotation mark, left single quotation mark, apostrophe punctuation, vertical
> > line, or prime) or a modifier letter (such as apostrophe modifier or acute
> > accent).
> > Punctuation marks generally break words; modifier letters generally are
> > considered part of a word. When text is set, U+2019 right single quotation
> > mark is preferred as apostrophe, but only U+0027 is present on most keyboards.
> > Software commonly offers a facility for automatically converting the U+0027
> > apostrophe to a contextually selected curly quotation glyph. In these systems,
> > a U+0027 in the data stream is always represented as a straight vertical line
> > and can never represent a curly apostrophe or a right quotation mark.
> >
> > Letter Apostrophe.
> >
> > U+02BC modifier letter apostrophe is preferred where the apostrophe is to
> > represent a modifier letter (for example, in transliterations to indicate
> > a glottal stop). In the latter case, it is also referred to as a letter
> > apostrophe.
> >
> > Punctuation Apostrophe.
> >
> > U+2019 right single quotation mark is preferred where the character is to
> > represent a punctuation mark, as for contractions: “We’ve been here
> > before.” In this latter case, U+2019 is also referred to as a punctuation
> > apostrophe.
> >
> > An implementation cannot assume that users’ text always adheres to the
> > distinction between these characters. The text may come from different
> > sources, including mapping from other character sets that do not make this
> > distinction between the letter apostrophe and the punctuation apostrophe/right
> > single quotation mark. In that case, all of them will generally be represented
> > by U+2019.
> >
> > The semantics of U+2019 are therefore context dependent. For example, if
> > surrounded by letters or digits on both sides, it behaves as an in-text
> > punctuation character and does not separate words or lines.
> 
> I understand this as an explicit endorsement of "There’s" over "There's",
> and of "it’s the _wrong thing_" over "it's the _wrong thing_".  Mark Davis
> (the president of the Unicode consortium) clarified this in an email back in
> 1999: http://www.unicode.org/mail-arch/unicode-ml/Archives-Old/UML017/0558.html

I see, and I thank both you and Paul for that information, of which I was unaware.

I should have checked first the Wikipedia article for Apostrophe, which covers
all of this (including the Unicode corner) quite well:
https://en.wikipedia.org/wiki/Apostrophe.

The topic seems still to be muddy water.  Unicode still confuses quotation
mark with apostrophe.  At least the _role_ of apostrophe (used within a word)
is recognized as what I said, as opposed to the role of quotation marks (used
around words).

But Unicode has apparently decided to consider the _same character_ as
appropriate for both uses.  Odd that that choice is made over and against
the same kind of confusion wrt use of the ASCII apostrophe character, which
they rightfully point to as (even more) overloaded in its use ("U+0027 is a
particularly overloaded character").

I bend to Unicode's choice in this, of course.  But it is too bad, IMO,
that it does not distinguish apostrophe and quotation marks in usage,
but recommends using the _same character for both uses_.

Recommending this, even when they could have done otherwise (ASCII's
existing-keyboards excuse does not apply to Unicode choices), seems like
a mistake to me.  But what do I know?  I'm no expert in these things.
And Unicode has wider concerns than English typography - similar-looking
glyphs apparently have different usages across languages.  Perhaps this
confounding of apostrophe and quote mark was a compromise of some kind.

I do agree that one character for two uses (apostrophe and quotation) is
better than one character for several uses, which has been the case for
the ASCII apostrophe.  I am surprised that two different characters were
not defined for these different uses, however, even if they might often
have similar or even identical appearances.

Oh well.  I would still argue for for _Emacs_ to use ASCII apostrophe (and
not a quote mark) for apostrophe uses, both on the basis of appearance -
at least in the default fonts and in the fonts I use - and (especially) on
the basis of simplicity of use for most keyboard users.  Emacs is about
writing and editing more than it is about presentation-level typography.

In terms of appearance, I disagree with Paul's appreciation that "it’s"
is preferable to "it's" in the default Emacs fonts (and more generally in
the fonts I use, for Emacs and for technical doc applications).  But in
any case, I think that ease of use is more important for Emacs than
appearance, for this.

Anyway, thanks to both of you, again, for teaching me about Unicode's
equivalence of apostrophe and quotation mark, and its preference of this
character for the uses of an apostrophe.

It's not my personal preference for uses of an apostrophe.  It's not the
preference used (so far) by my company in its technical docs (FWIW).
And I don't think it should be the preference for Emacs - which _should_
generally use Unicode, and which should respect its recommendations when
appropriate, but which need not bend to using the Unicode right quotation
mark as apostrophe.

I think (so far) that Emacs should stick to using ASCII apostrophe as
apostrophe, in spite of the Unicode standard's recommendation here.
And this mainly for simplicity of use, not appearance (though I also
prefer the appearance, personally, in the fonts I know).

I have now heard Unicode's recommendation (thanks to you), but I don't
read the reason given for that recommendation as a strong reason.

Of course, even the weakest of reasons given by Unicode becomes a
strong reason (in general), just by virtue of being a Unicode
Consortium recommendation.  Whether it is appropriate for Emacs is
another story.



  reply	other threads:[~2015-12-27  2:51 UTC|newest]

Thread overview: 62+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-12-26 17:25 ASCII-only startup message? Paul Eggert
2015-12-26 18:16 ` Eli Zaretskii
2015-12-26 18:41   ` Random832
2015-12-26 18:50     ` Paul Eggert
2015-12-26 19:11       ` Eli Zaretskii
2015-12-26 19:01     ` Eli Zaretskii
2015-12-26 18:45   ` Paul Eggert
2015-12-26 19:10     ` Eli Zaretskii
2015-12-26 19:40       ` Paul Eggert
2015-12-26 20:50         ` Eli Zaretskii
2015-12-26 23:28           ` Paul Eggert
2015-12-27  0:17             ` Drew Adams
2015-12-27  1:03               ` Clément Pit--Claudel
2015-12-27  2:51                 ` Drew Adams [this message]
2015-12-27  1:09               ` Paul Eggert
2015-12-27 15:56                 ` Eli Zaretskii
2015-12-27 18:45                   ` Paul Eggert
2015-12-27  6:58               ` Random832
2015-12-27 14:17                 ` Per Starbäck
2015-12-27 14:55                   ` Drew Adams
2015-12-27 16:35                     ` Per Starbäck
2015-12-27 17:42                       ` Drew Adams
2015-12-27 19:27                         ` Per Starbäck
2015-12-27 22:47                           ` Drew Adams
2015-12-27 23:45                             ` Per Starbäck
2015-12-28  2:01                               ` Drew Adams
2015-12-28  5:51                                 ` Random832
2015-12-28 10:09                                   ` Drew Adams
2015-12-28  6:05                                 ` Per Starbäck
2015-12-28 10:13                                   ` Drew Adams
2015-12-28  9:12                                 ` Nikolai Weibull
2015-12-28 10:15                                   ` Drew Adams
2015-12-28 14:59                                     ` Nikolai Weibull
2015-12-28 18:39                                       ` Drew Adams
2015-12-29  9:06                                       ` Alan Mackenzie
2015-12-28  9:37                             ` Paul Eggert
2015-12-28 10:16                               ` Drew Adams
2015-12-29  7:05                                 ` Random832
2015-12-29  8:01                                   ` Yuri Khan
2015-12-29 14:38                                     ` Random832
2015-12-29 15:58                                     ` Eli Zaretskii
2015-12-29 17:05                                     ` Paul Eggert
2015-12-29 18:00                                       ` Drew Adams
2015-12-29 18:16                                       ` Eli Zaretskii
2015-12-29 19:24                                         ` Paul Eggert
2015-12-29 15:55                                   ` Eli Zaretskii
2015-12-29 17:40                                   ` Drew Adams
2015-12-28 16:31                               ` Eli Zaretskii
2015-12-27  3:44             ` Eli Zaretskii
2015-12-27  8:12               ` Nikolai Weibull
2015-12-28 20:04               ` John Wiegley
2015-12-29  6:50                 ` Richard Stallman
2015-12-29 16:55                   ` John Wiegley
2015-12-29 17:30                     ` Paul Eggert
2015-12-29 18:18                       ` Drew Adams
2016-01-01 13:29                       ` Marcin Borkowski
2016-01-01 17:48                         ` John Wiegley
2016-01-01 17:50                           ` John Wiegley
2016-01-02  8:14                           ` Paul Eggert
2015-12-29 15:05                 ` Random832
2015-12-29 16:49                   ` John Wiegley
     [not found]                 ` <<n5u7gn$6vh$1@ger.gmane.org>
2015-12-29 17:46                   ` Drew Adams
     [not found] <<567ECD8C.1070408@cs.ucla.edu>

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ba22711d-ce01-4f07-b502-734a2739175e@default \
    --to=drew.adams@oracle.com \
    --cc=clement.pit@gmail.com \
    --cc=emacs-devel@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.