From: David Reitter <david.reitter@gmail.com>
To: Carsten Bormann <cabo@tzi.org>
Cc: Emacs-Devel devel <emacs-devel@gnu.org>
Subject: German-postfix fix for Quail
Date: Thu, 21 Aug 2008 19:43:27 -0400 [thread overview]
Message-ID: <10724F20-5032-4AE1-AF3E-C8D9A16B1A52@gmail.com> (raw)
In-Reply-To: <B38610C6-933F-4F40-A82D-5FABFE1D5397@tzi.org>
[-- Attachment #1: Type: text/plain, Size: 3210 bytes --]
cc'ing Emacs devel. For the patch, see below.
On 21 Aug 2008, at 19:19, Carsten Bormann wrote:
> On Aug 22 2008, at 00:09, David Reitter wrote:
>
>> seems sensible. Have you checked your new rules with a corpus?
>
>
> Well it worked well for my everyday typing so far :-)
>
> I just did, in a limited way (a bug in the "wortschatz" website
> doesn't let me search for umlauts and DWDS only allows wildcards at
> the beginning *or* end of words). The only occurrence that I could
> not immediately rule out as an obvious typo was:
>
> "Casa Scanteü" in Bukarest
>
> which doesn't even google (so probably also a typo).
>
> Do you have access to better corpora?
Yes, generally speaking.
The 34 Million word Frankfurter Rundschau corpus for example brings up
a few hapax words, correctly spelled, which are, without exception,
compound nouns:
Diaüberblendschau
Guerillaüberfall
Klimaübereinkommen
Klimaüberwachungs
Yogaübung
Kameraüberwachung
This is for aü only.
For eü, we get this at morpheme boundaries: "geöffenten", for
instance. I find a lot of legitimate eü (mostly at word boundaries in
compounds).
At least for aü, I also get a LOT of misspelled words (like
"Aüßerungen") - more than the correctly spelled ones (most of which
are, as I said frequency=1).
"qü" does not occur.
I presume Quail has some provisions for typing something like
"Yogaübung" even with your patch?
For "eü", perhaps we should consider holding off.
- D
Begin forwarded message:
> From: Carsten Bormann <cabo@tzi.org>
> Date: 21 August 2008 17:24:55 EDT
> To: Carsten Bormann <cabo@tzi.org>
> Cc: David Reitter <david.reitter@gmail.com>
> Subject: Re: German-postfix fix
>
> Aargh, wrong version (it pays to do another C-x v =), should be:
>
>
> --- latin-post.el.~1.31.~ 2008-05-07 05:37:06.000000000 +0200
> +++ latin-post.el 2008-08-21 23:22:20.000000000 +0200
> @@ -1085,7 +1085,7 @@
> aee -> ae
> oe -> ö
> oee -> oe
> -ue -> ü
> +ue -> ü (not after a/e/q)
> uee -> ue
> sz -> ß
> szz -> sz
> @@ -1108,6 +1108,13 @@
> ("UEE" ["UE"])
> ("uee" ["ue"])
> ("szz" ["sz"])
> +
> + ("eue" ["eue"])
> + ("Eue" ["Eue"])
> + ("aue" ["aue"])
> + ("Aue" ["Aue"])
> + ("que" ["que"])
> + ("Que" ["Que"])
> )
>
> (quail-define-package
>
>
> Sorry, Carsten
>
> On Aug 21 2008, at 23:21, Carsten Bormann wrote:
>
>> David,
>>
>> would it be in your purview to apply this important little patch?
>> (I believe you speak German, so you should be able to understand
>> why "Steür" and "Qülle" and "Maür" make no sense :-)
>>
>> Gruesse, Carsten
>>
>> --- latin-post.el.~1.31.~ 2008-05-07 05:37:06.000000000 +0200
>> +++ latin-post.el 2008-08-21 23:16:29.000000000 +0200
>> @@ -1085,7 +1085,7 @@
>> aee -> ae
>> oe -> ö
>> oee -> oe
>> -ue -> ü
>> +ue -> ü (not after a/e/q)
>> uee -> ue
>> sz -> ß
>> szz -> sz
>> @@ -1108,6 +1108,13 @@
>> ("UEE" ["UE"])
>> ("uee" ["ue"])
>> ("szz" ["sz"])
>> +
>> + ("eue" ["eue"])
>> + ("Eue" ["eue"])
>> + ("aue" ["aue"])
>> + ("Aue" ["Aue"])
>> + ("que" ["que"])
>> + ("Que" ["Que"])
>> )
>>
>> (quail-define-package
>
[-- Attachment #2: smime.p7s --]
[-- Type: application/pkcs7-signature, Size: 2193 bytes --]
next parent reply other threads:[~2008-08-21 23:43 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <56EC050C-FE4A-408A-8402-F8AFD817B5BA@tzi.org>
[not found] ` <F8174BA2-E86D-4A07-BDDA-29A2F3037302@tzi.org>
[not found] ` <B2231AFA-B8CE-4A55-9BB2-8EAAA6716497@gmail.com>
[not found] ` <B38610C6-933F-4F40-A82D-5FABFE1D5397@tzi.org>
2008-08-21 23:43 ` David Reitter [this message]
2008-08-22 1:25 ` German-postfix fix for Quail Carsten Bormann
2008-08-27 21:31 ` David Reitter
2008-08-28 16:44 ` Stefan Monnier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/emacs/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=10724F20-5032-4AE1-AF3E-C8D9A16B1A52@gmail.com \
--to=david.reitter@gmail.com \
--cc=cabo@tzi.org \
--cc=emacs-devel@gnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/emacs.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).