bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.

unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed

* bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
@ 2008-12-31 13:41 ` Aidan Kehoe
  2008-12-31 18:40   ` Stefan Monnier
                     ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Aidan Kehoe @ 2008-12-31 13:41 UTC (permalink / raw)
  To: emacs-pretest-bug

[-- Attachment #1: message body text --]
[-- Type: text/plain, Size: 199 bytes --]


I was asked to resend this after the release; see
http://mid.gmane.org/17834.14174.503231.877601@parhasard.net and
http://thread.gmane.org/gmane.emacs.pretest.bugs/16625 for the associated
thread.


[-- Attachment #2: forwarded message --]
[-- Type: message/rfc822, Size: 26157 bytes --]

From: Aidan Kehoe <kehoea@parhasard.net>
To: emacs-pretest-bug@gnu.org
Subject: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
Date: Sun, 14 Jan 2007 14:59:58 +0100
Message-ID: <17834.14174.503231.877601@parhasard.net>


There are two widely-used mappings from ASCII to the International Phonetic
Alphabet used on Usenet and in the online linguistics community
generally. The extant ipa.el uses its own convention to map from ASCII to
the IPA; this patch addes support for the more-widely used ones to it, with
two new input methods, kirshenbaum-ipa and x-sampa-ipa, following the names
of the two conventions.

It also moves the file encoding to UTF-8, since Kirshenbaum and X-SAMPA
require more character codes than are provided by the Mule IPA character
set, and since UTF-8 is more compatible with other applications. 

In the event that my mailer or yours trashes the character encoding of the
patch, I’ve made a bzip2’d version of the full file available at
http://www.parhasard.net/ipa-20070114.el.bz2 . 

leim/ChangeLog addition:

2007-01-14  Aidan Kehoe  <kehoea@parhasard.net>

	* quail/ipa.el:
	Recode the file as UTF-8, for better interoperability with other
	applications. 
	* quail/ipa.el ("kirshenbaum-ipa"):
	* quail/ipa.el ("x-sampa-ipa"):
	* quail/ipa.el (x-sampa-prepend-to-keymap-entry): New function.
	* quail/ipa.el (x-sampa-implosive-submap): New Quail submap.
	* quail/ipa.el (x-sampa-underscore-implosive): New function.
	Two new input methods, both following widely-used Usenet
	conventions for mapping ASCII to the IPA. Kirshenbaum is common in
	sci.lang and alt.usage.english, X-SAMPA in various (mostly
	European) non-English language fora. X-SAMPA is slightly more
	complex to support in Quail that Kirshenbaum, whence the two extra
	function and one extra submap to support it. 


GNU Emacs Trunk source patch:
Diff command:   cvs -q diff -Nu
Files affected: leim/quail/ipa.el
===================================================================
RCS

Index: leim/quail/ipa.el
===================================================================
RCS file: /sources/emacs/emacs/leim/quail/ipa.el,v
retrieving revision 1.8
diff -u -u -r1.8 ipa.el
--- leim/quail/ipa.el	13 Dec 2006 01:13:56 -0000	1.8
+++ leim/quail/ipa.el	14 Jan 2007 13:41:57 -0000
@@ -1,8 +1,10 @@
-;;; ipa.el --- Quail package for inputting IPA characters  -*-coding: iso-2022-7bit;-*-
-
+;;; quail/ipa.el -- Quail package for inputting IPA. -*- coding: utf-8 -*- 
+;;; 
 ;; Copyright (C) 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006
 ;;   National Institute of Advanced Industrial Science and Technology (AIST)
 ;;   Registration Number H14PRO021
+;; Licensed to the Free Software Foundation.
+;; Copyright (C) 2006 Free Software Foundation, Inc.
 
 ;; Keywords: multilingual, input method, IPA
 
@@ -23,11 +25,10 @@
 ;; Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor,
 ;; Boston, MA 02110-1301, USA.
 
-;;; Commentary:
-
 ;;; Code:
 
 (require 'quail)
+(eval-when-compile (require 'cl))
 
 (quail-define-package
  "ipa" "IPA" "IPA" t
@@ -37,77 +38,473 @@
  nil t nil nil nil nil nil nil nil nil t)
 
 (quail-define-rules
- ("i" ?^[,0 ^[(B)
- ("I" ?^[,0!^[(B)
- ("e" ?^[,0"^[(B)
- ("/3" ?^[,0#^[(B)
- ("E" ?^[,0#^[(B)
- ("ae" ?^[,0$^[(B)
- ("a" ?^[,0%^[(B)
- ("i-" ?^[,0&^[(B)
- ("/e" ?^[,0'^[(B)
- ("/a" ?^[,0(^[(B)
- ("/m" ?^[,0)^[(B)
- ("&" ?^[,0*^[(B)
- ("/v" ?^[,0+^[(B)
- ("A" ?^[,0,^[(B)
- ("o|" ?^[,0,^[(B)
- ("y" ?^[,0-^[(B)
- ("Y" ?^[,0.^[(B)
- ("o/" ?^[,0/^[(B)
- ("oe" ?^[,00^[(B)
- ("OE" ?^[,01^[(B)
- ("u-" ?^[,02^[(B)
- ("o-" ?^[,03^[(B)
- ("u" ?^[,04^[(B)
- ("U" ?^[,05^[(B)
- ("o" ?^[,06^[(B)
- ("/c" ?^[,07^[(B)
- ("/A" ?^[,08^[(B)
- ("|o" ?^[,08^[(B)
- ("e-" ?^[,0:^[(B)
- ("e|" ?^[,0:^[(B)
- ("/3~" ?^[,0;^[(B)
- ("E~" ?^[,0;^[(B)
- ("A~" ?^[,0<^[(B)
- ("oe~" ?^[,0=^[(B)
- ("/c~" ?^[,0>^[(B)
- ("p" ?^[,0@^[(B)
- ("b" ?^[,0A^[(B)
- ("t" ?^[,0B^[(B)
- ("d" ?^[,0C^[(B)
- ("k" ?^[,0D^[(B)
- ("g" ?^[,0E^[(B)
- ("f" ?^[,0F^[(B)
- ("v" ?^[,0G^[(B)
- ("th" ?^[,0H^[(B)
- ("dh" ?^[,0I^[(B)
- ("s" ?^[,0J^[(B)
- ("z" ?^[,0K^[(B)
- ("sh" ?^[,0L^[(B)
- ("zh" ?^[,0M^[(B)
- ("3" ?^[,0M^[(B)
- ("c," ?^[,0N^[(B)
- ("x" ?^[,0O^[(B)
- ("/R" ?^[,0P^[(B)
- ("h" ?^[,0Q^[(B)
- ("m" ?^[,0R^[(B)
- ("n" ?^[,0S^[(B)
- ("gn" ?^[,0T^[(B)
- ("ng" ?^[,0U^[(B)
- ("r" ?^[,0V^[(B)
- ("R" ?^[,0W^[(B)
- ("/r" ?^[,0X^[(B)
- ("j" ?^[,0Y^[(B)
- ("l" ?^[,0Z^[(B)
- ("/y" ?^[,0[^[(B)
- ("L" ?^[,0\^[(B)
- ("/h" ?^[,0]^[(B)
- ("w" ?^[,0^^[(B)
- ("M" ?^[,0_^[(B)
- ("'" ?^[,0p^[(B)
- ("`" ?^[,0q^[(B)
- (":" ?^[,0r^[(B))
+ ("i" ?i)
+ ("I" ?ɪ)
+ ("e" ?e)
+ ("/3" ?ɛ)
+ ("E" ?ɛ)
+ ("ae" ?æ)
+ ("a" ?a)
+ ("i-" ?ɨ)
+ ("/e" ?ə)
+ ("/a" ?ɐ)
+ ("/m" ?ɯ)
+ ("&" ?ɤ)
+ ("/v" ?ʌ)
+ ("A" ?ɑ)
+ ("o|" ?ɑ)
+ ("y" ?y)
+ ("Y" ?ʏ)
+ ("o/" ?ø)
+ ("oe" ?œ)
+ ("OE" ?ɶ)
+ ("u-" ?ʉ)
+ ("o-" ?ɵ)
+ ("u" ?u)
+ ("U" ?ʊ)
+ ("o" ?o)
+ ("/c" ?ɔ)
+ ("/A" ?ɒ)
+ ("|o" ?ɒ)
+ ("e-" ?ɚ)
+ ("e|" ?ɚ)
+ ("p" ?p)
+ ("b" ?b)
+ ("t" ?t)
+ ("d" ?d)
+ ("k" ?k)
+ ("g" ?g)
+ ("f" ?f)
+ ("v" ?v)
+ ("th" ?θ)
+ ("dh" ?ð)
+ ("s" ?s)
+ ("z" ?z)
+ ("sh" ?ʃ)
+ ("zh" ?ʒ)
+ ("3" ?ʒ)
+ ("c," ?ç)
+ ("x" ?x)
+ ("/R" ?ʁ)
+ ("h" ?h)
+ ("m" ?m)
+ ("n" ?n)
+ ("gn" ?ɲ)
+ ("ng" ?ŋ)
+ ("r" ?r)
+ ("R" ?ʀ)
+ ("/r" ?ɹ)
+ ("j" ?j)
+ ("l" ?l)
+ ("/y" ?ʎ)
+ ("L" ?ʟ)
+ ("/h" ?ɥ)
+ ("w" ?w)
+ ("M" ?ʍ)
+ ("'" ?ˈ)
+ ("`" ?ˌ)
+ (":" ?ː))
+
+(quail-define-package
+ "kirshenbaum-ipa" "IPA" "IPA-K" t
+ "The International Phonetic Alphabet, using Kirshenbaum ASCII translit. 
+
+Kirshenbaum IPA is an ASCII transliteration of the phonetic alphabet, common
+in the Usenet groups `sci.lang' and `alt.usage.english'.  This input method
+allows you to type Kirshenbaum on your ASCII-capable keyboard, producing the
+corresponding actual IPA characters in your editor.
+
+See http://www.kirshenbaum.net/IPA/ascii-ipa.pdf for full details of the
+transliteration.
+
+A caveat with regard to that document; while XEmacs currently preserves
+Unicode diacritics on reading and emitting them, it displays them,
+incorrectly, as separate from the modified glyphs. ")
+
+(quail-define-rules
+ ("r" "ɹ")	;; Alveolar approximant			U+0279
+ ("A" "ɑ")	;; Low back unrounded vowel		U+0251
+ ("B" "β")	;; Voiced bilabial fricative		U+03B2
+ ("C" "ç")	;; Voiced palatal fricative		U+00E7
+ ("D" "ð")	;; Voiced dental fricative		U+00F0
+ ("E" "ɛ")	;; Lower-mid front unrounded vowel	U+025B
+ ("G" "ɢ")	;; Voiced uvular stop			U+0262
+ ("H" "ħ")	;; Voiced pharyngeal fricative		U+0127
+ ("I" "ɪ")	;; Semi-high front unrounded vowel	U+026A
+ ("J" "ɟ")	;; Voiced palatal stop			U+025F
+ ("L" ["ɫ"	;; Voiced velar lateral			U+026B
+       "ʟ"	;; Voiced velar lateral			U+029F
+       "ɬ"])	;; Voiced alveolar lateral fricative	U+026C
+ ("M" "ɱ")	;; Labio-dental nasal			U+0271
+ ("N" "ŋ")	;; Velar nasal				U+014B
+ ("O" "ɔ")	;; Low-mid back rounded vowel		U+0254
+ ("P" "Φ")	;; Voiceless bilabial fricative		U+03A6
+ ("Q" "ɣ")	;; Voiced velar fricative		U+0263
+ ("R" ["ʀ"	;; Alveolar trill			U+0280
+       "ɚ"])    ;; Rhotacised schwa			U+025A
+ ("@<r>" "ɚ")	;; Mid central rhotacised vowel		U+025A
+ ("S" "ʃ")	;; Voiced postalveolar fricative	U+0283
+ ("T" "θ")	;; Voiced dental fricative		U+03B8
+ ("U" "ʊ")	;; Semi-high back rounded vowel		U+028A
+ ("V" "ʌ")	;; Low-mid back unrounded vowel		U+028C
+ ("W" "œ")	;; Low-mid front rounded vowel		U+0153
+ ("X" "χ")	;; Voiceless uvular fricative		U+03C7
+ ("Y" "ø")	;; Upper-mid front rounded vowel	U+00F8
+ ("Z" "ʒ")	;; Voiced postalveolar fricative	U+0292
+ ("?" "ʔ")	;; Glottal stop				U+0294
+ ("@" "ə")	;; Mid central unrounded vowel (schwa)	U+0259
+ ("&" "æ")	;; Low front unrounded vowel		U+00E6
+ ("*" "ɾ")	;; Voiced alveolar flap			U+027E
+
+ ("a~" "ã")	;; Low central unrounded vowel, nasal	U+00E3
+ ("o~" "õ")	;; Upper-mid back rounded vowel, nasal	U+00F5
+ ("u~" "ũ")	;; High back rounded vowel, nasal	U+0169
+ ("~" "̃")	;; +Nasalised modifier			U+0303
+ (":" "ː")	;; +Long modifier			U+02D0
+ ("-" "̩")	;; +Syllabic modifier			U+0329
+ ("." "̣")	;; +Retroflex modifier			U+0323
+ ("`" "ʼ")	;; +Ejective modifier			U+02BC
+ ("[" "̪")	;; +Dental modifier			U+032A
+ (";" "ʲ")	;; +Palatalised modifier		U+02B2
+ ("<H>" "̴")	;; +Pharyngealised modifier		U+0334
+ ("<h>" "ʰ")	;; +Aspirated modifier			U+02B0
+ ("<o>" ["̥"	;; +Voiceless modifier			U+0325
+	 "˚"])	;; +Unexploded modifier			U+02DA
+ ("<r>" "ʳ")	;; +Rhotacised modifier			U+02B3
+ ("<w>" "ʷ")	;; +Labialised modifier			U+02B7
+ ("<?>" "ʱ")	;; +Murmured modififer			U+02B1
+
+ ("b<trl>" "ʙ")	;; Bilabial trill			U+0299
+ ("b`" "ɓ")	;; Bilabial implosive			U+0253
+ ("p!" "ʘ")	;; Bilabial click			U+0298
+ ("r<lbd>" "ʋ")	;; Labio-dental approximant		U+028B
+ ("d`" "ɗ")	;; Dental implosive			U+0257
+ ("t!" "ʇ")	;; Dental click				U+0287
+ ("s<lat>" "ɬ")	;; Voiceless alveolar lateral fricative	U+026C
+ ("z<lat>" "ɮ")	;; Voiced alveolar lateral fricative	U+026E
+ ("r<trl>" "ʀ")	;; Alveolar trill			U+0280
+
+ ("*<lat>" "ɺ")	;; Voiced alveolar lateral flap		U+027A
+ ("c!" "ʗ")	;; Alveolar click			U+0297
+ ("l!" "ʖ")	;; Alveolar lateral click		U+0296
+ ("n." "ɳ")	;; Retroflex nasal			U+0273
+ ("t." "ʈ")	;; Voiceless retroflex stop		U+0288
+ ("d." "ɖ")	;; Voiced retroflex stop		U+0256
+ ("s." "ʂ")	;; Voiceless retroflex fricative	U+0282
+ ("z." "ʐ")	;; Voiceless retroflex fricative	U+0290
+ ("r." "ɻ")	;; Retroflex approximant		U+027B
+ ("l." "ɭ")	;; Retroflex lateral			U+026D
+ ("*." "ɽ")	;; Retroflex flap			U+027D
+
+ ("C<vcd>" "ʝ")	;; Voiced palatal fricative		U+029D
+ ("j<rnd>" "ɥ")	;; Rounded palatal approximant		U+0265
+ ("l^" "ʎ")	;; Palatal lateral			U+028E
+ ("J`" "ʄ")	;; Palatal implosive			U+0284
+ ("j<vel>" "ɰ")	;; Velar approximant			U+0270
+ ("g`" "ɠ")	;; Velar implosive			U+0260
+ ("k!" "ʞ")	;; Velar click				U+029E
+ 
+ ("n<lbv>" ["n⁀g"]) ;; Labio-velar nasal
+ ("t<lbv>" ["k⁀p"]) ;; Voiceless labio-velar stop
+
+ ;; "n<lbv> for "gb" WITH U+2030 CHARACTER TIE was ambiguous and
+ ;; misleading. I _believe_ this is what was meant instead. 
+ ("d<lbv>" ["g⁀b"]) ;; Voiced labio-velar stop. 
+
+ ("w<vls>" "ʍ")	;; Voiceless labio-velar stop		U+028D
+ ("n\"" "ɴ")	;; Uvular nasal				U+0274
+ ("g\"" "ʁ")	;; Voiced uvular fricative		U+0281
+ ("r\"" "ʀ")	;; Uvular trill				U+0280
+ ("G`" "ʛ")	;; Voiced uvular implosive		U+029B
+ ("H<vcd>" "ʕ")	;; Voiced pharyngeal fricative		U+0295
+
+ ("h<?>" "ɦ")	;; Murmured glottal fricative		U+0266
+ ("I." "ʏ")	;; Semi-high front rounded vowel	U+028F
+ ("&." "ɶ")	;; Low front unrounded vowel		U+0276
+
+ ("i\"" "ɨ")	;; High central unrounded vowel		U+0268
+ ("u\"" "ʉ")	;; High central rounded vowel		U+0289
+ ("@<umd>" "ɘ")	;; Upper-mid central unrounded vowel	U+0258
+
+ ("R<umd>" "ɝ")	;; Upper-mid central rhotacised vowel	U+025D
+
+ ("@." "ɵ")	;; Mid central rounded vowel		U+0275
+ ("V\"" "ɜ")	;; Lower-mid central unrounded vowel	U+025C
+ ("O\"" "ɞ")	;; Lower-mid central rounded vowel	U+025E
+ ("u-" "ɯ")	;; High back unrounded vowel		U+026F
+ ("o-" "ɤ")	;; Upper-mid back unrounded vowel	U+0264
+ ("A." "ɒ"))	;; Lower back rounded vowel		U+0252
+
+;; On XEmacs, with the supplied X-SAMPA data, this function is capably
+;; implemented with: 
+;;
+;;   (list (vector (concat to-prepend quail-keymap)))
+;; 
+;; Supporting GNU Emacs too makes it a good deal more complicated. 
+
+(defun x-sampa-prepend-to-keymap-entry (to-prepend quail-keymap)
+  "Return QUAIL-KEYMAP with TO-PREPEND at the beginning of each result.
+
+QUAIL-KEYMAP is a cons that satisfies `quail-map-p'; TO-PREPEND is a
+string.  "
+  (when (consp quail-keymap) (setq quail-keymap (cdr quail-keymap)))
+  (if (or (integerp quail-keymap) 
+	  (and (fboundp 'characterp) (characterp quail-keymap)))
+      (setq quail-keymap (list (string quail-keymap)))
+    (if (stringp quail-keymap)
+	(setq quail-keymap (list quail-keymap))
+      (assert (vectorp quail-keymap) t)
+      (setq quail-keymap (append quail-keymap nil))))
+  (list 
+   (apply 'vector 
+	  (mapcar 
+	   '(lambda (entry)
+	      (assert (char-or-string-p entry) t)
+	      (format "%s%s" to-prepend 
+		      (if (integerp entry) (string entry) entry)))
+	   quail-keymap))))
+
+(defconst x-sampa-implosive-submap 
+  '(("b_<"   ?ɓ)   ;; Voiced bilabial implosive U+0253 
+    ("d_<"   ?ɗ)   ;; Voiced alveolar implosive U+0257
+    ("g_<"   ?ɠ)   ;; Voiced velar implosive    U+0260
+    ("G\\_<" ?ʛ)   ;; Voiced uvular implosive   U+029B
+    ("J\\_<" ?ʄ))  ;; Voiced palatal implosive  U+0284
+  "A map from the X-SAMPA for some implosive consonants to characters.
+This is used because their X-SAMPA syntax is quasi-diacritic, but the
+corresponding Unicode characters themselves don't have diacritics, they are
+separate code points.  So we need to implement some extra logic that isn't
+normally provided by Quail. ")
+
+(defun x-sampa-underscore-implosive (input-string length)
+  "Return keymap with IPA implosives, for INPUT-STRING, length LENGTH.
+
+The implosive consonants in X-SAMPA are represented with more or less a
+diacritic syntax, but the property +implosive in the IPA is expressed using
+separate characters, and not using a diacritic.  This function works around
+the confusion that implies when generating IPA from X-SAMPA; it returns a
+Quail map that is a copy of the map for `_', but with all the DIACRITIC
+entries changed to return the diacritic together with the base character,
+and with the map to the implosive added to its end.
+
+Like all `quail-defrule'-assigned functions, this will be called once for
+each particular sequence of keys, the first time the user types that
+particular sequence of keys, and the result will be cached by Quail. "
+  (let* ((input-string (substring input-string 0 (or length)))
+	 (underscore-map (copy-tree (quail-lookup-key "_")))
+	 (split-input (split-string input-string "_"))
+	 (pre-underscore (car split-input))
+	 (pre-underscore-map (quail-lookup-key pre-underscore))
+	 (x-sampa-submap-entry 
+	  (assoc (format "%s<" input-string) x-sampa-implosive-submap))
+	 underscore-map-entry)
+    (if (and (consp pre-underscore-map) (car pre-underscore-map))
+	(setq pre-underscore-map (car pre-underscore-map))
+      (setq pre-underscore-map pre-underscore))
+    (unless (stringp pre-underscore-map)
+      (setq pre-underscore-map (string pre-underscore-map)))
+    (dolist (underscoring underscore-map)
+      (cond ((null underscoring))
+	    ((eq (length underscoring) 2)
+	     (setq underscore-map-entry (second underscoring))
+	     (setcdr underscoring (x-sampa-prepend-to-keymap-entry 
+				   pre-underscore-map underscore-map-entry)))
+	    ((eq (length underscoring) 3)
+	     (setq underscore-map-entry (second (third underscoring)))
+	     (setcdr (third underscoring) 
+		     (x-sampa-prepend-to-keymap-entry 
+		      pre-underscore-map underscore-map-entry)))
+	    (t 
+	     (assert (null t) t 
+		     "Can't handle subtrees of this level right now."))))
+    (append underscore-map (list (list ?< (second x-sampa-submap-entry))))))
+
+(quail-define-package
+ "x-sampa-ipa" "IPA" "IPA-X" t
+ "The International Phonetic Alphabet, using J.C. Wells' X-SAMPA. 
+
+X-SAMPA is an ASCII transliteration of the IPA, normally used for data
+exchange in environments where Unicode is not available.  This input method
+uses this transliteration to allow you to produce the IPA in your editor
+with a keyboard that's limited to ASCII.
+
+See http://www.phon.ucl.ac.uk/home/sampa/ipasam-x.pdf for a full definition
+of the mapping. A caveat with regard to that document; while XEmacs
+currently preserves Unicode diacritics on reading and emitting them, it
+displays them, incorrectly, as separate from the modified glyphs. " nil t t)
+
+(quail-define-rules
+ ;; Table taken from http://en.wikipedia.org/wiki/X-SAMPA, checked with
+ ;; http://www.phon.ucl.ac.uk/home/sampa/ipasam-x.pdf
+
+ ("d`" "ɖ")	;; Voiced retroflex plosive		U+0256
+
+ ;; In the Wikipedia article, they list ASCII g as mapping to U+0261, LATIN
+ ;; SMALL LETTER SCRIPT G. The typeset g is equally acceptable as the voiced
+ ;; velar plosive, however, and we don't override the identity mapping here.
+
+ ("h\\" "ɦ")	;; Voiced glottal fricative		U+0266
+ ("j\\" "ʝ")	;; Voiced palatal fricative		U+029D
+ ("l`" "ɭ")	;; Retroflex lateral approximant	U+026D
+ ("l\\" "ɺ")	;; Alveolar lateral flap		U+027A
+ ("n`" "ɳ")	;; Retroflex nasal			U+0273
+ ("p\\" "ɸ")	;; Voiceless bilabial fricative		U+0278
+ ("r`" "ɽ")	;; Retroflex flap			U+027D
+ ("r\\" "ɹ")	;; Alveolar approximant			U+0279
+ ("r\\`" "ɻ")	;; Retroflex approximant		U+027B
+ ("s`" "ʂ")	;; Voiceless retroflex fricative	U+0282
+ ("s\\" "ɕ")	;; Voiceless alveolo-palatal fricative	U+0255
+ ("t`" "ʈ")	;; Voiceless retroflex plosive		U+0288
+ ("v\\" "ʋ")	;; Labiodental approximant		U+028B
+ ("x\\" "ɧ")	;; Voiceless palatal-velar fricative	U+0267
+ ("z`" "ʐ")	;; Voiced retroflex fricative		U+0290
+ ("z\\" "ʑ")	;; Voiced alveolo-palatal fricative	U+0291
+ ("A" "ɑ")	;; Open back unrounded vowel		U+0251
+ ("B" "β")	;; Voiced bilabial fricative		U+03B2
+ ("B\\" "ʙ")	;; Bilabial trill			U+0299
+ ("C" "ç")	;; Voiceless palatal fricative		U+00E7
+ ("D" "ð")	;; Voiced dental fricative		U+00F0
+ ("E" "ɛ")	;; Open-mid front unrounded vowel	U+025B
+ ("F" "ɱ")	;; Labiodental nasal			U+0271
+ ("G" "ɣ")	;; Voiced velar fricative		U+0263
+ ("G\\" "ɢ")	;; Voiced uvular plosive		U+0262
+ ("H" "ɥ")	;; Labial-palatal approximant		U+0265
+ ("H\\" "ʜ")	;; Voiceless epiglottal fricative	U+029C
+ ("I" "ɪ")	;; Near-close near-front unrounded vowel	U+026A
+ ("I\\" "Ɨ")	;; Central lax close unrounded vowel	U+0197
+ ("J" "ɲ")	;; Palatal nasal			U+0272
+ ("J\\" "ɟ")	;; Voiceless palatal plosive		U+025F
+ ("K" "ɬ")	;; Voiceless alveolar lateral fricative	U+026C
+ ("K\\" "ɮ")	;; Voiced alveolar lateral fricative	U+026E
+ ("L" "ʎ")	;; Palatal lateral approximant		U+028E
+ ("L\\" "ʟ")	;; Velar lateral approximant		U+029F
+ ("M" "ɯ")	;; Close back unrounded vowel		U+026F
+ ("M\\" "ɰ")	;; Velar approximant			U+0270
+ ("N" "ŋ")	;; Velar nasal				U+014B
+ ("N\\" "ɴ")	;; Uvular nasal				U+0274
+ ("O" "ɔ")	;; Open-mid back rounded vowel		U+0254
+ ("O\\" "ʘ")	;; Bilabial click			U+0298
+ ("P" "ʋ")	;; Labiodental approximant		U+028B
+ ("Q" "ɒ")	;; Open back rounded vowel		U+0252
+ ("R" "ʁ")	;; Voiced uvular fricative		U+0281
+ ("R\\" "ʀ")	;; Uvular trill				U+0280
+ ("S" "ʃ")	;; Voiceless postalveolar fricative	U+0283
+ ("T" "θ")	;; Voiceless dental fricative		U+03B8
+ ("U" "ʊ")	;; Near-close near-back rounded vowel	U+028A
+ ("U\\" ["ʊ̵"])	;; Central lax close rounded vowel, U+028A U+0335
+ ("V" "ʌ")	;; Open-mid back unrounded vowel	U+028C
+ ("W" "ʍ")	;; Voiceless labial-velar fricative	U+028D
+ ("X" "χ")	;; Voiceless uvular fricative		U+03C7
+ ("X\\" "ħ")	;; Voiceless pharyngeal fricative	U+0127
+ ("Y" "ʏ")	;; Near-close near-front rounded vowel	U+028F
+ ("Z" "ʒ")	;; Voiced postalveolar fricative	U+0292
+
+ ("\"" "ˈ")	;; Primary stress			U+02C8
+ ("%" "ˌ")	;; Secondary stress	 		U+02CC
+ (":" "ː")	;; Long					U+02D0
+ (":\\" "ˑ")	;; Half-long				U+02D1
+ ("@" "ə")	;; Schwa				U+0259
+ ("@\\" "ɘ")	;; Close-mid central unrounded vowel	U+0258
+ ("@`" "ɚ")	;; Rhotacised schwa			U+025A
+ ("{" "æ")	;; Near-open front unrounded vowel	U+00E6
+ ("}" "ʉ")	;; Close central rounded vowel		U+0289
+ ("1" "ɨ")	;; Close central unrounded vowel	U+0268
+ ("2" "ø")	;; Close-mid front rounded vowel	U+00F8
+ ("3" "ɜ")	;; Open-mid central unrounded vowel	U+025C
+ ("3\\" "ɞ")	;; Open-mid central rounded vowel	U+025E
+ ("4" "ɾ")	;; Alveolar flap			U+027E
+ ("5" "ɫ")	;; Velarised alveolar lateral approximant	U+026B
+ ("6" "ɐ")	;; Near-open central vowel 		U+0250
+ ("7" "ɤ")	;; Close-mid back unrounded vowel	U+0264
+ ("8" "ɵ")	;; Close-mid central rounded vowel	U+0275
+ ("9" "œ")	;; Open-mid front rounded vowel		U+0153
+ ("&" "ɶ")	;; Open front rounded vowel		U+0276
+ ("?" "ʔ")	;; Glottal stop				U+0294
+ ("?\\" "ʕ")	;; Voiced pharyngeal fricative		U+0295
+ ;; The undefined escape character, ignored. 
+ ;; Indeterminacy in French vowels, ignored.
+ ;; Begin nonsegmental notation, ignored. 
+ ("<\\" "ʢ")	;; Voiced epiglottal fricative		U+02A2
+ ;; End nonsegmental notation, ignored. 
+ (">\\" "ʡ")	;; Epiglottal plosive			U+02A1
+ ("^" "↑")	;; Upstep				U+2191
+ ("!" "↓")	;; Downstep				U+2193
+ ("!\\" "ǃ")	;; Postalveolal click			U+01C3
+ ("\\" "ǀ")	;; Dental click				U+01C0
+ ("\\|\\" "ǁ")	;; Lateral alveolar click		U+01C1
+ ("=\\" "ǂ")	;; Palatal click			U+01C2
+ ("-\\" "̮")	;; Linking mark				U+032E
+
+ ;; Diacritics. Note that XEmacs doesn't yet have composed characters, so we
+ ;; can input them, but they won't display properly. If you send email using
+ ;; them, and the recipient's client is capable, they will get through,
+ ;; though.
+
+ ("_\"" "̈")	;; Centralised		 		U+0308
+ ("_+" "̟")	;; Advanced				U+031F
+ ("_-" "̠")	;; Retracted				U+0320
+ ("_/" "ˇ")	;; Rising tone				U+02C7
+ ("_0" "̥")	;; Voiceless				U+0325
+ ("_=" "̩")	;; Syllabic				U+0329
+ ("=" "̩")	;; Syllabic				U+0329
+ ("_>" "ʼ")	;; Ejective				U+02BC
+ ("_?\\" "ˤ")	;; Pharyngealised			U+02E4
+ ("_\\" "ˆ")	;; Falling Tone				U+02C6
+ ("_^" "̯")	;; Non-syllabic				U+032F
+ ("_}" "̚")	;; No audible release			U+031A
+ ;; ` is alternatively; retroflexion in consonants
+ ("`" "˞")	;; Rhotacization in vowels		U+02DE
+ ("_~" "̃")	;; Nasalisation				U+0303
+ ("~" "̃")	;; Nasalisation				U+0303
+ ("_A" "̘")	;; Advanced tongue root			U+0318
+ ("_a" "̺")	;; Apical				U+033A
+ ("_B" "̏")	;; Extra low tone			U+030F
+ ;; _B_L omitted, no Unicode code point for "low rising tone."
+ ("_c" "̜")	;; Less rounded				U+031C
+ ("_d" "̪")	;; Dental				U+032A
+ ("_e" "̴")	;; Velarised or pharyngeal		U+0334
+
+ ("<F>" "↙")	;; Global fall; SOUTH EAST ARROW; may be a bit smaller than
+		;; intended.
+ ("_F" "̂")	;; Falling tone				U+0302
+ ("_G" "ˠ")	;; Velarised				U+02E0
+ ("_H" "́")	;; High tone				U+0301
+ ;; "_H_T omitted, no Unicode code point for "high rising tone"
+ ("_h" "ʰ")	;; Aspirated				U+02B0
+ ("_j" "ʲ")	;; Palatalised				U+02B2
+ ("'" "ʲ")	;; Palatalised				U+02B2
+ ("_k" "̰")	;; Creaky voice				U+0330
+ ("_L" "̀")	;; Low tone				U+0300
+ ("_l" "ˡ")	;; Lateral release			U+02E1
+ ("_M" "̄")	;; Mid tone				U+0304
+ ("_m" "̻")	;; Laminal				U+033B
+ ("_N" "̼")	;; Linguolabial				U+033C
+ ("_n" "ⁿ")	;; Nasal release			U+207F
+ ("_O" "̹")	;; More rounded				U+0339
+ ("_o" "̞")	;; Lowered				U+031E
+ ("_q" "̙")	;; Retracted tongue root		U+0319
+ ("<R>" "↗")	;; NORTH EAST ARROW; may be a bit smaller than intended. 
+ ("_R" "̌")     ;; Haček, caron, rising tone.		U+030C
+ ;; _R_F omitted, apparently there's no corresponding Unicode entry. 
+ ("_r" "̝")	;; Raised				U+031D
+ ("_T" "̋")	;; Extra high tone			U+030B
+ ("_t" "̤")	;; Breathy voice			U+0324
+ ("_v" "̬")	;; Voiced				U+032C
+ ("_w" "ʷ")	;; Labialised				U+02B7
+ ("_X" "̆")	;; Extra-short				U+0306
+ ("_x" "̽"))	;; Mid-centralised			U+033D
+
+;; Putting in place rules for the implosives like for the others above
+;; breaks the "_<diacritic>" rules for b, d, g, G and J a little--you need
+;; to interrupt Quail before typing the underscore if you want the
+;; diacritic. To avoid this, handle the input specially with the function
+;; x-sampa-underscore-implosive.
+
+(dolist (implosive-x-sampa (mapcar 'car x-sampa-implosive-submap))
+  (setq implosive-x-sampa (car (split-string implosive-x-sampa "_")))
+  (quail-defrule (format "%s_" implosive-x-sampa)
+		 'x-sampa-underscore-implosive))
 
 ;;; arch-tag: cf2614cc-ecce-4ef5-ba51-37faeed41691
 ;;; ipa.el ends here
+

-- 
When I was in the scouts, the leader told me to pitch a tent. I couldn't
find any pitch, so I used creosote.

[-- Attachment #3: .signature --]
[-- Type: text/plain, Size: 143 bytes --]


-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghé, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?

^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
  2008-12-31 13:41 ` bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el Aidan Kehoe
@ 2008-12-31 18:40   ` Stefan Monnier
  2008-12-31 19:51     ` Aidan Kehoe
  2008-12-31 19:57   ` James Cloos
  2009-08-30 23:40   ` bug#1750: marked as done ([PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el. ) Emacs bug Tracking System
  2 siblings, 1 reply; 8+ messages in thread
From: Stefan Monnier @ 2008-12-31 18:40 UTC (permalink / raw)
  To: Aidan Kehoe; +Cc: emacs-pretest-bug, 1750

> I was asked to resend this after the release; see
> http://mid.gmane.org/17834.14174.503231.877601@parhasard.net and
> http://thread.gmane.org/gmane.emacs.pretest.bugs/16625 for the associated
> thread.

I was tempted to switch to utf-8 already so as to split your patch into
2 steps where the second results in a cleaner patch.  But I see that
ipa.el contains some chars (IIUC there are 5 of those) that Emacs
doesn't know how to encode into utf-8.  E.g. the char associated with
"/3~".  Your patch seems to just remove that one.  Can you explain
what's going on with those?


        Stefan






^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
  2008-12-31 18:40   ` Stefan Monnier
@ 2008-12-31 19:51     ` Aidan Kehoe
  2009-01-19  7:11       ` Aidan Kehoe
  0 siblings, 1 reply; 8+ messages in thread
From: Aidan Kehoe @ 2008-12-31 19:51 UTC (permalink / raw)
  To: Stefan Monnier; +Cc: emacs-pretest-bug, 1750

[-- Attachment #1: message body text --]
[-- Type: text/plain, Size: 2461 bytes --]

 Ar an t-aonú lá is triochad de mí na Nollaig, scríobh Stefan Monnier: 

 > > I was asked to resend this after the release; see
 > > http://mid.gmane.org/17834.14174.503231.877601@parhasard.net and
 > > http://thread.gmane.org/gmane.emacs.pretest.bugs/16625 for the associated
 > > thread.
 > 
 > I was tempted to switch to utf-8 already so as to split your patch into
 > 2 steps where the second results in a cleaner patch.  But I see that
 > ipa.el contains some chars (IIUC there are 5 of those) that Emacs
 > doesn't know how to encode into utf-8.  E.g. the char associated with
 > "/3~".  Your patch seems to just remove that one.  Can you explain
 > what's going on with those?

Those characters are precomposed nasalised vowels (see the attached pic; see
also character code 54, which is just o, there is nothing special about the
o in IPA versus its use in most Roman-alpahbet languages. The pic uses a
font from https://launchpad.net/ubuntu/+source/xfonts-kappa20/0.396-1 .) 
Nasalisation is indicated in the IPA with a combining tilde, U+0303--Unicode
includes some vowels with precombined tildes, but not those.

I was going to say that the mapping of "~" to U+0303 in the input method
meant that equivalent text was input (with, for example "/3~" now giving
"\u025b\u0303"), but now I double-check, the mapping of "~" to U+0303 is
included in the other two input methods in the file, but not in that one. It
would be sensible to add it, though I hope people use the Kirshenbaum or
X-SAMPA input methods instead, there’s no good reason to learn that
ASCII-IPA mapping. 

The change at http://mid.gmane.org/18779.31070.155006.447104@parhasard.net
would also be helpful. Here’s a patch to the file including both:

--- ipa-20070114.el	2007-01-14 13:49:36.000000000 +0000
+++ ipa-20070114.el.new	2008-12-31 19:46:57.000000000 +0000
@@ -103,7 +103,8 @@
  ("M" ?ʍ)
  ("'" ?ˈ)
  ("`" ?ˌ)
- (":" ?ː))
+ (":" ?ː)
+ ("~" ?\u0303))

 (quail-define-package
  "kirshenbaum-ipa" "IPA" "IPA-K" t
@@ -138,7 +139,7 @@
  ("M" "ɱ")	;; Labio-dental nasal			U+0271
  ("N" "ŋ")	;; Velar nasal				U+014B
  ("O" "ɔ")	;; Low-mid back rounded vowel		U+0254
- ("P" "Φ")	;; Voiceless bilabial fricative		U+03A6
+ ("P" "ɸ")	;; Voiceless bilabial fricative		U+0278
  ("Q" "ɣ")	;; Voiced velar fricative		U+0263
  ("R" ["ʀ"	;; Alveolar trill			U+0280
        "ɚ"])    ;; Rhotacised schwa			U+025A

[-- Attachment #2: ipa-characters.png --]
[-- Type: image/png, Size: 6979 bytes --]

[-- Attachment #3: .signature --]
[-- Type: text/plain, Size: 141 bytes --]

-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghé, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?

^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
  2008-12-31 13:41 ` bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el Aidan Kehoe
  2008-12-31 18:40   ` Stefan Monnier
@ 2008-12-31 19:57   ` James Cloos
  2009-08-30 23:40   ` bug#1750: marked as done ([PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el. ) Emacs bug Tracking System
  2 siblings, 0 replies; 8+ messages in thread
From: James Cloos @ 2008-12-31 19:57 UTC (permalink / raw)
  To: Stefan Monnier; +Cc: 1750

Those Mule IPA characters exist in the UCS only as character sequences.

The example you mentioned, \3~ (which C-uC-x= says is aka E~) is written
with UCS characters as ɛ U+025B U+0303 (LATIN SMALL LETTER OPEN E
followed by COMBINING TILDE).

Ideally the quail ipa method would enter the two character sequences
into the buffer for such cases.

I beleive the quail ethiopic method lost a couple of chars of the same
sort when unicode-2 was merged in; if quail can be convinced to enter a
string then those, too, should be recovered.

-JimC
-- 
James Cloos <cloos@jhcloos.com>         OpenPGP: 1024D/ED7DAEA6

^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
  2008-12-31 19:51     ` Aidan Kehoe
@ 2009-01-19  7:11       ` Aidan Kehoe
  0 siblings, 0 replies; 8+ messages in thread
From: Aidan Kehoe @ 2009-01-19  7:11 UTC (permalink / raw)
  To: Stefan Monnier, 1750, emacs-pretest-bug

[-- Attachment #1: message body and .signature --]
[-- Type: text/plain, Size: 22771 bytes --]


 Ar an t-aonú lá is triochad de mí na Nollaig, scríobh Aidan Kehoe: 

 > [...]

I’m not sure if you’re waiting for an updated patch from me, or what. Here’s
one, anyway, against the current file with the 2009 copyright year, and with
better handling of the precomposed Mule IPA nasals. I’ve also attached a
compressed version of the new file, in case anyone’s mail client does Bad
Things with the encoding.

James, see the entries below corresponding to /3~ for how to tell quail to
insert a series of characters. 

--- ipa.el.GNU-Emacs	2009-01-19 07:01:35.000000000 +0000
+++ ipa.el.new	2009-01-19 07:04:18.000000000 +0000
@@ -1,4 +1,4 @@
-;;; ipa.el --- Quail package for inputting IPA characters  -*-coding: iso-2022-7bit;-*-
+;;; ipa.el --- Quail package for inputting IPA characters. -*- coding: utf-8 -*- 
 
 ;; Copyright (C) 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
 ;;   2006, 2007, 2008, 2009
@@ -21,12 +21,16 @@
 
 ;; You should have received a copy of the GNU General Public License
 ;; along with GNU Emacs.  If not, see <http://www.gnu.org/licenses/>.
+;; Copyright (C) 1997 Electrotechnical Laboratory, JAPAN.
+;; Copyright (C) 2006 Free Software Foundation, Inc.
+;; Licensed to the Free Software Foundation.
 
 ;;; Commentary:
 
 ;;; Code:
 
 (require 'quail)
+(eval-when-compile (require 'cl))
 
 (quail-define-package
  "ipa" "IPA" "IPA" t
@@ -36,77 +40,477 @@
  nil t nil nil nil nil nil nil nil nil t)
 
 (quail-define-rules
- ("i" ?^[,0 ^[(B)
- ("I" ?^[,0!^[(B)
- ("e" ?^[,0"^[(B)
- ("/3" ?^[,0#^[(B)
- ("E" ?^[,0#^[(B)
- ("ae" ?^[,0$^[(B)
- ("a" ?^[,0%^[(B)
- ("i-" ?^[,0&^[(B)
- ("/e" ?^[,0'^[(B)
- ("/a" ?^[,0(^[(B)
- ("/m" ?^[,0)^[(B)
- ("&" ?^[,0*^[(B)
- ("/v" ?^[,0+^[(B)
- ("A" ?^[,0,^[(B)
- ("o|" ?^[,0,^[(B)
- ("y" ?^[,0-^[(B)
- ("Y" ?^[,0.^[(B)
- ("o/" ?^[,0/^[(B)
- ("oe" ?^[,00^[(B)
- ("OE" ?^[,01^[(B)
- ("u-" ?^[,02^[(B)
- ("o-" ?^[,03^[(B)
- ("u" ?^[,04^[(B)
- ("U" ?^[,05^[(B)
- ("o" ?^[,06^[(B)
- ("/c" ?^[,07^[(B)
- ("/A" ?^[,08^[(B)
- ("|o" ?^[,08^[(B)
- ("e-" ?^[,0:^[(B)
- ("e|" ?^[,0:^[(B)
- ("/3~" ?^[,0;^[(B)
- ("E~" ?^[,0;^[(B)
- ("A~" ?^[,0<^[(B)
- ("oe~" ?^[,0=^[(B)
- ("/c~" ?^[,0>^[(B)
- ("p" ?^[,0@^[(B)
- ("b" ?^[,0A^[(B)
- ("t" ?^[,0B^[(B)
- ("d" ?^[,0C^[(B)
- ("k" ?^[,0D^[(B)
- ("g" ?^[,0E^[(B)
- ("f" ?^[,0F^[(B)
- ("v" ?^[,0G^[(B)
- ("th" ?^[,0H^[(B)
- ("dh" ?^[,0I^[(B)
- ("s" ?^[,0J^[(B)
- ("z" ?^[,0K^[(B)
- ("sh" ?^[,0L^[(B)
- ("zh" ?^[,0M^[(B)
- ("3" ?^[,0M^[(B)
- ("c," ?^[,0N^[(B)
- ("x" ?^[,0O^[(B)
- ("/R" ?^[,0P^[(B)
- ("h" ?^[,0Q^[(B)
- ("m" ?^[,0R^[(B)
- ("n" ?^[,0S^[(B)
- ("gn" ?^[,0T^[(B)
- ("ng" ?^[,0U^[(B)
- ("r" ?^[,0V^[(B)
- ("R" ?^[,0W^[(B)
- ("/r" ?^[,0X^[(B)
- ("j" ?^[,0Y^[(B)
- ("l" ?^[,0Z^[(B)
- ("/y" ?^[,0[^[(B)
- ("L" ?^[,0\^[(B)
- ("/h" ?^[,0]^[(B)
- ("w" ?^[,0^^[(B)
- ("M" ?^[,0_^[(B)
- ("'" ?^[,0p^[(B)
- ("`" ?^[,0q^[(B)
- (":" ?^[,0r^[(B))
+ ("i" ?i)
+ ("I" ?ɪ)
+ ("e" ?e)
+ ("/3" ?ɛ)
+ ("E" ?ɛ)
+ ("ae" ?æ)
+ ("a" ?a)
+ ("i-" ?ɨ)
+ ("/e" ?ə)
+ ("/a" ?ɐ)
+ ("/m" ?ɯ)
+ ("&" ?ɤ)
+ ("/v" ?ʌ)
+ ("A" ?ɑ)
+ ("o|" ?ɑ)
+ ("y" ?y)
+ ("Y" ?ʏ)
+ ("o/" ?ø)
+ ("oe" ?œ)
+ ("OE" ?ɶ)
+ ("u-" ?ʉ)
+ ("o-" ?ɵ)
+ ("u" ?u)
+ ("U" ?ʊ)
+ ("o" ?o)
+ ("/c" ?ɔ)
+ ("/A" ?ɒ)
+ ("|o" ?ɒ)
+ ("e-" ?ɚ)
+ ("e|" ?ɚ)
+ ("/3~" ["ɛ̃"])
+ ("E~" ["ɛ̃"])
+ ("A~" ["ɑ̃"])
+ ("oe~" ["œ̃"])
+ ("/c~" ["ɔ̃"])
+ ("p" ?p)
+ ("b" ?b)
+ ("t" ?t)
+ ("d" ?d)
+ ("k" ?k)
+ ("g" ?g)
+ ("f" ?f)
+ ("v" ?v)
+ ("th" ?θ)
+ ("dh" ?ð)
+ ("s" ?s)
+ ("z" ?z)
+ ("sh" ?ʃ)
+ ("zh" ?ʒ)
+ ("3" ?ʒ)
+ ("c," ?ç)
+ ("x" ?x)
+ ("/R" ?ʁ)
+ ("h" ?h)
+ ("m" ?m)
+ ("n" ?n)
+ ("gn" ?ɲ)
+ ("ng" ?ŋ)
+ ("r" ?r)
+ ("R" ?ʀ)
+ ("/r" ?ɹ)
+ ("j" ?j)
+ ("l" ?l)
+ ("/y" ?ʎ)
+ ("L" ?ʟ)
+ ("/h" ?ɥ)
+ ("w" ?w)
+ ("M" ?ʍ)
+ ("'" ?ˈ)
+ ("`" ?ˌ)
+ (":" ?ː))
 
-;; arch-tag: cf2614cc-ecce-4ef5-ba51-37faeed41691
-;;; ipa.el ends here
+(quail-define-package
+ "kirshenbaum-ipa" "IPA" "IPA-K" t
+ "The International Phonetic Alphabet, using Kirshenbaum ASCII translit. 
+
+Kirshenbaum IPA is an ASCII transliteration of the phonetic alphabet, common
+in the Usenet groups `sci.lang' and `alt.usage.english'.  This input method
+allows you to type Kirshenbaum on your ASCII-capable keyboard, producing the
+corresponding actual IPA characters in your editor.
+
+See http://www.kirshenbaum.net/IPA/ascii-ipa.pdf for full details of the
+transliteration.
+
+A caveat with regard to that document; while XEmacs currently preserves
+Unicode diacritics on reading and emitting them, it displays them,
+incorrectly, as separate from the modified glyphs. ")
+
+(quail-define-rules
+ ("r" "ɹ")	;; Alveolar approximant			U+0279
+ ("A" "ɑ")	;; Low back unrounded vowel		U+0251
+ ("B" "β")	;
+  ; Voiced bilabial fricative		U+03B2
+ ("C" "ç")	;; Voiced palatal fricative		U+00E7
+ ("D" "ð")	;; Voiced dental fricative		U+00F0
+ ("E" "ɛ")	;; Lower-mid front unrounded vowel	U+025B
+ ("G" "ɢ")	;; Voiced uvular stop			U+0262
+ ("H" "ħ")	;; Voiced pharyngeal fricative		U+0127
+ ("I" "ɪ")	;; Semi-high front unrounded vowel	U+026A
+ ("J" "ɟ")	;; Voiced palatal stop			U+025F
+ ("L" ["ɫ"	;; Voiced velar lateral			U+026B
+       "ʟ"	;; Voiced velar lateral			U+029F
+       "ɬ"])	;; Voiced alveolar lateral fricative	U+026C
+ ("M" "ɱ")	;; Labio-dental nasal			U+0271
+ ("N" "ŋ")	;; Velar nasal				U+014B
+ ("O" "ɔ")	;; Low-mid back rounded vowel		U+0254
+ ("P" "ɸ")	;; Voiceless bilabial fricative		U+0278
+ ("Q" "ɣ")	;; Voiced velar fricative		U+0263
+ ("R" ["ʀ"	;; Alveolar trill			U+0280
+       "ɚ"])    ;; Rhotacised schwa			U+025A
+ ("@<r>" "ɚ")	;; Mid central rhotacised vowel		U+025A
+ ("S" "ʃ")	;; Voiced postalveolar fricative	U+0283
+ ("T" "θ")	;; Voiced dental fricative		U+03B8
+ ("U" "ʊ")	;; Semi-high back rounded vowel		U+028A
+ ("V" "ʌ")	;; Low-mid back unrounded vowel		U+028C
+ ("W" "œ")	;; Low-mid front rounded vowel		U+0153
+ ("X" "χ")	;; Voiceless uvular fricative		U+03C7
+ ("Y" "ø")	;; Upper-mid front rounded vowel	U+00F8
+ ("Z" "ʒ")	;; Voiced postalveolar fricative	U+0292
+ ("?" "ʔ")	;; Glottal stop				U+0294
+ ("@" "ə")	;; Mid central unrounded vowel (schwa)	U+0259
+ ("&" "æ")	;; Low front unrounded vowel		U+00E6
+ ("*" "ɾ")	;; Voiced alveolar flap			U+027E
+
+ ("a~" "ã")	;; Low central unrounded vowel, nasal	U+00E3
+ ("o~" "õ")	;; Upper-mid back rounded vowel, nasal	U+00F5
+ ("u~" "ũ")	;; High back rounded vowel, nasal	U+0169
+ ("~" "̃")	;; +Nasalised modifier			U+0303
+ (":" "ː")	;; +Long modifier			U+02D0
+ ("-" "̩")	;; +Syllabic modifier			U+0329
+ ("." "̣")	;; +Retroflex modifier			U+0323
+ ("`" "ʼ")	;; +Ejective modifier			U+02BC
+ ("[" "̪")	;; +Dental modifier			U+032A
+ (";" "ʲ")	;; +Palatalised modifier		U+02B2
+ ("<H>" "̴")	;; +Pharyngealised modifier		U+0334
+ ("<h>" "ʰ")	;; +Aspirated modifier			U+02B0
+ ("<o>" ["̥"	;; +Voiceless modifier			U+0325
+	 "˚"])	;; +Unexploded modifier			U+02DA
+ ("<r>" "ʳ")	;; +Rhotacised modifier			U+02B3
+ ("<w>" "ʷ")	;; +Labialised modifier			U+02B7
+ ("<?>" "ʱ")	;; +Murmured modififer			U+02B1
+
+ ("b<trl>" "ʙ")	;; Bilabial trill			U+0299
+ ("b`" "ɓ")	;; Bilabial implosive			U+0253
+ ("p!" "ʘ")	;; Bilabial click			U+0298
+ ("r<lbd>" "ʋ")	;; Labio-dental approximant		U+028B
+ ("d`" "ɗ")	;; Dental implosive			U+0257
+ ("t!" "ʇ")	;; Dental click				U+0287
+ ("s<lat>" "ɬ")	;; Voiceless alveolar lateral fricative	U+026C
+ ("z<lat>" "ɮ")	;; Voiced alveolar lateral fricative	U+026E
+ ("r<trl>" "ʀ")	;; Alveolar trill			U+0280
+
+ ("*<lat>" "ɺ")	;; Voiced alveolar lateral flap		U+027A
+ ("c!" "ʗ")	;; Alveolar click			U+0297
+ ("l!" "ʖ")	;; Alveolar lateral click		U+0296
+ ("n." "ɳ")	;; Retroflex nasal			U+0273
+ ("t." "ʈ")	;; Voiceless retroflex stop		U+0288
+ ("d." "ɖ")	;; Voiced retroflex stop		U+0256
+ ("s." "ʂ")	;; Voiceless retroflex fricative	U+0282
+ ("z." "ʐ")	;; Voiceless retroflex fricative	U+0290
+ ("r." "ɻ")	;; Retroflex approximant		U+027B
+ ("l." "ɭ")	;; Retroflex lateral			U+026D
+ ("*." "ɽ")	;; Retroflex flap			U+027D
+
+ ("C<vcd>" "ʝ")	;; Voiced palatal fricative		U+029D
+ ("j<rnd>" "ɥ")	;; Rounded palatal approximant		U+0265
+ ("l^" "ʎ")	;; Palatal lateral			U+028E
+ ("J`" "ʄ")	;; Palatal implosive			U+0284
+ ("j<vel>" "ɰ")	;; Velar approximant			U+0270
+ ("g`" "ɠ")	;; Velar implosive			U+0260
+ ("k!" "ʞ")	;; Velar click				U+029E
+ 
+ ("n<lbv>" ["n⁀g"]) ;; Labio-velar nasal
+ ("t<lbv>" ["k⁀p"]) ;; Voiceless labio-velar stop
+
+ ;; "n<lbv> for "gb" WITH U+2030 CHARACTER TIE was ambiguous and
+ ;; misleading. I _believe_ this is what was meant instead. 
+ ("d<lbv>" ["g⁀b"]) ;; Voiced labio-velar stop. 
+
+ ("w<vls>" "ʍ")	;; Voiceless labio-velar stop		U+028D
+ ("n\"" "ɴ")	;; Uvular nasal				U+0274
+ ("g\"" "ʁ")	;; Voiced uvular fricative		U+0281
+ ("r\"" "ʀ")	;; Uvular trill				U+0280
+ ("G`" "ʛ")	;; Voiced uvular implosive		U+029B
+ ("H<vcd>" "ʕ")	;; Voiced pharyngeal fricative		U+0295
+
+ ("h<?>" "ɦ")	;; Murmured glottal fricative		U+0266
+ ("I." "ʏ")	;; Semi-high front rounded vowel	U+028F
+ ("&." "ɶ")	;; Low front unrounded vowel		U+0276
+
+ ("i\"" "ɨ")	;; High central unrounded vowel		U+0268
+ ("u\"" "ʉ")	;; High central rounded vowel		U+0289
+ ("@<umd>" "ɘ")	;; Upper-mid central unrounded vowel	U+0258
+
+ ("R<umd>" "ɝ")	;; Upper-mid central rhotacised vowel	U+025D
+
+ ("@." "ɵ")	;; Mid central rounded vowel		U+0275
+ ("V\"" "ɜ")	;; Lower-mid central unrounded vowel	U+025C
+ ("O\"" "ɞ")	;; Lower-mid central rounded vowel	U+025E
+ ("u-" "ɯ")	;; High back unrounded vowel		U+026F
+ ("o-" "ɤ")	;; Upper-mid back unrounded vowel	U+0264
+ ("A." "ɒ"))	;; Lower back rounded vowel		U+0252
+
+;; On XEmacs, with the supplied X-SAMPA data, this function is capably
+;; implemented with: 
+;;
+;;   (list (vector (concat to-prepend quail-keymap)))
+;; 
+;; Supporting GNU Emacs too makes it a good deal more complicated. 
+
+(defun x-sampa-prepend-to-keymap-entry (to-prepend quail-keymap)
+  "Return QUAIL-KEYMAP with TO-PREPEND at the beginning of each result.
+
+QUAIL-KEYMAP is a cons that satisfies `quail-map-p'; TO-PREPEND is a
+string.  "
+  (when (consp quail-keymap) (setq quail-keymap (cdr quail-keymap)))
+  (if (or (integerp quail-keymap) 
+	  (and (fboundp 'characterp) (characterp quail-keymap)))
+      (setq quail-keymap (list (string quail-keymap)))
+    (if (stringp quail-keymap)
+	(setq quail-keymap (list quail-keymap))
+      (assert (vectorp quail-keymap) t)
+      (setq quail-keymap (append quail-keymap nil))))
+  (list 
+   (apply 'vector 
+	  (mapcar 
+	   #'(lambda (entry)
+               (assert (char-or-string-p entry) t)
+               (format "%s%s" to-prepend 
+                       (if (integerp entry) (string entry) entry)))
+	   quail-keymap))))
+
+(defun x-sampa-underscore-implosive (input-string length)
+  "Return keymap with IPA implosives, for INPUT-STRING, length LENGTH.
+
+The implosive consonants in X-SAMPA are represented with more or less a
+diacritic syntax, but the property +implosive in the IPA is expressed using
+separate characters, and not using a diacritic.  This function works around
+the confusion that implies when generating IPA from X-SAMPA; it returns a
+Quail map that is a copy of the map for `_', but with all the DIACRITIC
+entries changed to return the diacritic together with the base character,
+and with the map to the implosive added to its end.
+
+Like all `quail-defrule'-assigned functions, this will be called once for
+each particular sequence of keys, the first time the user types that
+particular sequence of keys, and the result will be cached by Quail. "
+  (let* ((input-string (substring input-string 0 (or length)))
+	 (underscore-map (copy-tree (quail-lookup-key "_")))
+	 (split-input (split-string input-string "_"))
+	 (pre-underscore (car split-input))
+	 (pre-underscore-map (quail-lookup-key pre-underscore))
+	 (x-sampa-submap-entry 
+	  (assoc (format "%s<" input-string) x-sampa-implosive-submap))
+	 underscore-map-entry)
+    (if (and (consp pre-underscore-map) (car pre-underscore-map))
+	(setq pre-underscore-map (car pre-underscore-map))
+      (setq pre-underscore-map pre-underscore))
+    (unless (stringp pre-underscore-map)
+      (setq pre-underscore-map (string pre-underscore-map)))
+    (dolist (underscoring underscore-map)
+      (cond ((null underscoring))
+	    ((eq (length underscoring) 2)
+	     (setq underscore-map-entry (second underscoring))
+	     (setcdr underscoring (x-sampa-prepend-to-keymap-entry 
+				   pre-underscore-map underscore-map-entry)))
+	    ((eq (length underscoring) 3)
+	     (setq underscore-map-entry (second (third underscoring)))
+	     (setcdr (third underscoring) 
+		     (x-sampa-prepend-to-keymap-entry 
+		      pre-underscore-map underscore-map-entry)))
+	    (t 
+	     (assert (null t) t 
+		     "Can't handle subtrees of this level right now."))))
+    (append underscore-map (list (list ?< (second x-sampa-submap-entry))))))
+
+(quail-define-package
+ "x-sampa-ipa" "IPA" "IPA-X" t
+ "The International Phonetic Alphabet, using J.C. Wells' X-SAMPA. 
+
+X-SAMPA is an ASCII transliteration of the IPA, normally used for data
+exchange in environments where Unicode is not available.  This input method
+uses this transliteration to allow you to produce the IPA in your editor
+with a keyboard that's limited to ASCII.
+
+See http://www.phon.ucl.ac.uk/home/sampa/ipasam-x.pdf for a full definition
+of the mapping. A caveat with regard to that document; while XEmacs
+currently preserves Unicode diacritics on reading and emitting them, it
+displays them, incorrectly, as separate from the modified glyphs. " nil t t)
+
+(quail-define-rules
+ ;; Table taken from http://en.wikipedia.org/wiki/X-SAMPA, checked with
+ ;; http://www.phon.ucl.ac.uk/home/sampa/ipasam-x.pdf
+
+ ("d`" "ɖ")	;; Voiced retroflex plosive		U+0256
+
+ ;; In the Wikipedia article, they list ASCII g as mapping to U+0261, LATIN
+ ;; SMALL LETTER SCRIPT G. The typeset g is equally acceptable as the voiced
+ ;; velar plosive, however, and we don't override the identity mapping here.
+
+ ("h\\" "ɦ")	;; Voiced glottal fricative		U+0266
+ ("j\\" "ʝ")	;; Voiced palatal fricative		U+029D
+ ("l`" "ɭ")	;; Retroflex lateral approximant	U+026D
+ ("l\\" "ɺ")	;; Alveolar lateral flap		U+027A
+ ("n`" "ɳ")	;; Retroflex nasal			U+0273
+ ("p\\" "ɸ")	;; Voiceless bilabial fricative		U+0278
+ ("r`" "ɽ")	;; Retroflex flap			U+027D
+ ("r\\" "ɹ")	;; Alveolar approximant			U+0279
+ ("r\\`" "ɻ")	;; Retroflex approximant		U+027B
+ ("s`" "ʂ")	;; Voiceless retroflex fricative	U+0282
+ ("s\\" "ɕ")	;; Voiceless alveolo-palatal fricative	U+0255
+ ("t`" "ʈ")	;; Voiceless retroflex plosive		U+0288
+ ("v\\" "ʋ")	;; Labiodental approximant		U+028B
+ ("x\\" "ɧ")	;; Voiceless palatal-velar fricative	U+0267
+ ("z`" "ʐ")	;; Voiced retroflex fricative		U+0290
+ ("z\\" "ʑ")	;; Voiced alveolo-palatal fricative	U+0291
+ ("A" "ɑ")	;; Open back unrounded vowel		U+0251
+ ("B" "β")	;; Voiced bilabial fricative		U+03B2
+ ("B\\" "ʙ")	;; Bilabial trill			U+0299
+ ("C" "ç")	;; Voiceless palatal fricative		U+00E7
+ ("D" "ð")	;; Voiced dental fricative		U+00F0
+ ("E" "ɛ")	;; Open-mid front unrounded vowel	U+025B
+ ("F" "ɱ")	;; Labiodental nasal			U+0271
+ ("G" "ɣ")	;; Voiced velar fricative		U+0263
+ ("G\\" "ɢ")	;; Voiced uvular plosive		U+0262
+ ("H" "ɥ")	;; Labial-palatal approximant		U+0265
+ ("H\\" "ʜ")	;; Voiceless epiglottal fricative	U+029C
+ ("I" "ɪ")	;; Near-close near-front unrounded vowel	U+026A
+ ("I\\" "Ɨ")	;; Central lax close unrounded vowel	U+0197
+ ("J" "ɲ")	;; Palatal nasal			U+0272
+ ("J\\" "ɟ")	;; Voiceless palatal plosive		U+025F
+ ("K" "ɬ")	;; Voiceless alveolar lateral fricative	U+026C
+ ("K\\" "ɮ")	;; Voiced alveolar lateral fricative	U+026E
+ ("L" "ʎ")	;; Palatal lateral approximant		U+028E
+ ("L\\" "ʟ")	;; Velar lateral approximant		U+029F
+ ("M" "ɯ")	;; Close back unrounded vowel		U+026F
+ ("M\\" "ɰ")	;; Velar approximant			U+0270
+ ("N" "ŋ")	;; Velar nasal				U+014B
+ ("N\\" "ɴ")	;; Uvular nasal				U+0274
+ ("O" "ɔ")	;; Open-mid back rounded vowel		U+0254
+ ("O\\" "ʘ")	;; Bilabial click			U+0298
+ ("P" "ʋ")	;; Labiodental approximant		U+028B
+ ("Q" "ɒ")	;; Open back rounded vowel		U+0252
+ ("R" "ʁ")	;; Voiced uvular fricative		U+0281
+ ("R\\" "ʀ")	;; Uvular trill				U+0280
+ ("S" "ʃ")	;; Voiceless postalveolar fricative	U+0283
+ ("T" "θ")	;; Voiceless dental fricative		U+03B8
+ ("U" "ʊ")	;; Near-close near-back rounded vowel	U+028A
+ ("U\\" ["ʊ̵"])	;; Central lax close rounded vowel, U+028A U+0335
+ ("V" "ʌ")	;; Open-mid back unrounded vowel	U+028C
+ ("W" "ʍ")	;; Voiceless labial-velar fricative	U+028D
+ ("X" "χ")	;; Voiceless uvular fricative		U+03C7
+ ("X\\" "ħ")	;; Voiceless pharyngeal fricative	U+0127
+ ("Y" "ʏ")	;; Near-close near-front rounded vowel	U+028F
+ ("Z" "ʒ")	;; Voiced postalveolar fricative	U+0292
+
+ ("\"" "ˈ")	;; Primary stress			U+02C8
+ ("%" "ˌ")	;; Secondary stress	 		U+02CC
+ (":" "ː")	;; Long					U+02D0
+ (":\\" "ˑ")	;; Half-long				U+02D1
+ ("@" "ə")	;; Schwa				U+0259
+ ("@\\" "ɘ")	;; Close-mid central unrounded vowel	U+0258
+ ("@`" "ɚ")	;; Rhotacised schwa			U+025A
+ ("{" "æ")	;; Near-open front unrounded vowel	U+00E6
+ ("}" "ʉ")	;; Close central rounded vowel		U+0289
+ ("1" "ɨ")	;; Close central unrounded vowel	U+0268
+ ("2" "ø")	;; Close-mid front rounded vowel	U+00F8
+ ("3" "ɜ")	;; Open-mid central unrounded vowel	U+025C
+ ("3\\" "ɞ")	;; Open-mid central rounded vowel	U+025E
+ ("4" "ɾ")	;; Alveolar flap			U+027E
+ ("5" "ɫ")	;; Velarised alveolar lateral approximant	U+026B
+ ("6" "ɐ")	;; Near-open central vowel 		U+0250
+ ("7" "ɤ")	;; Close-mid back unrounded vowel	U+0264
+ ("8" "ɵ")	;; Close-mid central rounded vowel	U+0275
+ ("9" "œ")	;; Open-mid front rounded vowel		U+0153
+ ("&" "ɶ")	;; Open front rounded vowel		U+0276
+ ("?" "ʔ")	;; Glottal stop				U+0294
+ ("?\\" "ʕ")	;; Voiced pharyngeal fricative		U+0295
+ ;; The undefined escape character, ignored. 
+ ;; Indeterminacy in French vowels, ignored.
+ ;; Begin nonsegmental notation, ignored. 
+ ("<\\" "ʢ")	;; Voiced epiglottal fricative		U+02A2
+ ;; End nonsegmental notation, ignored. 
+ (">\\" "ʡ")	;; Epiglottal plosive			U+02A1
+ ("^" "↑")	;; Upstep				U+2191
+ ("!" "↓")	;; Downstep				U+2193
+ ("!\\" "ǃ")	;; Postalveolal click			U+01C3
+ ("\\" "ǀ")	;; Dental click				U+01C0
+ ("\\|\\" "ǁ")	;; Lateral alveolar click		U+01C1
+ ("=\\" "ǂ")	;; Palatal click			U+01C2
+ ("-\\" "̮")	;; Linking mark				U+032E
+
+ ;; Diacritics. Note that XEmacs doesn't yet have composed characters, so we
+ ;; can input them, but they won't display properly. If you send email using
+ ;; them, and the recipient's client is capable, they will get through,
+ ;; though.
+
+ ("_\"" "̈")	;; Centralised		 		U+0308
+ ("_+" "̟")	;; Advanced				U+031F
+ ("_-" "̠")	;; Retracted				U+0320
+ ("_/" "ˇ")	;; Rising tone				U+02C7
+ ("_0" "̥")	;; Voiceless				U+0325
+ ("_=" "̩")	;; Syllabic				U+0329
+ ("=" "̩")	;; Syllabic				U+0329
+ ("_>" "ʼ")	;; Ejective				U+02BC
+ ("_?\\" "ˤ")	;; Pharyngealised			U+02E4
+ ("_\\" "ˆ")	;; Falling Tone				U+02C6
+ ("_^" "̯")	;; Non-syllabic				U+032F
+ ("_}" "̚")	;; No audible release			U+031A
+ ;; ` is alternatively; retroflexion in consonants
+ ("`" "˞")	;; Rhotacization in vowels		U+02DE
+ ("_~" "̃")	;; Nasalisation				U+0303
+ ("~" "̃")	;; Nasalisation				U+0303
+ ("_A" "̘")	;; Advanced tongue root			U+0318
+ ("_a" "̺")	;; Apical				U+033A
+ ("_B" "̏")	;; Extra low tone			U+030F
+ ;; _B_L omitted, no Unicode code point for "low rising tone."
+ ("_c" "̜")	;; Less rounded				U+031C
+ ("_d" "̪")	;; Dental				U+032A
+ ("_e" "̴")	;; Velarised or pharyngeal		U+0334
+
+ ("<F>" "↙")	;; Global fall; SOUTH EAST ARROW; may be a bit smaller than
+		;; intended.
+ ("_F" "̂")	;; Falling tone				U+0302
+ ("_G" "ˠ")	;; Velarised				U+02E0
+ ("_H" "́")	;; High tone				U+0301
+ ;; "_H_T omitted, no Unicode code point for "high rising tone"
+ ("_h" "ʰ")	;; Aspirated				U+02B0
+ ("_j" "ʲ")	;; Palatalised				U+02B2
+ ("'" "ʲ")	;; Palatalised				U+02B2
+ ("_k" "̰")	;; Creaky voice				U+0330
+ ("_L" "̀")	;; Low tone				U+0300
+ ("_l" "ˡ")	;; Lateral release			U+02E1
+ ("_M" "̄")	;; Mid tone				U+0304
+ ("_m" "̻")	;; Laminal				U+033B
+ ("_N" "̼")	;; Linguolabial				U+033C
+ ("_n" "ⁿ")	;; Nasal release			U+207F
+ ("_O" "̹")	;; More rounded				U+0339
+ ("_o" "̞")	;; Lowered				U+031E
+ ("_q" "̙")	;; Retracted tongue root		U+0319
+ ("<R>" "↗")	;; NORTH EAST ARROW; may be a bit smaller than intended. 
+ ("_R" "̌")     ;; Haček, caron, rising tone.		U+030C
+ ;; _R_F omitted, apparently there's no corresponding Unicode entry. 
+ ("_r" "̝")	;; Raised				U+031D
+ ("_T" "̋")	;; Extra high tone			U+030B
+ ("_t" "̤")	;; Breathy voice			U+0324
+ ("_v" "̬")	;; Voiced				U+032C
+ ("_w" "ʷ")	;; Labialised				U+02B7
+ ("_X" "̆")	;; Extra-short				U+0306
+ ("_x" "̽"))	;; Mid-centralised			U+033D
+
+(defconst x-sampa-implosive-submap 
+  '(("b_<"   ?ɓ)   ;; Voiced bilabial implosive U+0253 
+    ("d_<"   ?ɗ)   ;; Voiced alveolar implosive U+0257
+    ("g_<"   ?ɠ)   ;; Voiced velar implosive    U+0260
+    ("G\\_<" ?ʛ)   ;; Voiced uvular implosive   U+029B
+    ("J\\_<" ?ʄ))  ;; Voiced palatal implosive  U+0284
+  "A map from the X-SAMPA for some implosive consonants to characters.
+This is used because their X-SAMPA syntax is quasi-diacritic, but the
+corresponding Unicode characters themselves don't have diacritics, they are
+separate code points.  So we need to implement some extra logic that isn't
+normally provided by Quail. ")
+
+;; Putting in place rules for the implosives like for the others above
+;; breaks the "_<diacritic>" rules for b, d, g, G and J a little--you need
+;; to interrupt Quail before typing the underscore if you want the
+;; diacritic. To avoid this, handle the input specially with the function
+;; x-sampa-underscore-implosive.
+
+(dolist (implosive-x-sampa (mapcar 'car x-sampa-implosive-submap))
+  (setq implosive-x-sampa (car (split-string implosive-x-sampa "_")))
+  (quail-defrule (format "%s_" implosive-x-sampa)
+		 'x-sampa-underscore-implosive))
+
+(provide 'ipa)


-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghe, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?


[-- Attachment #2: ipa.el.new.bz2 --]
[-- Type: application/octet-stream, Size: 6505 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#4109: Status of IPA patch?
       [not found] <4A807E29.1030405@gmail.com>
@ 2009-08-10 20:30 ` Aidan Kehoe
  2009-08-30 23:40   ` bug#4109: marked as done (Status of IPA patch?) Emacs bug Tracking System
  0 siblings, 1 reply; 8+ messages in thread
From: Aidan Kehoe @ 2009-08-10 20:30 UTC (permalink / raw)
  To: Oleksandr Gavenko; +Cc: emacs-pretest-bug, 1750


Hi Oleksandr -- 

The omission of those two was intentional, since U+20A7 is equivalent to tʃ
(note that that digraph doesn’t include the tie that would be necessary to
have it as a phoneme distinct from /t/ followed by /ʃ/) and g is equivalent
to U+0261 for the purposes of the IPA.

I don’t know what’s happening with the application of the patch to GNU
Emacs. I use and develop for XEmacs, it hasn’t been a problem for me; maybe
the list can tell us if there are any problems with it. 

Ciao, 

	Aidan

 Ar an deichiú lá de mí Lúnasa, scríobh Oleksandr Gavenko: 

 > You make patch for IPA input method:
 > 
 > http://thread.gmane.org/gmane.emacs.bugs/23612
 > 
 > I online browse source:
 > 
 > http://cvs.savannah.gnu.org/viewvc/emacs/leim/quail/ipa.el?root=emacs&view=log
 > 
 > and see that it not yet applyed. But it provide useful features.
 > It nicely  if you try again ask developer to apply this patch.
 > 
 > Also I mail you because have such question:
 > 
 > There are two chars in English phonetic transcription:
 >   ʧ (\u02A7 LATIN SMALL LETTER TESH DIGRAPH)
 >   and ɡ (\u0261 LATIN SMALL LETTER SCRIPT G)
 > which used in English/Russian dictionaries and
 > they both do not present in your ipa.el file.
 > 
 > TESH DIGRAPH can be typed as t and LATIN SMALL LETTER ESH.
 > SMALL LETTER SCRIPT G can be typed as g (\u0067) - but some fonts
 > display differ (see article http://en.wikipedia.org/wiki/G ).
 > 
 > Can you comment question and if it have reason add support
 > for this chars?

-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghe, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?





^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#1750: marked as done ([PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el. )
  2008-12-31 13:41 ` bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el Aidan Kehoe
  2008-12-31 18:40   ` Stefan Monnier
  2008-12-31 19:57   ` James Cloos
@ 2009-08-30 23:40   ` Emacs bug Tracking System
  2 siblings, 0 replies; 8+ messages in thread
From: Emacs bug Tracking System @ 2009-08-30 23:40 UTC (permalink / raw)
  To: Juri Linkov

[-- Attachment #1: Type: text/plain, Size: 928 bytes --]

Your message dated Mon, 31 Aug 2009 02:35:23 +0300
with message-id <87y6p050zo.fsf@mail.jurta.org>
and subject line Re: bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
has caused the Emacs bug report #1750,
regarding [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el. 
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@emacsbugs.donarmstrong.com
immediately.)


-- 
1750: http://emacsbugs.donarmstrong.com/cgi-bin/bugreport.cgi?bug=1750
Emacs Bug Tracking System
Contact owner@emacsbugs.donarmstrong.com with problems

[-- Attachment #2: Type: message/rfc822, Size: 29018 bytes --]

[-- Attachment #2.1.1: message body text --]
[-- Type: text/plain, Size: 199 bytes --]


I was asked to resend this after the release; see
http://mid.gmane.org/17834.14174.503231.877601@parhasard.net and
http://thread.gmane.org/gmane.emacs.pretest.bugs/16625 for the associated
thread.


[-- Attachment #2.1.2: forwarded message --]
[-- Type: message/rfc822, Size: 26157 bytes --]

From: Aidan Kehoe <kehoea@parhasard.net>
To: emacs-pretest-bug@gnu.org
Subject: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
Date: Sun, 14 Jan 2007 14:59:58 +0100
Message-ID: <17834.14174.503231.877601@parhasard.net>


There are two widely-used mappings from ASCII to the International Phonetic
Alphabet used on Usenet and in the online linguistics community
generally. The extant ipa.el uses its own convention to map from ASCII to
the IPA; this patch addes support for the more-widely used ones to it, with
two new input methods, kirshenbaum-ipa and x-sampa-ipa, following the names
of the two conventions.

It also moves the file encoding to UTF-8, since Kirshenbaum and X-SAMPA
require more character codes than are provided by the Mule IPA character
set, and since UTF-8 is more compatible with other applications. 

In the event that my mailer or yours trashes the character encoding of the
patch, I’ve made a bzip2’d version of the full file available at
http://www.parhasard.net/ipa-20070114.el.bz2 . 

leim/ChangeLog addition:

2007-01-14  Aidan Kehoe  <kehoea@parhasard.net>

	* quail/ipa.el:
	Recode the file as UTF-8, for better interoperability with other
	applications. 
	* quail/ipa.el ("kirshenbaum-ipa"):
	* quail/ipa.el ("x-sampa-ipa"):
	* quail/ipa.el (x-sampa-prepend-to-keymap-entry): New function.
	* quail/ipa.el (x-sampa-implosive-submap): New Quail submap.
	* quail/ipa.el (x-sampa-underscore-implosive): New function.
	Two new input methods, both following widely-used Usenet
	conventions for mapping ASCII to the IPA. Kirshenbaum is common in
	sci.lang and alt.usage.english, X-SAMPA in various (mostly
	European) non-English language fora. X-SAMPA is slightly more
	complex to support in Quail that Kirshenbaum, whence the two extra
	function and one extra submap to support it. 


GNU Emacs Trunk source patch:
Diff command:   cvs -q diff -Nu
Files affected: leim/quail/ipa.el
===================================================================
RCS

Index: leim/quail/ipa.el
===================================================================
RCS file: /sources/emacs/emacs/leim/quail/ipa.el,v
retrieving revision 1.8
diff -u -u -r1.8 ipa.el
--- leim/quail/ipa.el	13 Dec 2006 01:13:56 -0000	1.8
+++ leim/quail/ipa.el	14 Jan 2007 13:41:57 -0000
@@ -1,8 +1,10 @@
-;;; ipa.el --- Quail package for inputting IPA characters  -*-coding: iso-2022-7bit;-*-
-
+;;; quail/ipa.el -- Quail package for inputting IPA. -*- coding: utf-8 -*- 
+;;; 
 ;; Copyright (C) 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006
 ;;   National Institute of Advanced Industrial Science and Technology (AIST)
 ;;   Registration Number H14PRO021
+;; Licensed to the Free Software Foundation.
+;; Copyright (C) 2006 Free Software Foundation, Inc.
 
 ;; Keywords: multilingual, input method, IPA
 
@@ -23,11 +25,10 @@
 ;; Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor,
 ;; Boston, MA 02110-1301, USA.
 
-;;; Commentary:
-
 ;;; Code:
 
 (require 'quail)
+(eval-when-compile (require 'cl))
 
 (quail-define-package
  "ipa" "IPA" "IPA" t
@@ -37,77 +38,473 @@
  nil t nil nil nil nil nil nil nil nil t)
 
 (quail-define-rules
- ("i" ?^[,0 ^[(B)
- ("I" ?^[,0!^[(B)
- ("e" ?^[,0"^[(B)
- ("/3" ?^[,0#^[(B)
- ("E" ?^[,0#^[(B)
- ("ae" ?^[,0$^[(B)
- ("a" ?^[,0%^[(B)
- ("i-" ?^[,0&^[(B)
- ("/e" ?^[,0'^[(B)
- ("/a" ?^[,0(^[(B)
- ("/m" ?^[,0)^[(B)
- ("&" ?^[,0*^[(B)
- ("/v" ?^[,0+^[(B)
- ("A" ?^[,0,^[(B)
- ("o|" ?^[,0,^[(B)
- ("y" ?^[,0-^[(B)
- ("Y" ?^[,0.^[(B)
- ("o/" ?^[,0/^[(B)
- ("oe" ?^[,00^[(B)
- ("OE" ?^[,01^[(B)
- ("u-" ?^[,02^[(B)
- ("o-" ?^[,03^[(B)
- ("u" ?^[,04^[(B)
- ("U" ?^[,05^[(B)
- ("o" ?^[,06^[(B)
- ("/c" ?^[,07^[(B)
- ("/A" ?^[,08^[(B)
- ("|o" ?^[,08^[(B)
- ("e-" ?^[,0:^[(B)
- ("e|" ?^[,0:^[(B)
- ("/3~" ?^[,0;^[(B)
- ("E~" ?^[,0;^[(B)
- ("A~" ?^[,0<^[(B)
- ("oe~" ?^[,0=^[(B)
- ("/c~" ?^[,0>^[(B)
- ("p" ?^[,0@^[(B)
- ("b" ?^[,0A^[(B)
- ("t" ?^[,0B^[(B)
- ("d" ?^[,0C^[(B)
- ("k" ?^[,0D^[(B)
- ("g" ?^[,0E^[(B)
- ("f" ?^[,0F^[(B)
- ("v" ?^[,0G^[(B)
- ("th" ?^[,0H^[(B)
- ("dh" ?^[,0I^[(B)
- ("s" ?^[,0J^[(B)
- ("z" ?^[,0K^[(B)
- ("sh" ?^[,0L^[(B)
- ("zh" ?^[,0M^[(B)
- ("3" ?^[,0M^[(B)
- ("c," ?^[,0N^[(B)
- ("x" ?^[,0O^[(B)
- ("/R" ?^[,0P^[(B)
- ("h" ?^[,0Q^[(B)
- ("m" ?^[,0R^[(B)
- ("n" ?^[,0S^[(B)
- ("gn" ?^[,0T^[(B)
- ("ng" ?^[,0U^[(B)
- ("r" ?^[,0V^[(B)
- ("R" ?^[,0W^[(B)
- ("/r" ?^[,0X^[(B)
- ("j" ?^[,0Y^[(B)
- ("l" ?^[,0Z^[(B)
- ("/y" ?^[,0[^[(B)
- ("L" ?^[,0\^[(B)
- ("/h" ?^[,0]^[(B)
- ("w" ?^[,0^^[(B)
- ("M" ?^[,0_^[(B)
- ("'" ?^[,0p^[(B)
- ("`" ?^[,0q^[(B)
- (":" ?^[,0r^[(B))
+ ("i" ?i)
+ ("I" ?ɪ)
+ ("e" ?e)
+ ("/3" ?ɛ)
+ ("E" ?ɛ)
+ ("ae" ?æ)
+ ("a" ?a)
+ ("i-" ?ɨ)
+ ("/e" ?ə)
+ ("/a" ?ɐ)
+ ("/m" ?ɯ)
+ ("&" ?ɤ)
+ ("/v" ?ʌ)
+ ("A" ?ɑ)
+ ("o|" ?ɑ)
+ ("y" ?y)
+ ("Y" ?ʏ)
+ ("o/" ?ø)
+ ("oe" ?œ)
+ ("OE" ?ɶ)
+ ("u-" ?ʉ)
+ ("o-" ?ɵ)
+ ("u" ?u)
+ ("U" ?ʊ)
+ ("o" ?o)
+ ("/c" ?ɔ)
+ ("/A" ?ɒ)
+ ("|o" ?ɒ)
+ ("e-" ?ɚ)
+ ("e|" ?ɚ)
+ ("p" ?p)
+ ("b" ?b)
+ ("t" ?t)
+ ("d" ?d)
+ ("k" ?k)
+ ("g" ?g)
+ ("f" ?f)
+ ("v" ?v)
+ ("th" ?θ)
+ ("dh" ?ð)
+ ("s" ?s)
+ ("z" ?z)
+ ("sh" ?ʃ)
+ ("zh" ?ʒ)
+ ("3" ?ʒ)
+ ("c," ?ç)
+ ("x" ?x)
+ ("/R" ?ʁ)
+ ("h" ?h)
+ ("m" ?m)
+ ("n" ?n)
+ ("gn" ?ɲ)
+ ("ng" ?ŋ)
+ ("r" ?r)
+ ("R" ?ʀ)
+ ("/r" ?ɹ)
+ ("j" ?j)
+ ("l" ?l)
+ ("/y" ?ʎ)
+ ("L" ?ʟ)
+ ("/h" ?ɥ)
+ ("w" ?w)
+ ("M" ?ʍ)
+ ("'" ?ˈ)
+ ("`" ?ˌ)
+ (":" ?ː))
+
+(quail-define-package
+ "kirshenbaum-ipa" "IPA" "IPA-K" t
+ "The International Phonetic Alphabet, using Kirshenbaum ASCII translit. 
+
+Kirshenbaum IPA is an ASCII transliteration of the phonetic alphabet, common
+in the Usenet groups `sci.lang' and `alt.usage.english'.  This input method
+allows you to type Kirshenbaum on your ASCII-capable keyboard, producing the
+corresponding actual IPA characters in your editor.
+
+See http://www.kirshenbaum.net/IPA/ascii-ipa.pdf for full details of the
+transliteration.
+
+A caveat with regard to that document; while XEmacs currently preserves
+Unicode diacritics on reading and emitting them, it displays them,
+incorrectly, as separate from the modified glyphs. ")
+
+(quail-define-rules
+ ("r" "ɹ")	;; Alveolar approximant			U+0279
+ ("A" "ɑ")	;; Low back unrounded vowel		U+0251
+ ("B" "β")	;; Voiced bilabial fricative		U+03B2
+ ("C" "ç")	;; Voiced palatal fricative		U+00E7
+ ("D" "ð")	;; Voiced dental fricative		U+00F0
+ ("E" "ɛ")	;; Lower-mid front unrounded vowel	U+025B
+ ("G" "ɢ")	;; Voiced uvular stop			U+0262
+ ("H" "ħ")	;; Voiced pharyngeal fricative		U+0127
+ ("I" "ɪ")	;; Semi-high front unrounded vowel	U+026A
+ ("J" "ɟ")	;; Voiced palatal stop			U+025F
+ ("L" ["ɫ"	;; Voiced velar lateral			U+026B
+       "ʟ"	;; Voiced velar lateral			U+029F
+       "ɬ"])	;; Voiced alveolar lateral fricative	U+026C
+ ("M" "ɱ")	;; Labio-dental nasal			U+0271
+ ("N" "ŋ")	;; Velar nasal				U+014B
+ ("O" "ɔ")	;; Low-mid back rounded vowel		U+0254
+ ("P" "Φ")	;; Voiceless bilabial fricative		U+03A6
+ ("Q" "ɣ")	;; Voiced velar fricative		U+0263
+ ("R" ["ʀ"	;; Alveolar trill			U+0280
+       "ɚ"])    ;; Rhotacised schwa			U+025A
+ ("@<r>" "ɚ")	;; Mid central rhotacised vowel		U+025A
+ ("S" "ʃ")	;; Voiced postalveolar fricative	U+0283
+ ("T" "θ")	;; Voiced dental fricative		U+03B8
+ ("U" "ʊ")	;; Semi-high back rounded vowel		U+028A
+ ("V" "ʌ")	;; Low-mid back unrounded vowel		U+028C
+ ("W" "œ")	;; Low-mid front rounded vowel		U+0153
+ ("X" "χ")	;; Voiceless uvular fricative		U+03C7
+ ("Y" "ø")	;; Upper-mid front rounded vowel	U+00F8
+ ("Z" "ʒ")	;; Voiced postalveolar fricative	U+0292
+ ("?" "ʔ")	;; Glottal stop				U+0294
+ ("@" "ə")	;; Mid central unrounded vowel (schwa)	U+0259
+ ("&" "æ")	;; Low front unrounded vowel		U+00E6
+ ("*" "ɾ")	;; Voiced alveolar flap			U+027E
+
+ ("a~" "ã")	;; Low central unrounded vowel, nasal	U+00E3
+ ("o~" "õ")	;; Upper-mid back rounded vowel, nasal	U+00F5
+ ("u~" "ũ")	;; High back rounded vowel, nasal	U+0169
+ ("~" "̃")	;; +Nasalised modifier			U+0303
+ (":" "ː")	;; +Long modifier			U+02D0
+ ("-" "̩")	;; +Syllabic modifier			U+0329
+ ("." "̣")	;; +Retroflex modifier			U+0323
+ ("`" "ʼ")	;; +Ejective modifier			U+02BC
+ ("[" "̪")	;; +Dental modifier			U+032A
+ (";" "ʲ")	;; +Palatalised modifier		U+02B2
+ ("<H>" "̴")	;; +Pharyngealised modifier		U+0334
+ ("<h>" "ʰ")	;; +Aspirated modifier			U+02B0
+ ("<o>" ["̥"	;; +Voiceless modifier			U+0325
+	 "˚"])	;; +Unexploded modifier			U+02DA
+ ("<r>" "ʳ")	;; +Rhotacised modifier			U+02B3
+ ("<w>" "ʷ")	;; +Labialised modifier			U+02B7
+ ("<?>" "ʱ")	;; +Murmured modififer			U+02B1
+
+ ("b<trl>" "ʙ")	;; Bilabial trill			U+0299
+ ("b`" "ɓ")	;; Bilabial implosive			U+0253
+ ("p!" "ʘ")	;; Bilabial click			U+0298
+ ("r<lbd>" "ʋ")	;; Labio-dental approximant		U+028B
+ ("d`" "ɗ")	;; Dental implosive			U+0257
+ ("t!" "ʇ")	;; Dental click				U+0287
+ ("s<lat>" "ɬ")	;; Voiceless alveolar lateral fricative	U+026C
+ ("z<lat>" "ɮ")	;; Voiced alveolar lateral fricative	U+026E
+ ("r<trl>" "ʀ")	;; Alveolar trill			U+0280
+
+ ("*<lat>" "ɺ")	;; Voiced alveolar lateral flap		U+027A
+ ("c!" "ʗ")	;; Alveolar click			U+0297
+ ("l!" "ʖ")	;; Alveolar lateral click		U+0296
+ ("n." "ɳ")	;; Retroflex nasal			U+0273
+ ("t." "ʈ")	;; Voiceless retroflex stop		U+0288
+ ("d." "ɖ")	;; Voiced retroflex stop		U+0256
+ ("s." "ʂ")	;; Voiceless retroflex fricative	U+0282
+ ("z." "ʐ")	;; Voiceless retroflex fricative	U+0290
+ ("r." "ɻ")	;; Retroflex approximant		U+027B
+ ("l." "ɭ")	;; Retroflex lateral			U+026D
+ ("*." "ɽ")	;; Retroflex flap			U+027D
+
+ ("C<vcd>" "ʝ")	;; Voiced palatal fricative		U+029D
+ ("j<rnd>" "ɥ")	;; Rounded palatal approximant		U+0265
+ ("l^" "ʎ")	;; Palatal lateral			U+028E
+ ("J`" "ʄ")	;; Palatal implosive			U+0284
+ ("j<vel>" "ɰ")	;; Velar approximant			U+0270
+ ("g`" "ɠ")	;; Velar implosive			U+0260
+ ("k!" "ʞ")	;; Velar click				U+029E
+ 
+ ("n<lbv>" ["n⁀g"]) ;; Labio-velar nasal
+ ("t<lbv>" ["k⁀p"]) ;; Voiceless labio-velar stop
+
+ ;; "n<lbv> for "gb" WITH U+2030 CHARACTER TIE was ambiguous and
+ ;; misleading. I _believe_ this is what was meant instead. 
+ ("d<lbv>" ["g⁀b"]) ;; Voiced labio-velar stop. 
+
+ ("w<vls>" "ʍ")	;; Voiceless labio-velar stop		U+028D
+ ("n\"" "ɴ")	;; Uvular nasal				U+0274
+ ("g\"" "ʁ")	;; Voiced uvular fricative		U+0281
+ ("r\"" "ʀ")	;; Uvular trill				U+0280
+ ("G`" "ʛ")	;; Voiced uvular implosive		U+029B
+ ("H<vcd>" "ʕ")	;; Voiced pharyngeal fricative		U+0295
+
+ ("h<?>" "ɦ")	;; Murmured glottal fricative		U+0266
+ ("I." "ʏ")	;; Semi-high front rounded vowel	U+028F
+ ("&." "ɶ")	;; Low front unrounded vowel		U+0276
+
+ ("i\"" "ɨ")	;; High central unrounded vowel		U+0268
+ ("u\"" "ʉ")	;; High central rounded vowel		U+0289
+ ("@<umd>" "ɘ")	;; Upper-mid central unrounded vowel	U+0258
+
+ ("R<umd>" "ɝ")	;; Upper-mid central rhotacised vowel	U+025D
+
+ ("@." "ɵ")	;; Mid central rounded vowel		U+0275
+ ("V\"" "ɜ")	;; Lower-mid central unrounded vowel	U+025C
+ ("O\"" "ɞ")	;; Lower-mid central rounded vowel	U+025E
+ ("u-" "ɯ")	;; High back unrounded vowel		U+026F
+ ("o-" "ɤ")	;; Upper-mid back unrounded vowel	U+0264
+ ("A." "ɒ"))	;; Lower back rounded vowel		U+0252
+
+;; On XEmacs, with the supplied X-SAMPA data, this function is capably
+;; implemented with: 
+;;
+;;   (list (vector (concat to-prepend quail-keymap)))
+;; 
+;; Supporting GNU Emacs too makes it a good deal more complicated. 
+
+(defun x-sampa-prepend-to-keymap-entry (to-prepend quail-keymap)
+  "Return QUAIL-KEYMAP with TO-PREPEND at the beginning of each result.
+
+QUAIL-KEYMAP is a cons that satisfies `quail-map-p'; TO-PREPEND is a
+string.  "
+  (when (consp quail-keymap) (setq quail-keymap (cdr quail-keymap)))
+  (if (or (integerp quail-keymap) 
+	  (and (fboundp 'characterp) (characterp quail-keymap)))
+      (setq quail-keymap (list (string quail-keymap)))
+    (if (stringp quail-keymap)
+	(setq quail-keymap (list quail-keymap))
+      (assert (vectorp quail-keymap) t)
+      (setq quail-keymap (append quail-keymap nil))))
+  (list 
+   (apply 'vector 
+	  (mapcar 
+	   '(lambda (entry)
+	      (assert (char-or-string-p entry) t)
+	      (format "%s%s" to-prepend 
+		      (if (integerp entry) (string entry) entry)))
+	   quail-keymap))))
+
+(defconst x-sampa-implosive-submap 
+  '(("b_<"   ?ɓ)   ;; Voiced bilabial implosive U+0253 
+    ("d_<"   ?ɗ)   ;; Voiced alveolar implosive U+0257
+    ("g_<"   ?ɠ)   ;; Voiced velar implosive    U+0260
+    ("G\\_<" ?ʛ)   ;; Voiced uvular implosive   U+029B
+    ("J\\_<" ?ʄ))  ;; Voiced palatal implosive  U+0284
+  "A map from the X-SAMPA for some implosive consonants to characters.
+This is used because their X-SAMPA syntax is quasi-diacritic, but the
+corresponding Unicode characters themselves don't have diacritics, they are
+separate code points.  So we need to implement some extra logic that isn't
+normally provided by Quail. ")
+
+(defun x-sampa-underscore-implosive (input-string length)
+  "Return keymap with IPA implosives, for INPUT-STRING, length LENGTH.
+
+The implosive consonants in X-SAMPA are represented with more or less a
+diacritic syntax, but the property +implosive in the IPA is expressed using
+separate characters, and not using a diacritic.  This function works around
+the confusion that implies when generating IPA from X-SAMPA; it returns a
+Quail map that is a copy of the map for `_', but with all the DIACRITIC
+entries changed to return the diacritic together with the base character,
+and with the map to the implosive added to its end.
+
+Like all `quail-defrule'-assigned functions, this will be called once for
+each particular sequence of keys, the first time the user types that
+particular sequence of keys, and the result will be cached by Quail. "
+  (let* ((input-string (substring input-string 0 (or length)))
+	 (underscore-map (copy-tree (quail-lookup-key "_")))
+	 (split-input (split-string input-string "_"))
+	 (pre-underscore (car split-input))
+	 (pre-underscore-map (quail-lookup-key pre-underscore))
+	 (x-sampa-submap-entry 
+	  (assoc (format "%s<" input-string) x-sampa-implosive-submap))
+	 underscore-map-entry)
+    (if (and (consp pre-underscore-map) (car pre-underscore-map))
+	(setq pre-underscore-map (car pre-underscore-map))
+      (setq pre-underscore-map pre-underscore))
+    (unless (stringp pre-underscore-map)
+      (setq pre-underscore-map (string pre-underscore-map)))
+    (dolist (underscoring underscore-map)
+      (cond ((null underscoring))
+	    ((eq (length underscoring) 2)
+	     (setq underscore-map-entry (second underscoring))
+	     (setcdr underscoring (x-sampa-prepend-to-keymap-entry 
+				   pre-underscore-map underscore-map-entry)))
+	    ((eq (length underscoring) 3)
+	     (setq underscore-map-entry (second (third underscoring)))
+	     (setcdr (third underscoring) 
+		     (x-sampa-prepend-to-keymap-entry 
+		      pre-underscore-map underscore-map-entry)))
+	    (t 
+	     (assert (null t) t 
+		     "Can't handle subtrees of this level right now."))))
+    (append underscore-map (list (list ?< (second x-sampa-submap-entry))))))
+
+(quail-define-package
+ "x-sampa-ipa" "IPA" "IPA-X" t
+ "The International Phonetic Alphabet, using J.C. Wells' X-SAMPA. 
+
+X-SAMPA is an ASCII transliteration of the IPA, normally used for data
+exchange in environments where Unicode is not available.  This input method
+uses this transliteration to allow you to produce the IPA in your editor
+with a keyboard that's limited to ASCII.
+
+See http://www.phon.ucl.ac.uk/home/sampa/ipasam-x.pdf for a full definition
+of the mapping. A caveat with regard to that document; while XEmacs
+currently preserves Unicode diacritics on reading and emitting them, it
+displays them, incorrectly, as separate from the modified glyphs. " nil t t)
+
+(quail-define-rules
+ ;; Table taken from http://en.wikipedia.org/wiki/X-SAMPA, checked with
+ ;; http://www.phon.ucl.ac.uk/home/sampa/ipasam-x.pdf
+
+ ("d`" "ɖ")	;; Voiced retroflex plosive		U+0256
+
+ ;; In the Wikipedia article, they list ASCII g as mapping to U+0261, LATIN
+ ;; SMALL LETTER SCRIPT G. The typeset g is equally acceptable as the voiced
+ ;; velar plosive, however, and we don't override the identity mapping here.
+
+ ("h\\" "ɦ")	;; Voiced glottal fricative		U+0266
+ ("j\\" "ʝ")	;; Voiced palatal fricative		U+029D
+ ("l`" "ɭ")	;; Retroflex lateral approximant	U+026D
+ ("l\\" "ɺ")	;; Alveolar lateral flap		U+027A
+ ("n`" "ɳ")	;; Retroflex nasal			U+0273
+ ("p\\" "ɸ")	;; Voiceless bilabial fricative		U+0278
+ ("r`" "ɽ")	;; Retroflex flap			U+027D
+ ("r\\" "ɹ")	;; Alveolar approximant			U+0279
+ ("r\\`" "ɻ")	;; Retroflex approximant		U+027B
+ ("s`" "ʂ")	;; Voiceless retroflex fricative	U+0282
+ ("s\\" "ɕ")	;; Voiceless alveolo-palatal fricative	U+0255
+ ("t`" "ʈ")	;; Voiceless retroflex plosive		U+0288
+ ("v\\" "ʋ")	;; Labiodental approximant		U+028B
+ ("x\\" "ɧ")	;; Voiceless palatal-velar fricative	U+0267
+ ("z`" "ʐ")	;; Voiced retroflex fricative		U+0290
+ ("z\\" "ʑ")	;; Voiced alveolo-palatal fricative	U+0291
+ ("A" "ɑ")	;; Open back unrounded vowel		U+0251
+ ("B" "β")	;; Voiced bilabial fricative		U+03B2
+ ("B\\" "ʙ")	;; Bilabial trill			U+0299
+ ("C" "ç")	;; Voiceless palatal fricative		U+00E7
+ ("D" "ð")	;; Voiced dental fricative		U+00F0
+ ("E" "ɛ")	;; Open-mid front unrounded vowel	U+025B
+ ("F" "ɱ")	;; Labiodental nasal			U+0271
+ ("G" "ɣ")	;; Voiced velar fricative		U+0263
+ ("G\\" "ɢ")	;; Voiced uvular plosive		U+0262
+ ("H" "ɥ")	;; Labial-palatal approximant		U+0265
+ ("H\\" "ʜ")	;; Voiceless epiglottal fricative	U+029C
+ ("I" "ɪ")	;; Near-close near-front unrounded vowel	U+026A
+ ("I\\" "Ɨ")	;; Central lax close unrounded vowel	U+0197
+ ("J" "ɲ")	;; Palatal nasal			U+0272
+ ("J\\" "ɟ")	;; Voiceless palatal plosive		U+025F
+ ("K" "ɬ")	;; Voiceless alveolar lateral fricative	U+026C
+ ("K\\" "ɮ")	;; Voiced alveolar lateral fricative	U+026E
+ ("L" "ʎ")	;; Palatal lateral approximant		U+028E
+ ("L\\" "ʟ")	;; Velar lateral approximant		U+029F
+ ("M" "ɯ")	;; Close back unrounded vowel		U+026F
+ ("M\\" "ɰ")	;; Velar approximant			U+0270
+ ("N" "ŋ")	;; Velar nasal				U+014B
+ ("N\\" "ɴ")	;; Uvular nasal				U+0274
+ ("O" "ɔ")	;; Open-mid back rounded vowel		U+0254
+ ("O\\" "ʘ")	;; Bilabial click			U+0298
+ ("P" "ʋ")	;; Labiodental approximant		U+028B
+ ("Q" "ɒ")	;; Open back rounded vowel		U+0252
+ ("R" "ʁ")	;; Voiced uvular fricative		U+0281
+ ("R\\" "ʀ")	;; Uvular trill				U+0280
+ ("S" "ʃ")	;; Voiceless postalveolar fricative	U+0283
+ ("T" "θ")	;; Voiceless dental fricative		U+03B8
+ ("U" "ʊ")	;; Near-close near-back rounded vowel	U+028A
+ ("U\\" ["ʊ̵"])	;; Central lax close rounded vowel, U+028A U+0335
+ ("V" "ʌ")	;; Open-mid back unrounded vowel	U+028C
+ ("W" "ʍ")	;; Voiceless labial-velar fricative	U+028D
+ ("X" "χ")	;; Voiceless uvular fricative		U+03C7
+ ("X\\" "ħ")	;; Voiceless pharyngeal fricative	U+0127
+ ("Y" "ʏ")	;; Near-close near-front rounded vowel	U+028F
+ ("Z" "ʒ")	;; Voiced postalveolar fricative	U+0292
+
+ ("\"" "ˈ")	;; Primary stress			U+02C8
+ ("%" "ˌ")	;; Secondary stress	 		U+02CC
+ (":" "ː")	;; Long					U+02D0
+ (":\\" "ˑ")	;; Half-long				U+02D1
+ ("@" "ə")	;; Schwa				U+0259
+ ("@\\" "ɘ")	;; Close-mid central unrounded vowel	U+0258
+ ("@`" "ɚ")	;; Rhotacised schwa			U+025A
+ ("{" "æ")	;; Near-open front unrounded vowel	U+00E6
+ ("}" "ʉ")	;; Close central rounded vowel		U+0289
+ ("1" "ɨ")	;; Close central unrounded vowel	U+0268
+ ("2" "ø")	;; Close-mid front rounded vowel	U+00F8
+ ("3" "ɜ")	;; Open-mid central unrounded vowel	U+025C
+ ("3\\" "ɞ")	;; Open-mid central rounded vowel	U+025E
+ ("4" "ɾ")	;; Alveolar flap			U+027E
+ ("5" "ɫ")	;; Velarised alveolar lateral approximant	U+026B
+ ("6" "ɐ")	;; Near-open central vowel 		U+0250
+ ("7" "ɤ")	;; Close-mid back unrounded vowel	U+0264
+ ("8" "ɵ")	;; Close-mid central rounded vowel	U+0275
+ ("9" "œ")	;; Open-mid front rounded vowel		U+0153
+ ("&" "ɶ")	;; Open front rounded vowel		U+0276
+ ("?" "ʔ")	;; Glottal stop				U+0294
+ ("?\\" "ʕ")	;; Voiced pharyngeal fricative		U+0295
+ ;; The undefined escape character, ignored. 
+ ;; Indeterminacy in French vowels, ignored.
+ ;; Begin nonsegmental notation, ignored. 
+ ("<\\" "ʢ")	;; Voiced epiglottal fricative		U+02A2
+ ;; End nonsegmental notation, ignored. 
+ (">\\" "ʡ")	;; Epiglottal plosive			U+02A1
+ ("^" "↑")	;; Upstep				U+2191
+ ("!" "↓")	;; Downstep				U+2193
+ ("!\\" "ǃ")	;; Postalveolal click			U+01C3
+ ("\\" "ǀ")	;; Dental click				U+01C0
+ ("\\|\\" "ǁ")	;; Lateral alveolar click		U+01C1
+ ("=\\" "ǂ")	;; Palatal click			U+01C2
+ ("-\\" "̮")	;; Linking mark				U+032E
+
+ ;; Diacritics. Note that XEmacs doesn't yet have composed characters, so we
+ ;; can input them, but they won't display properly. If you send email using
+ ;; them, and the recipient's client is capable, they will get through,
+ ;; though.
+
+ ("_\"" "̈")	;; Centralised		 		U+0308
+ ("_+" "̟")	;; Advanced				U+031F
+ ("_-" "̠")	;; Retracted				U+0320
+ ("_/" "ˇ")	;; Rising tone				U+02C7
+ ("_0" "̥")	;; Voiceless				U+0325
+ ("_=" "̩")	;; Syllabic				U+0329
+ ("=" "̩")	;; Syllabic				U+0329
+ ("_>" "ʼ")	;; Ejective				U+02BC
+ ("_?\\" "ˤ")	;; Pharyngealised			U+02E4
+ ("_\\" "ˆ")	;; Falling Tone				U+02C6
+ ("_^" "̯")	;; Non-syllabic				U+032F
+ ("_}" "̚")	;; No audible release			U+031A
+ ;; ` is alternatively; retroflexion in consonants
+ ("`" "˞")	;; Rhotacization in vowels		U+02DE
+ ("_~" "̃")	;; Nasalisation				U+0303
+ ("~" "̃")	;; Nasalisation				U+0303
+ ("_A" "̘")	;; Advanced tongue root			U+0318
+ ("_a" "̺")	;; Apical				U+033A
+ ("_B" "̏")	;; Extra low tone			U+030F
+ ;; _B_L omitted, no Unicode code point for "low rising tone."
+ ("_c" "̜")	;; Less rounded				U+031C
+ ("_d" "̪")	;; Dental				U+032A
+ ("_e" "̴")	;; Velarised or pharyngeal		U+0334
+
+ ("<F>" "↙")	;; Global fall; SOUTH EAST ARROW; may be a bit smaller than
+		;; intended.
+ ("_F" "̂")	;; Falling tone				U+0302
+ ("_G" "ˠ")	;; Velarised				U+02E0
+ ("_H" "́")	;; High tone				U+0301
+ ;; "_H_T omitted, no Unicode code point for "high rising tone"
+ ("_h" "ʰ")	;; Aspirated				U+02B0
+ ("_j" "ʲ")	;; Palatalised				U+02B2
+ ("'" "ʲ")	;; Palatalised				U+02B2
+ ("_k" "̰")	;; Creaky voice				U+0330
+ ("_L" "̀")	;; Low tone				U+0300
+ ("_l" "ˡ")	;; Lateral release			U+02E1
+ ("_M" "̄")	;; Mid tone				U+0304
+ ("_m" "̻")	;; Laminal				U+033B
+ ("_N" "̼")	;; Linguolabial				U+033C
+ ("_n" "ⁿ")	;; Nasal release			U+207F
+ ("_O" "̹")	;; More rounded				U+0339
+ ("_o" "̞")	;; Lowered				U+031E
+ ("_q" "̙")	;; Retracted tongue root		U+0319
+ ("<R>" "↗")	;; NORTH EAST ARROW; may be a bit smaller than intended. 
+ ("_R" "̌")     ;; Haček, caron, rising tone.		U+030C
+ ;; _R_F omitted, apparently there's no corresponding Unicode entry. 
+ ("_r" "̝")	;; Raised				U+031D
+ ("_T" "̋")	;; Extra high tone			U+030B
+ ("_t" "̤")	;; Breathy voice			U+0324
+ ("_v" "̬")	;; Voiced				U+032C
+ ("_w" "ʷ")	;; Labialised				U+02B7
+ ("_X" "̆")	;; Extra-short				U+0306
+ ("_x" "̽"))	;; Mid-centralised			U+033D
+
+;; Putting in place rules for the implosives like for the others above
+;; breaks the "_<diacritic>" rules for b, d, g, G and J a little--you need
+;; to interrupt Quail before typing the underscore if you want the
+;; diacritic. To avoid this, handle the input specially with the function
+;; x-sampa-underscore-implosive.
+
+(dolist (implosive-x-sampa (mapcar 'car x-sampa-implosive-submap))
+  (setq implosive-x-sampa (car (split-string implosive-x-sampa "_")))
+  (quail-defrule (format "%s_" implosive-x-sampa)
+		 'x-sampa-underscore-implosive))
 
 ;;; arch-tag: cf2614cc-ecce-4ef5-ba51-37faeed41691
 ;;; ipa.el ends here
+

-- 
When I was in the scouts, the leader told me to pitch a tent. I couldn't
find any pitch, so I used creosote.

[-- Attachment #2.1.3: .signature --]
[-- Type: text/plain, Size: 143 bytes --]


-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghé, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?

[-- Attachment #3: Type: message/rfc822, Size: 1554 bytes --]

From: Juri Linkov <juri@jurta.org>
To: Aidan Kehoe <kehoea@parhasard.net>
Cc: 1750-done@emacsbugs.donarmstrong.com
Subject: Re: bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
Date: Mon, 31 Aug 2009 02:35:23 +0300
Message-ID: <87y6p050zo.fsf@mail.jurta.org>

Thank you.

-- 
Juri Linkov
http://www.jurta.org/emacs/

^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#4109: marked as done (Status of IPA patch?)
  2009-08-10 20:30 ` bug#4109: Status of IPA patch? Aidan Kehoe
@ 2009-08-30 23:40   ` Emacs bug Tracking System
  0 siblings, 0 replies; 8+ messages in thread
From: Emacs bug Tracking System @ 2009-08-30 23:40 UTC (permalink / raw)
  To: Juri Linkov

[-- Attachment #1: Type: text/plain, Size: 882 bytes --]

Your message dated Mon, 31 Aug 2009 02:35:23 +0300
with message-id <87y6p050zo.fsf@mail.jurta.org>
and subject line Re: bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
has caused the Emacs bug report #1750,
regarding Status of IPA patch?
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@emacsbugs.donarmstrong.com
immediately.)

-- 
1750: http://emacsbugs.donarmstrong.com/cgi-bin/bugreport.cgi?bug=1750
Emacs Bug Tracking System
Contact owner@emacsbugs.donarmstrong.com with problems

[-- Attachment #2: Type: message/rfc822, Size: 3995 bytes --]

From: Aidan Kehoe <kehoea@parhasard.net>
To: Oleksandr Gavenko <gavenkoa@gmail.com>
Cc: 1750@emacsbugs.donarmstrong.com, emacs-pretest-bug@gnu.org
Subject: Re: Status of IPA patch?
Date: Mon, 10 Aug 2009 21:30:04 +0100
Message-ID: <19072.33612.609778.383349@parhasard.net>

Hi Oleksandr -- 

The omission of those two was intentional, since U+20A7 is equivalent to tʃ
(note that that digraph doesn’t include the tie that would be necessary to
have it as a phoneme distinct from /t/ followed by /ʃ/) and g is equivalent
to U+0261 for the purposes of the IPA.

I don’t know what’s happening with the application of the patch to GNU
Emacs. I use and develop for XEmacs, it hasn’t been a problem for me; maybe
the list can tell us if there are any problems with it. 

Ciao, 

	Aidan

 Ar an deichiú lá de mí Lúnasa, scríobh Oleksandr Gavenko: 

 > You make patch for IPA input method:
 > 
 > http://thread.gmane.org/gmane.emacs.bugs/23612
 > 
 > I online browse source:
 > 
 > http://cvs.savannah.gnu.org/viewvc/emacs/leim/quail/ipa.el?root=emacs&view=log
 > 
 > and see that it not yet applyed. But it provide useful features.
 > It nicely  if you try again ask developer to apply this patch.
 > 
 > Also I mail you because have such question:
 > 
 > There are two chars in English phonetic transcription:
 >   ʧ (\u02A7 LATIN SMALL LETTER TESH DIGRAPH)
 >   and ɡ (\u0261 LATIN SMALL LETTER SCRIPT G)
 > which used in English/Russian dictionaries and
 > they both do not present in your ipa.el file.
 > 
 > TESH DIGRAPH can be typed as t and LATIN SMALL LETTER ESH.
 > SMALL LETTER SCRIPT G can be typed as g (\u0067) - but some fonts
 > display differ (see article http://en.wikipedia.org/wiki/G ).
 > 
 > Can you comment question and if it have reason add support
 > for this chars?

-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghe, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?

[-- Attachment #3: Type: message/rfc822, Size: 1554 bytes --]

From: Juri Linkov <juri@jurta.org>
To: Aidan Kehoe <kehoea@parhasard.net>
Cc: 1750-done@emacsbugs.donarmstrong.com
Subject: Re: bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el.
Date: Mon, 31 Aug 2009 02:35:23 +0300
Message-ID: <87y6p050zo.fsf@mail.jurta.org>

Thank you.

-- 
Juri Linkov
http://www.jurta.org/emacs/

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2009-08-30 23:40 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <87y6p050zo.fsf@mail.jurta.org>
2008-12-31 13:41 ` bug#1750: [PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el Aidan Kehoe
2008-12-31 18:40   ` Stefan Monnier
2008-12-31 19:51     ` Aidan Kehoe
2009-01-19  7:11       ` Aidan Kehoe
2008-12-31 19:57   ` James Cloos
2009-08-30 23:40   ` bug#1750: marked as done ([PATCH] Support standard ASCII-IPA mappings in leim/quail/ipa.el. ) Emacs bug Tracking System
     [not found] <4A807E29.1030405@gmail.com>
2009-08-10 20:30 ` bug#4109: Status of IPA patch? Aidan Kehoe
2009-08-30 23:40   ` bug#4109: marked as done (Status of IPA patch?) Emacs bug Tracking System

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).