unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
* Entering UTF-8 Characters
@ 2005-12-20  0:11 Cameron Desautels
  2005-12-20 10:11 ` Peter Dyballa
  0 siblings, 1 reply; 12+ messages in thread
From: Cameron Desautels @ 2005-12-20  0:11 UTC (permalink / raw)


Can anyone tell me if there is a way to enter UTF-8 characters
numerically?

My ultimate goal is to type "smart quotes" into UTF-8 files.  I've
looked through some of the coding systems, but I can't find any way at
all to enter these characters.  I do realize that there are excellent
packages in existence which make this an easy task, but I'd rather not
*depend* on a package.  Any tips?
-- 
Cameron Desautels <cam@apt2324.com>


"The day Microsoft makes something that doesn't suck is probably the
day they start making vacuum cleaners."
 -- Ernst Jan Plugge

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
       [not found] <mailman.19876.1135037590.20277.help-gnu-emacs@gnu.org>
@ 2005-12-20  2:14 ` David Hansen
  2005-12-20  7:41   ` Cameron Desautels
       [not found]   ` <mailman.19899.1135064583.20277.help-gnu-emacs@gnu.org>
  2005-12-20  4:06 ` B. T. Raven
  1 sibling, 2 replies; 12+ messages in thread
From: David Hansen @ 2005-12-20  2:14 UTC (permalink / raw)


On Mon, 19 Dec 2005 18:11:59 -0600 Cameron Desautels wrote:

> My ultimate goal is to type "smart quotes" into UTF-8 files.  I've
> looked through some of the coding systems, but I can't find any way at
> all to enter these characters.  I do realize that there are excellent
> packages in existence which make this an easy task, but I'd rather not
> *depend* on a package.  Any tips?

Part of GNU emacs:

M-x set-input-method RET TeX RET

\qlqq: „
\grqq: “
\"`:   „
\"':   “
\"<:   «
\">:   »
\lq:   ‘
\rq:   ’

and lots of more fancy stuff like ∫dϕ = 2π …

M-x describe-input-method RET

David

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
       [not found] <mailman.19876.1135037590.20277.help-gnu-emacs@gnu.org>
  2005-12-20  2:14 ` Entering UTF-8 Characters David Hansen
@ 2005-12-20  4:06 ` B. T. Raven
  2005-12-20 18:16   ` Harald Hanche-Olsen
  1 sibling, 1 reply; 12+ messages in thread
From: B. T. Raven @ 2005-12-20  4:06 UTC (permalink / raw)



"Cameron Desautels" <cam@apt2324.com> wrote in message
news:mailman.19876.1135037590.20277.help-gnu-emacs@gnu.org...
> Can anyone tell me if there is a way to enter UTF-8 characters
> numerically?
>
> My ultimate goal is to type "smart quotes" into UTF-8 files.  I've
> looked through some of the coding systems, but I can't find any way at
> all to enter these characters.  I do realize that there are excellent
> packages in existence which make this an easy task, but I'd rather not
> *depend* on a package.  Any tips?
> --
> Cameron Desautels <cam@apt2324.com>


C-q (octal number) RET

Unfortunately, this number isn't the Unicode number for the code point (I
think). They are at U+201C and D

putting them into a file and then pressing C-x = yields:

 (01234574, 342396, 0x5397c, file ...)

and

 (01234575, 342397, 0x5397d, file ...

So I guess you could type C-q and then those numbers starting with zero.
Very cumbersome. After doing one each you could manually search and
replace initial and final straight quotes with M-% (also manually).

Ed.

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20  2:14 ` Entering UTF-8 Characters David Hansen
@ 2005-12-20  7:41   ` Cameron Desautels
       [not found]   ` <mailman.19899.1135064583.20277.help-gnu-emacs@gnu.org>
  1 sibling, 0 replies; 12+ messages in thread
From: Cameron Desautels @ 2005-12-20  7:41 UTC (permalink / raw)


On Tue, Dec 20, 2005 at 03:14:51AM +0100, David Hansen wrote:
> Part of GNU emacs:
> 
> M-x set-input-method RET TeX RET
> 
> \qlqq: ???
> \grqq: ???
> \"`:   ???
> \"':   ???
> \"<:   «
> \">:   »
> \lq:   ???
> \rq:   ???

Only \lq and \rq work for me--the others do not seem to be valid \TeX
entities for me (i.e. TeX input method doesn't understand them, not
simply that they don't display).

I am running GNU emacs 21.4.1.  It is a mule-utf-8 buffer (little 'u'
in the corner), and I know that it can display the characters properly
(thanks to B. T. Raven's C-q 0 1 2 3 4 5 7 5 RET), but I cannot seem
to input them in the fashion described.  Am I missing a step?
-- 
Cameron Desautels <cam@apt2324.com>


"A witty saying proves nothing."
      --Voltaire (1694-1778)

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20  0:11 Cameron Desautels
@ 2005-12-20 10:11 ` Peter Dyballa
  2005-12-20 13:44   ` Peter Dyballa
  2005-12-20 15:13   ` kokrhac - Ales MATAS
  0 siblings, 2 replies; 12+ messages in thread
From: Peter Dyballa @ 2005-12-20 10:11 UTC (permalink / raw)
  Cc: help-gnu-emacs


Am 20.12.2005 um 01:11 schrieb Cameron Desautels:

> My ultimate goal is to type "smart quotes" into UTF-8 files.

Three choices:

	• octal a in \20030 ... \20037 = ‘ ’ ‚ ‛ “ ” „ ‟ (end with non-digit)
	• ucs input method C-x RET C-\ RET u201c
	• bind a key to insert these by copying from my eMail or utf8.txt in 
kermit distribution

I think the first two methods won't work in GNU Emacs 21.4. This Emacs 
too gives on C-u C-x = not the correct answers. Whether the last method 
works I can't tell, I prefer GNU Emacs 23 because is has Unicode 
support that you can actually use (at almost 100% of the time, printing 
only works via htmlizing the buffer and then printing out of an 
Internet browser). Here is GNU Emacs 23's description of a "smart" 
quote:

         character: „ (020036, 8222, 0x201e)
preferred charset: [iso-8859-13] (ISO/IEC 8859/13)
        code point: [0xA5]
            syntax: . 	which means: punctuation
       buffer code: 0xE2 0x80 0x9E
         file code: 0xE2 0x80 0x9E (encoded by coding system utf-8)
           display: by this font (glyph code)
      
-B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO10646-1 
(0x201E)

Character code properties are not shown: [customize what to show]

There are text properties here:
   fontified            t


--
Greetings

   Pete

"A smart quote proves nothing."
      --Pete (19??-20??)

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20 10:11 ` Peter Dyballa
@ 2005-12-20 13:44   ` Peter Dyballa
  2005-12-20 15:13   ` kokrhac - Ales MATAS
  1 sibling, 0 replies; 12+ messages in thread
From: Peter Dyballa @ 2005-12-20 13:44 UTC (permalink / raw)
  Cc: Emacs help


Am 20.12.2005 um 11:11 schrieb Peter Dyballa:

> 	• octal a in \20030 ... \20037 = ‘ ’ ‚ ‛ “ ” „ ‟ (end with non-digit)
>

Sorry! This line slipped through the quality control!

I meant to write:

	• octal as in C-q 2 0 0 3 0 ... C-q 2 0 0 3 7 = ‘ ’ ‚ ‛ “ ” „ ‟ (end 
with non-digit)

It doesn't seem to make a difference whether you start with 0 or 
without.

--
Greetings

   Pete

"What is this talk of 'release'?  Klingons do not make software
'releases'.  Our software 'escapes' leaving a bloody trail of
designers and quality assurance people in its wake."

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
       [not found]   ` <mailman.19899.1135064583.20277.help-gnu-emacs@gnu.org>
@ 2005-12-20 14:02     ` David Hansen
  2005-12-20 17:44       ` Cameron Desautels
  0 siblings, 1 reply; 12+ messages in thread
From: David Hansen @ 2005-12-20 14:02 UTC (permalink / raw)


On Tue, 20 Dec 2005 01:41:05 -0600 Cameron Desautels wrote:

> On Tue, Dec 20, 2005 at 03:14:51AM +0100, David Hansen wrote:
>> Part of GNU emacs:
>> 
>> M-x set-input-method RET TeX RET
>> 
>> \qlqq: ???
>> \grqq: ???
>> \"`:   ???
>> \"':   ???
>> \"<:   «
>> \">:   »
>> \lq:   ???
>> \rq:   ???
>
> Only \lq and \rq work for me--the others do not seem to be valid \TeX
> entities for me (i.e. TeX input method doesn't understand them, not
> simply that they don't display).
>
> I am running GNU emacs 21.4.1.  It is a mule-utf-8 buffer (little 'u'
> in the corner), and I know that it can display the characters properly
> (thanks to B. T. Raven's C-q 0 1 2 3 4 5 7 5 RET), but I cannot seem
> to input them in the fashion described.  Am I missing a step?

Probably my fault:  I'm running emacs from CVS.  You can try

M-x describe-input-method TeX RET

and scroll through all the chars.

Another alternative I used for quite some time may be

ftp://ftp.dante.de/tex-archive/support/emacs-modes/u8tex.el

It's very similar to the TeX input method but uses a '/' instead
of '\'.  Or maybe (never tried this)

http://groups.google.com/group/gnu.emacs.sources/msg/6f1779b22c7110ea

David

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20 10:11 ` Peter Dyballa
  2005-12-20 13:44   ` Peter Dyballa
@ 2005-12-20 15:13   ` kokrhac - Ales MATAS
  1 sibling, 0 replies; 12+ messages in thread
From: kokrhac - Ales MATAS @ 2005-12-20 15:13 UTC (permalink / raw)


I use two ways imultaneous


- ucs-insert function (I didn't know mention shortcut, thanks ;] )

- xmlunicode package and its unicode-character-insert function (called 
with C-u prefix - then is unconditionaly inserted the unicode char and 
not xml &#....; like notation)

You could find it at
   http://nwalsh.com/emacs/xmlchars/
It is not very well documented (I think), but if you watch directly the 
code or at least the documentation of the functions you can find it very 
useful

- another alternative is nxml. I contain command nxml-insert-named-char 
(C-c C-u). It has same functionality as unicode-character-insert. So if 
you are editing xml(-like) file you needn't to use xmlunicode package.

	Ales Matas

Peter Dyballa wrote:

> 
> Am 20.12.2005 um 01:11 schrieb Cameron Desautels:
> 
>> My ultimate goal is to type "smart quotes" into UTF-8 files.
> 
> 
> Three choices:
> 
>     • octal a in \20030 ... \20037 = ‘ ’ ‚ ‛ “ ” „ ‟ (end with non-digit)
>     • ucs input method C-x RET C-\ RET u201c
>     • bind a key to insert these by copying from my eMail or utf8.txt in 
> kermit distribution
> 
> I think the first two methods won't work in GNU Emacs 21.4. This Emacs 
> too gives on C-u C-x = not the correct answers. Whether the last method 
> works I can't tell, I prefer GNU Emacs 23 because is has Unicode support 
> that you can actually use (at almost 100% of the time, printing only 
> works via htmlizing the buffer and then printing out of an Internet 
> browser). Here is GNU Emacs 23's description of a "smart" quote:
> 
>         character: „ (020036, 8222, 0x201e)
> preferred charset: [iso-8859-13] (ISO/IEC 8859/13)
>        code point: [0xA5]
>            syntax: .     which means: punctuation
>       buffer code: 0xE2 0x80 0x9E
>         file code: 0xE2 0x80 0x9E (encoded by coding system utf-8)
>           display: by this font (glyph code)
>      
> -B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO10646-1 
> (0x201E)
> 
> Character code properties are not shown: [customize what to show]
> 
> There are text properties here:
>   fontified            t
> 
> 
> -- 
> Greetings
> 
>   Pete
> 
> "A smart quote proves nothing."
>      --Pete (19??-20??)
> 
> 
> 
> _______________________________________________
> help-gnu-emacs mailing list
> help-gnu-emacs@gnu.org
> http://lists.gnu.org/mailman/listinfo/help-gnu-emacs
> 
> 
> 

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20 14:02     ` David Hansen
@ 2005-12-20 17:44       ` Cameron Desautels
  0 siblings, 0 replies; 12+ messages in thread
From: Cameron Desautels @ 2005-12-20 17:44 UTC (permalink / raw)


On Tue, Dec 20, 2005 at 03:02:10PM +0100, David Hansen wrote:
> Probably my fault:  I'm running emacs from CVS.  You can try
> 
> M-x describe-input-method TeX RET
> 
> and scroll through all the chars.

Ha, perhaps we've discovered another difference between CVS emacs and
mine.  21.4.1 doesn't list all of the characters in the description.
Just thought I'd point that out for the other readers.

In any case, thanks for the help.
-- 
Cameron Desautels <cam@apt2324.com>


Given infinite time, 100 monkeys could type out the complete works of
Shakespeare.  Win 98 source code? Eight monkeys, five minutes.

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20  4:06 ` B. T. Raven
@ 2005-12-20 18:16   ` Harald Hanche-Olsen
  2005-12-20 23:45     ` B. T. Raven
  0 siblings, 1 reply; 12+ messages in thread
From: Harald Hanche-Olsen @ 2005-12-20 18:16 UTC (permalink / raw)


+ "B. T. Raven" <ecinmn@peoplepc.com>:

| C-q (octal number) RET
|
| Unfortunately, this number isn't the Unicode number for the code
| point (I think).

No, unless you are using the emacs-unicode-2 branch from CVS ...

| They are at U+201C and D

... in which case you can (setq read-quoted-char-radix 16) - or its
interactive equivalent M-x set-variable ... - and type C-q 201c RET.

-- 
* Harald Hanche-Olsen     <URL:http://www.math.ntnu.no/~hanche/>
- Debating gives most of us much more psychological satisfaction
  than thinking does: but it deprives us of whatever chance there is
  of getting closer to the truth.  -- C.P. Snow

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20 18:16   ` Harald Hanche-Olsen
@ 2005-12-20 23:45     ` B. T. Raven
  2006-01-11 12:26       ` kokrhac - Ales MATAS
  0 siblings, 1 reply; 12+ messages in thread
From: B. T. Raven @ 2005-12-20 23:45 UTC (permalink / raw)



"Harald Hanche-Olsen" <hanche@math.ntnu.no> wrote in message
news:pcovexjfxcn.fsf@shuttle.math.ntnu.no...
> + "B. T. Raven" <ecinmn@peoplepc.com>:
>
> | C-q (octal number) RET
> |
> | Unfortunately, this number isn't the Unicode number for the code
> | point (I think).
>
> No, unless you are using the emacs-unicode-2 branch from CVS ...
>
> | They are at U+201C and D
>
> ... in which case you can (setq read-quoted-char-radix 16) - or its
> interactive equivalent M-x set-variable ... - and type C-q 201c RET.

No, 21.3.1 windows98. Thanks for the heads-up on that variable. I didn't
know it existed. It didn't work of course. I'll wait for someone to build
w32 binaries in version 23, or 24, or 25...

Ed

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: Entering UTF-8 Characters
  2005-12-20 23:45     ` B. T. Raven
@ 2006-01-11 12:26       ` kokrhac - Ales MATAS
  0 siblings, 0 replies; 12+ messages in thread
From: kokrhac - Ales MATAS @ 2006-01-11 12:26 UTC (permalink / raw)


B. T. Raven wrote:

> "Harald Hanche-Olsen" <hanche@math.ntnu.no> wrote in message
> news:pcovexjfxcn.fsf@shuttle.math.ntnu.no...
> 
>>+ "B. T. Raven" <ecinmn@peoplepc.com>:
>>
>>| C-q (octal number) RET
>>|
>>| Unfortunately, this number isn't the Unicode number for the code
>>| point (I think).
>>
>>No, unless you are using the emacs-unicode-2 branch from CVS ...
>>
>>| They are at U+201C and D
>>
>>... in which case you can (setq read-quoted-char-radix 16) - or its
>>interactive equivalent M-x set-variable ... - and type C-q 201c RET.
> 
> 
> No, 21.3.1 windows98. Thanks for the heads-up on that variable. I didn't
> know it existed. It didn't work of course. I'll wait for someone to build
> w32 binaries in version 23, or 24, or 25...

On
   http://nqmacs.sourceforge.net/
you can find win binaries of emacs v. 22.0.50.1

But C-q also doesnot work with Unicode codes in that build.
But through
   C-u C-x =
you can find emacs codes easilly
(U+201C is 0x5397c, U+201D is 0x5397d)

But I don't know if it is suitable for you ...

	Ales Matas

> 
> Ed
> 
> _______________________________________________
> help-gnu-emacs mailing list
> help-gnu-emacs@gnu.org
> http://lists.gnu.org/mailman/listinfo/help-gnu-emacs
> 
> 
> 
> 

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2006-01-11 12:26 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <mailman.19876.1135037590.20277.help-gnu-emacs@gnu.org>
2005-12-20  2:14 ` Entering UTF-8 Characters David Hansen
2005-12-20  7:41   ` Cameron Desautels
     [not found]   ` <mailman.19899.1135064583.20277.help-gnu-emacs@gnu.org>
2005-12-20 14:02     ` David Hansen
2005-12-20 17:44       ` Cameron Desautels
2005-12-20  4:06 ` B. T. Raven
2005-12-20 18:16   ` Harald Hanche-Olsen
2005-12-20 23:45     ` B. T. Raven
2006-01-11 12:26       ` kokrhac - Ales MATAS
2005-12-20  0:11 Cameron Desautels
2005-12-20 10:11 ` Peter Dyballa
2005-12-20 13:44   ` Peter Dyballa
2005-12-20 15:13   ` kokrhac - Ales MATAS

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).