From: "Buchs, Kevin" <buchs.kevin@mayo.edu>
To: <help-gnu-emacs@gnu.org>
Subject: Re: those funny non-ASCII characters
Date: Fri, 25 May 2012 08:40:25 -0500 [thread overview]
Message-ID: <F326B9A37B353A449FC7A09B36835F820104816E@MACE.sppdg.ad> (raw)
Thanks, Xah and Eli, for contributing to my further understanding. I
went to a specific website where I got the content I copied and pasted
and I can see from the HTML that it has a charset=UTF-8, so I understand
that is Unicode 8-bit. Using the C-u C-x =, I see that the particular
character I pasted has a code point of 0x2013 (U+2013). I didn't see,
however, what the UTF-8 encoding of that code point was. Should I be
able to read that somewhere on the buffer of information I get with C-u
C-x = ? I was poking around the www.unicode.org website, trying to
understand how this U+2013 code point is encoded into UTF-8, but I
haven't determined that yet.
A fresh buffer in emacs for me on my Win-7 box has an encoding system of
iso-latin-1-dos. The coding system used to open and save files is the
same.
So, help me piece together what happens as I paste the UTF-8 text into a
buffer. First, the paste buffer must define that it is in UTF-8. Emacs
reads this information and inserts it into the byte string that defines
the buffer. Now, how does emacs record that it was a UTF-8 encoded
character? Does it translate it into a different internal encoding
instead of just recording the 8 bits transferred? Is this encoding used
as a superset of all possible encoding systems that emacs supports?
Now, Xah, you suggest I embrace Unicode. What does that mean? Would it
involve marking all my lisp library files and my org-mode files with the
file variable -*- coding: utf-8 -*- ? Or is there another way to go
Unicode automatically?
I assume that if my lisp library files are encoded utf-8, then I can
paste that character from the web page into my call to replace-string in
order to substitute the longer dash of Unicode U+2013 with an ascii
hyphen or double hyphen. But, how does that really work? If the lisp
file is encoded utf-8, then how can I put an ascii character in the
replacement string?
I would appreciate it if someone could help me open this new door in my
brain a bit further.
Kevin Buchs | Senior Engineer | SPPDG | 507-538-5459 |
buchs.kevin@mayo.edu
Mayo Clinic | 200 First Street SW | Rochester, MN 55905 |
http://www.mayo.edu/sppdg
-----Original Message-----
With cursor on that character, type "C-u C-x =", and Emacs will show
everything it knows about that character, including its canonical
name.
next reply other threads:[~2012-05-25 13:40 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-05-25 13:40 Buchs, Kevin [this message]
2012-05-25 14:04 ` those funny non-ASCII characters Eli Zaretskii
2012-05-25 14:42 ` Jambunathan K
[not found] <mailman.1961.1338398127.855.help-gnu-emacs@gnu.org>
2012-06-01 4:23 ` Jason Rumney
2012-06-01 5:43 ` rusi
2012-06-01 6:12 ` Eli Zaretskii
2012-06-01 7:03 ` Xah Lee
2012-06-01 16:26 ` rusi
2012-06-01 21:06 ` Xah Lee
2012-06-02 3:17 ` rusi
2012-06-02 11:54 ` Xah Lee
2012-06-02 14:10 ` Xah Lee
-- strict thread matches above, loose matches on Subject: below --
2012-05-30 17:15 Buchs, Kevin
2012-05-31 7:17 ` Thien-Thi Nguyen
2012-05-31 14:57 ` Buchs, Kevin
2012-05-31 16:40 ` Thien-Thi Nguyen
2012-05-31 16:56 ` Buchs, Kevin
2012-05-31 21:46 ` Thien-Thi Nguyen
2012-06-01 13:36 ` Doug Lewan
[not found] ` <mailman.2041.1338500734.855.help-gnu-emacs@gnu.org>
2012-06-01 2:42 ` rusi
2012-05-31 15:59 ` PJ Weisberg
[not found] <mailman.1665.1337953237.855.help-gnu-emacs@gnu.org>
2012-05-25 18:33 ` Xah Lee
[not found] <mailman.1638.1337903381.855.help-gnu-emacs@gnu.org>
2012-05-25 0:56 ` Xah Lee
2012-05-24 23:49 Buchs, Kevin
2012-05-25 6:36 ` Eli Zaretskii
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=F326B9A37B353A449FC7A09B36835F820104816E@MACE.sppdg.ad \
--to=buchs.kevin@mayo.edu \
--cc=help-gnu-emacs@gnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.