From: Xah <xahlee@gmail.com>
To: help-gnu-emacs@gnu.org
Subject: Re: Making re-search-forward search for \377
Date: Sun, 2 Nov 2008 12:32:53 -0800 (PST) [thread overview]
Message-ID: <dc53e4fd-316c-44a9-9f7f-d7455191b623@e1g2000pra.googlegroups.com> (raw)
In-Reply-To: 87prlepk45.fsf@pcdesk.net
Xah Lee wrote:
> Xah<xah...@gmail.com> writes:
> > what's the C-q 377 char?
>
> > if i press Ctrl+q 377 Enter, i get this char: ÿ, which is LATIN SMALL
> > LETTER Y WITH DIAERESIS (unicode U+00FF).
>
> > Then if i do:
>
> > (re-search-forward "ÿ")
Tyler Spivey wrote:
> I'm probably going to end up working with binary data in a temp
> buffer. Doing more research, I want enable-multibyte-characters to be
> off. Given that, if we go to *scratch*
> and run M-X toggle-enable-multibyte-characters until that variable
> becomes nil, doing C-Q 377 RET gives 0xff, which is what I want
> (according to C-x =, C-u C-x = and M-x describe-char). Now to
> match it, I try:
>
> (re-search-forward "\xff") - no luck
sorry can't help you much there. ...i don't have much experience
working with binary data.
> What did you use to figure out that the multibyte version of that
> character was 0x00FF? I found it out accidentally as a lisp error, but
> none of the previously described commands (C-X =, M-X describe-char or
> C-u C-x =) will show that it is 0x00ff, they just show FF.
installing a unicode data file is probably what you need.
Q: I have this character α on the screen. How to find out its
unicode's hex value or name?
You can find out a character's decimal, octal, or hex values by
placing your cursor on the character, and type “Alt+x what-cursor-
position” (Ctrl+x =). You can get more info if you place your cursor
on the character, then press “Ctrl+u Ctrl+x =”.
However, if you want the complete unicode info of a character, you
need to download a unicode data file and let emacs know where it is.
The unicode data file can be downloaded at: http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
After you downloaded it, place the following code in your “~/.emacs”
to let emacs know where it is:
; set unicode data file location. (used by what-cursor-position)
(let ((x "~/Documents/emacs/UnicodeData.txt"))
(when (file-exists-p x)
(setq describe-char-unicodedata-file x)))
Then restart emacs. Once you've done this, then place your cursor on a
unicode char, and do “Ctrl+u Ctrl+x =”, then emacs will give you all
the unicode info about that char, including the code point in decimal,
octal, hex notations, as well the unicode character name, category,
the font emacs is using, and others.
For example, here's the output on the character “α”:
character: α (332721, #o1211661, #x513b1, U+03B1)
charset: mule-unicode-0100-24ff
(Unicode characters of the range U+0100..U+24FF.)
code point: #x27 #x31
syntax: w which means: word
category: g:Greek
buffer code: #x9C #xF4 #xA7 #xB1
file code: #xCE #xB1 (encoded by coding system mule-utf-8-unix)
display: by this font (glyph code)
-apple-symbol-medium-r-normal--14-140-72-72-m-140-mac-symbol
(#x61)
Unicode data:
Name: GREEK SMALL LETTER ALPHA
Category: lowercase letter
Combining class: Spacing
Bidi category: Left-to-Right
Uppercase: Α
Titlecase: Α
There are text properties here:
fontified t
this page might help you if you work with unicode.
http://xahlee.org/emacs/emacs_n_unicode.html
Xah
∑ http://xahlee.org/
☄
next prev parent reply other threads:[~2008-11-02 20:32 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-11-02 7:31 Making re-search-forward search for \377 Tyler Spivey
2008-11-02 8:45 ` Xah
2008-11-02 9:12 ` Tyler Spivey
2008-11-02 18:10 ` Kevin Rodgers
2008-11-02 20:32 ` Xah [this message]
2008-11-02 22:35 ` Tyler Spivey
2008-11-03 4:21 ` Eli Zaretskii
[not found] ` <mailman.2743.1225686066.25473.help-gnu-emacs@gnu.org>
2008-11-03 4:54 ` Tyler Spivey
2008-11-03 19:42 ` Eli Zaretskii
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/emacs/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=dc53e4fd-316c-44a9-9f7f-d7455191b623@e1g2000pra.googlegroups.com \
--to=xahlee@gmail.com \
--cc=help-gnu-emacs@gnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).