all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Eli Zaretskii <eliz@gnu.org>
To: help-gnu-emacs@gnu.org
Subject: Re: Regexp capturing unicode characters
Date: Thu, 01 Aug 2024 08:15:40 +0300	[thread overview]
Message-ID: <865xskygar.fsf@gnu.org> (raw)
In-Reply-To: <dsvxyTSPY2IeArhvS10w_f4j9Hiw3A1eCZCdlBBOIvjH37zyHj8dKii8j5fTodda-SST4ecImQ7L_CE37hVNws5Tzf0Sz_-2TCGfdqALx7k=@protonmail.com> (message from Heime on Wed, 31 Jul 2024 21:24:46 +0000)

> Date: Wed, 31 Jul 2024 21:24:46 +0000
> From: Heime <heimeborgia@protonmail.com>
> 
> I am using unicode characters in my elisp code (e.g. foreign language symbols in icelandic
> and spanish).
> 
> Is the regexp [[:word:]] appropriate to capture them ?

No.  [[:word:]] matches characters that have the word syntax, so which
characters match depends on the major mode.  My suggestion is to use
either [[:alnum:]] or [[:alpha:]] instead, depending on whether you
want or don't want to match digit characters.

The meaning of each character class is documented in the "Char
Classes" node of the ELisp Reference manual, I suggest to read it and
choose the most appropriate one for your needs.



  parent reply	other threads:[~2024-08-01  5:15 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-07-31 21:24 Regexp capturing unicode characters Heime
2024-07-31 21:50 ` Heime
2024-08-01  5:15 ` Eli Zaretskii [this message]
2024-08-01 11:26   ` Heime
2024-08-01 12:10     ` Eli Zaretskii
2024-08-01 13:43       ` Heime
2024-08-01 14:30         ` Michael Heerdegen via Users list for the GNU Emacs text editor
2024-08-01 15:34         ` Eli Zaretskii
2024-08-01 17:06           ` Heime
2024-08-01 17:46             ` Eli Zaretskii
2024-08-01 19:44               ` Heime
2024-08-02  5:44                 ` Eli Zaretskii
2024-08-02  8:03                   ` uzibalqa

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=865xskygar.fsf@gnu.org \
    --to=eliz@gnu.org \
    --cc=help-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.