all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Marcin Borkowski <mbork@wmi.amu.edu.pl>
To: "help-gnu-emacs\@gnu.org" <help-gnu-emacs@gnu.org>
Subject: Re: Fwd: How to check whether a character (or one-character string) is a letter?
Date: Sun, 05 Oct 2014 02:11:03 +0200	[thread overview]
Message-ID: <8738b3ml48.fsf@wmi.amu.edu.pl> (raw)
In-Reply-To: <CAOj2CQS_5PFwoGn-c0AELNe_Q4xUdgGryHC9Zgp7iKVKtox7iw@mail.gmail.com>


On 2014-10-04, at 04:47, John Mastro wrote:

> [I first sent this directly to Marcin in error - yeah, I use the email gateway]
>
> Hi Marcin,
>
> Marcin Borkowski <mbork@wmi.amu.edu.pl> wrote:
>> Assume that I have a character (taken from some string, which in turn is
>> copied from the buffer - so it need not be ASCII).  What is the best way
>> to check whether it is a letter within ASCII range?
>>
>> The reason I'm asking is that I'm writing a function which converts an
>> arbitrary string to a valid (and nice) filename (e.g., only letters and
>> hyphens) - so basically I want to walk a string character by character
>> and convert any space to a hyphen and omit any other non-letter.  Am I
>> reinventing the wheel?
>
> There are a bunch of ways to do this, but one reasonable approach is to
> use a regular expression. I think this will do what you want:
>
>     (defun reasonable-filename (str)
>       (let* ((str (replace-regexp-in-string "[ \t\n\r]" "-" str))
>              (str (replace-regexp-in-string "[^a-zA-Z-]" "" str)))
>         str))

I think this is probably better than mapcar'ing through the string...

> This is a variation which will also allow the result to contain numbers:
>
>     (defun reasonable-filename (str)
>       (let* ((str (replace-regexp-in-string "[ \t\n\r]" "-" str))
>              (str (replace-regexp-in-string "[^a-zA-Z0-9-]" "" str)))
>         str))

This I don't want, since in case of equal filenames, I want to
differentiate them by appending a number, and allowing digits might
break this.  But thanks anyway.

> To answer your question about identifying whether a character is an
> ASCII letter, the key is that Emacs's characters are really "just"
> integers. Wikipedia has some charts[1] that show the numbers associated
> with the characters. The letters are conveniently grouped together, so
> we can use something like this:
>
>     (defun ascii-letter-p (char)
>       (and (characterp char)
>            (>= char 65)
>            (<= char 122)))
>
> (Of course, this only works if it's really a character, as opposed to a
> string of length one. If it's a string of length one you could either
> "extract" the character with `aref' or use a regular expression
> instead.)
>
> Hope that helps.

Yes it does!

Best,

-- 
Marcin Borkowski
http://octd.wmi.amu.edu.pl/en/Marcin_Borkowski
Adam Mickiewicz University



  reply	other threads:[~2014-10-05  0:11 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-10-04  0:29 How to check whether a character (or one-character string) is a letter? Marcin Borkowski
2014-10-04  1:38 ` Thorsten Jolitz
     [not found] ` <CAOj2CQQsnNxtUPzPV8Vw_DgfGFXUUkZExHbArAu_zDjQn-prvw@mail.gmail.com>
2014-10-04  2:47   ` Fwd: " John Mastro
2014-10-05  0:11     ` Marcin Borkowski [this message]
2014-10-04  2:58 ` Eric Abrahamsen
2014-10-04  4:08 ` Yuri Khan
2014-10-05  0:08   ` Marcin Borkowski
2014-10-04  7:29 ` Eli Zaretskii

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=8738b3ml48.fsf@wmi.amu.edu.pl \
    --to=mbork@wmi.amu.edu.pl \
    --cc=help-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.