all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Marcin Borkowski <mbork@wmi.amu.edu.pl>
To: "help-gnu-emacs\@gnu.org" <help-gnu-emacs@gnu.org>
Subject: Re: How to check whether a character (or one-character string) is a letter?
Date: Sun, 05 Oct 2014 02:08:38 +0200	[thread overview]
Message-ID: <874mvjml89.fsf@wmi.amu.edu.pl> (raw)
In-Reply-To: <CAP_d_8V7itX8shGC3enP4-B91_vjRdtR36MyhGB-AW96uYy4aw@mail.gmail.com>


On 2014-10-04, at 06:08, Yuri Khan wrote:

> On Sat, Oct 4, 2014 at 7:29 AM, Marcin Borkowski <mbork@wmi.amu.edu.pl> wrote:
>
>> The reason I'm asking is that I'm writing a function which converts an
>> arbitrary string to a valid (and nice) filename (e.g., only letters and
>> hyphens) - so basically I want to walk a string character by character
>> and convert any space to a hyphen and omit any other non-letter.  Am I
>> reinventing the wheel?
>
> What are your assumptions about input string arbitrariness, your
> requirements about output filename niceness, and your requirements
> about the properties of the mapping?
>
> Because these may be in conflict.
>
> For example, if you assume any arbitrary strings, want only
> [-0-9A-Za-z_] characters, and want reasonably different strings to map
> into different filenames, then you will end up having to preserve
> non-nice characters as ugly character encodings (in the spirit of
> urlencode, XML character references, or Punycode). Otherwise, whole
> words or sentences in Russian, Japanese or Greek will map into an
> empty filename.

Good point.  However, I intend to keep a list of filenames, and in case
some of them is already taken, append a number to it.  (This is an
extremely primitive hashing function, but it will suffice for my needs.)

Regards,

-- 
Marcin Borkowski
http://octd.wmi.amu.edu.pl/en/Marcin_Borkowski
Adam Mickiewicz University



  reply	other threads:[~2014-10-05  0:08 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-10-04  0:29 How to check whether a character (or one-character string) is a letter? Marcin Borkowski
2014-10-04  1:38 ` Thorsten Jolitz
     [not found] ` <CAOj2CQQsnNxtUPzPV8Vw_DgfGFXUUkZExHbArAu_zDjQn-prvw@mail.gmail.com>
2014-10-04  2:47   ` Fwd: " John Mastro
2014-10-05  0:11     ` Marcin Borkowski
2014-10-04  2:58 ` Eric Abrahamsen
2014-10-04  4:08 ` Yuri Khan
2014-10-05  0:08   ` Marcin Borkowski [this message]
2014-10-04  7:29 ` Eli Zaretskii

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=874mvjml89.fsf@wmi.amu.edu.pl \
    --to=mbork@wmi.amu.edu.pl \
    --cc=help-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.