From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Mark H Weaver Newsgroups: gmane.lisp.guile.user Subject: Re: Filename encoding Date: Wed, 15 Jan 2014 16:34:26 -0500 Message-ID: <87mwixrkt9.fsf@netris.org> References: <20140115125246.53fc72c7@bother.homenet> <87bnzdun74.fsf@netris.org> <834n553w64.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1389821948 16928 80.91.229.3 (15 Jan 2014 21:39:08 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 15 Jan 2014 21:39:08 +0000 (UTC) Cc: guile-user@gnu.org To: Eli Zaretskii Original-X-From: guile-user-bounces+guile-user=m.gmane.org@gnu.org Wed Jan 15 22:39:12 2014 Return-path: Envelope-to: guile-user@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1W3YB5-0005dq-Hz for guile-user@m.gmane.org; Wed, 15 Jan 2014 22:39:11 +0100 Original-Received: from localhost ([::1]:57373 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1W3YB5-0000p9-4i for guile-user@m.gmane.org; Wed, 15 Jan 2014 16:39:11 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:37558) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1W3Y8y-0006c3-U0 for guile-user@gnu.org; Wed, 15 Jan 2014 16:37:06 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1W3Y8t-0006pN-3R for guile-user@gnu.org; Wed, 15 Jan 2014 16:37:00 -0500 Original-Received: from world.peace.net ([96.39.62.75]:50805) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1W3Y8s-0006p2-WA; Wed, 15 Jan 2014 16:36:55 -0500 Original-Received: from 209-6-91-212.c3-0.smr-ubr1.sbo-smr.ma.cable.rcn.com ([209.6.91.212] helo=yeeloong) by world.peace.net with esmtpsa (TLS1.0:DHE_RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1W3Y8l-0005UU-4y; Wed, 15 Jan 2014 16:36:48 -0500 In-Reply-To: <834n553w64.fsf@gnu.org> (Eli Zaretskii's message of "Wed, 15 Jan 2014 21:02:59 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3 (gnu/linux) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 96.39.62.75 X-BeenThere: guile-user@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: General Guile related discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guile-user-bounces+guile-user=m.gmane.org@gnu.org Original-Sender: guile-user-bounces+guile-user=m.gmane.org@gnu.org Xref: news.gmane.org gmane.lisp.guile.user:11019 Archived-At: Eli Zaretskii writes: >> From: Mark H Weaver >> Date: Wed, 15 Jan 2014 13:14:39 -0500 >> Cc: guile-user@gnu.org >> >> My hope is that this will become less of an issue over time, as systems >> increasingly standardize on UTF-8. I see no other good solution. >> >> Thoughts? > > MS-Windows filesystems will not standardize on UTF-8 in any observable > future. Well, I understand that MS has standardized on UTF-16 (right?) but what matters from Guile's perspective is the encoding used by the POSIX-style interfaces that Guile uses, such as 'open'. Do you know what encoding that is on Windows? > Likewise, in some Far Eastern cultures, non-UTF encoding are still > widely used. *nod* > An "other good solution" is to decode file names into Unicode based > representation (which can be UTF-8) for internal handling, then encode > them back into the locale-specific encoding when passing them to > system calls and library functions that receive file names. This is > what Emacs does. That's what Guile does too. Internally, all strings are Unicode. At present we use either Latin-1 or UTF-32, but I intend to change the internal representation to UTF-8 at some point. Thanks, Mark