From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: ludo@gnu.org (Ludovic =?iso-8859-1?Q?Court=E8s?=) Newsgroups: gmane.lisp.guile.devel Subject: Re: Unicode, ports and encoding Date: Wed, 18 Feb 2009 09:48:34 +0100 Message-ID: <87r61wxh99.fsf@gnu.org> References: <550226.89448.qm@web37908.mail.mud.yahoo.com> <87ocx0hgpv.fsf@gnu.org> <559772.471.qm@web37903.mail.mud.yahoo.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1234946955 3568 80.91.229.12 (18 Feb 2009 08:49:15 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 18 Feb 2009 08:49:15 +0000 (UTC) To: guile-devel@gnu.org Original-X-From: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Wed Feb 18 09:50:30 2009 Return-path: Envelope-to: guile-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1LZi8R-0000o3-OD for guile-devel@m.gmane.org; Wed, 18 Feb 2009 09:50:28 +0100 Original-Received: from localhost ([127.0.0.1]:33159 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LZi77-0008UB-Gy for guile-devel@m.gmane.org; Wed, 18 Feb 2009 03:49:05 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1LZi6r-0008Oc-Bh for guile-devel@gnu.org; Wed, 18 Feb 2009 03:48:49 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1LZi6p-0008Nw-M5 for guile-devel@gnu.org; Wed, 18 Feb 2009 03:48:48 -0500 Original-Received: from [199.232.76.173] (port=51496 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LZi6p-0008Nt-IJ for guile-devel@gnu.org; Wed, 18 Feb 2009 03:48:47 -0500 Original-Received: from main.gmane.org ([80.91.229.2]:48412 helo=ciao.gmane.org) by monty-python.gnu.org with esmtps (TLS-1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1LZi6p-0006i1-3n for guile-devel@gnu.org; Wed, 18 Feb 2009 03:48:47 -0500 Original-Received: from list by ciao.gmane.org with local (Exim 4.43) id 1LZi6n-0006aR-S4 for guile-devel@gnu.org; Wed, 18 Feb 2009 08:48:45 +0000 Original-Received: from 193.50.110.227 ([193.50.110.227]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 18 Feb 2009 08:48:45 +0000 Original-Received: from ludo by 193.50.110.227 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 18 Feb 2009 08:48:45 +0000 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 20 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 193.50.110.227 X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 30 =?iso-8859-1?Q?Pluvi=F4se?= an 217 de la =?iso-8859-1?Q?R=E9volution?= X-PGP-Key-ID: 0xEA52ECF4 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 821D 815D 902A 7EAB 5CEE D120 7FBA 3D4F EB1F 5364 X-OS: i686-pc-linux-gnu User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.90 (gnu/linux) Cancel-Lock: sha1:Z+cOZIJ2pmzTG+tg2pIgkXdYCVk= X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: guile-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Developers list for Guile, the GNU extensibility library" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Errors-To: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.lisp.guile.devel:8178 Archived-At: Hi Mike, Mike Gran writes: > I thought I could start there, but, it isn't easy. There is a lot that could > be broken by modifying string processing. So I tried writing some tests > first so I can check my work as I go along. But the tests have to be > non-ASCII, so they need to be converted when they are read in. > It gets a little bit circular using scm_from_locale_string to convert > non-ASCII strings in the test source, and then having the test check > the behavior of scm_from_locale_string. I see. OTOH, it should be possible to write plain C tests that would create strings using `scm_from_{utf8,locale}_string ()' (with sample UTF-8 strings hardwired as raw byte arrays) and from there test `scm_string_ref ()', etc., and all of the functions. What do you think? Thanks, Ludo'.