From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: ludo@gnu.org (Ludovic =?iso-8859-1?Q?Court=E8s?=) Newsgroups: gmane.lisp.guile.devel Subject: Re: Unicode I/O Date: Mon, 03 Jan 2011 23:58:30 +0100 Message-ID: <87pqsdr3vt.fsf@gnu.org> References: <87d3san2l6.fsf@inria.fr> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: dough.gmane.org 1294095551 20429 80.91.229.12 (3 Jan 2011 22:59:11 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 3 Jan 2011 22:59:11 +0000 (UTC) To: guile-devel@gnu.org Original-X-From: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Mon Jan 03 23:59:03 2011 Return-path: Envelope-to: guile-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PZtMg-0001Dq-10 for guile-devel@m.gmane.org; Mon, 03 Jan 2011 23:58:58 +0100 Original-Received: from localhost ([127.0.0.1]:58224 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PZtMf-0001oJ-Hg for guile-devel@m.gmane.org; Mon, 03 Jan 2011 17:58:57 -0500 Original-Received: from [140.186.70.92] (port=49122 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PZtMV-0001nC-GO for guile-devel@gnu.org; Mon, 03 Jan 2011 17:58:48 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PZtMU-0008Cx-HC for guile-devel@gnu.org; Mon, 03 Jan 2011 17:58:47 -0500 Original-Received: from lo.gmane.org ([80.91.229.12]:42855) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PZtMU-0008Ch-Bj for guile-devel@gnu.org; Mon, 03 Jan 2011 17:58:46 -0500 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1PZtMR-000143-3j for guile-devel@gnu.org; Mon, 03 Jan 2011 23:58:43 +0100 Original-Received: from yoda.fdn.fr ([80.67.169.18]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 03 Jan 2011 23:58:43 +0100 Original-Received: from ludo by yoda.fdn.fr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 03 Jan 2011 23:58:43 +0100 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 29 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: yoda.fdn.fr X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 14 =?iso-8859-1?Q?Niv=F4se?= an 219 de la =?iso-8859-1?Q?R=E9volution?= X-PGP-Key-ID: 0xEA52ECF4 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 83C4 F8E5 10A3 3B4C 5BEA D15D 77DD 95E2 EA52 ECF4 X-OS: x86_64-unknown-linux-gnu User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/23.2 (gnu/linux) Cancel-Lock: sha1:tsJYUN30R6HE7uapSbh0sbHGGVQ= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: guile-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Developers list for Guile, the GNU extensibility library" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Errors-To: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.lisp.guile.devel:11290 Archived-At: Hello Guilers, and Happy New Year! :-) My resolution for the beginning of this year is to address this: > Guile currently uses libunistring’s ‘u32_conv_from_encoding’ when > reading text from an input port whose encoding isn’t Latin-1 (similarly > when writing to output ports.) > > An issue with that is that escaping non-representable characters is > handled by libunistring, with a syntax different from the one we’d like > (Guile or R6RS string escapes.) So > ‘scm_i_unistring_escapes_to_{guile,r6rs}_escapes’ kludgely attempt to > substitute the right escapes. > > The problems with this approach are discussed in the thread at: > > http://lists.gnu.org/archive/html/bug-libunistring/2010-09/msg00004.html > > The conclusion is that we’d better use raw ‘iconv’ calls in such > cases... I’ve just pushed a ‘wip-iconv’ branch, which currently changes ports to use ‘iconv’ for input. Remaining tasks include doing it for output, and finding a solution for ‘scm_{to,from}_stringn’ so that it behaves in the same way wrt. to escapes and error handling. Comments, feedback, suggestions, and patches are all welcome! :-) Ludo’.