From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: ludo@gnu.org (Ludovic =?iso-8859-1?Q?Court=E8s?=) Newsgroups: gmane.lisp.guile.devel Subject: Re: Unicode ports patch Date: Tue, 01 Sep 2009 21:34:59 +0200 Message-ID: <87r5uq5uho.fsf@gnu.org> References: <1251212782.18373.10335.camel@localhost.localdomain> <87hbvnnkl8.fsf@gnu.org> <935264.92888.qm@web37907.mail.mud.yahoo.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1251833747 17231 80.91.229.12 (1 Sep 2009 19:35:47 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 1 Sep 2009 19:35:47 +0000 (UTC) To: guile-devel@gnu.org Original-X-From: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Tue Sep 01 21:35:40 2009 Return-path: Envelope-to: guile-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1MiZ8m-00012U-0H for guile-devel@m.gmane.org; Tue, 01 Sep 2009 21:35:40 +0200 Original-Received: from localhost ([127.0.0.1]:50924 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MiZ8k-0004La-V8 for guile-devel@m.gmane.org; Tue, 01 Sep 2009 15:35:39 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1MiZ8c-0004KW-LT for guile-devel@gnu.org; Tue, 01 Sep 2009 15:35:30 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1MiZ8W-0004Bb-NX for guile-devel@gnu.org; Tue, 01 Sep 2009 15:35:29 -0400 Original-Received: from [199.232.76.173] (port=60122 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MiZ8W-0004BD-9x for guile-devel@gnu.org; Tue, 01 Sep 2009 15:35:24 -0400 Original-Received: from lo.gmane.org ([80.91.229.12]:52959) by monty-python.gnu.org with esmtps (TLS-1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1MiZ8V-0008Jh-Qa for guile-devel@gnu.org; Tue, 01 Sep 2009 15:35:24 -0400 Original-Received: from list by lo.gmane.org with local (Exim 4.50) id 1MiZ8T-0000vF-3I for guile-devel@gnu.org; Tue, 01 Sep 2009 21:35:21 +0200 Original-Received: from reverse-83.fdn.fr ([80.67.176.83]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 01 Sep 2009 21:35:21 +0200 Original-Received: from ludo by reverse-83.fdn.fr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 01 Sep 2009 21:35:21 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 56 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: reverse-83.fdn.fr X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 15 Fructidor an 217 de la =?iso-8859-1?Q?R=E9volutio?= =?iso-8859-1?Q?n?= X-PGP-Key-ID: 0xEA52ECF4 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 821D 815D 902A 7EAB 5CEE D120 7FBA 3D4F EB1F 5364 X-OS: x86_64-unknown-linux-gnu User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1 (gnu/linux) Cancel-Lock: sha1:EL5tl7S0rk5moqxwZScmOHRS/sU= X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: guile-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Developers list for Guile, the GNU extensibility library" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Errors-To: guile-devel-bounces+guile-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.lisp.guile.devel:9235 Archived-At: Hi! Mike Gran writes: >> On Tue 01 Sep 2009 10:19, ludo@gnu.org (Ludovic Courtès) writes: >> >> > Mike Gran writes: >> > >> >> The latest commit 'Add full Unicode capability to ports and the default >> >> reader' 889975e51accb80491af76fc5db980aeb3edd342 adds the majority of >> >> the functionality for non-ASCII strings.  >> > >> > This patch adds a few functions related to string ports: >> > >> >  * libguile/strports.c: store string ports in locale encoding >> >    (scm_strport_to_locale_u8vector, scm_call_with_output_locale_u8vector) >> >    (scm_open_input_locale_u8vector, scm_get_output_locale_u8vector): >> >    new functions >> > >> > I think it would be nicer if these used bytevectors instead of u8vectors >> > and were locale-independent (which would match the `string->utf8' & >> > co. API).  Also I would make `scm_strport_to_locale_u8vector ()' >> > private.  And finally, it'd be even better if it were documented in the >> > manual.  :-) > > I don't understand.  "it would be nicer if *these* ..." "These" was for "these functions". > "it would be nicer if these used bytevectors ... and were *locale-independent*" > > It would be nicer if string ports were actually bytevector ports, and that > they were locale-independent?  Or that scm_get_output_bytevector returned a > locale-independent (ergo 8-bit or 32-bit) vector? The latter. >> > Actually I'm not convinced that `call-with-output-locale-*' and >> > `open-input-locale-*' are useful, precisely because we can use a string >> > port to get a string and then `string->utf8' to get at the string bits. > > "We can use a string port to get a string" > > If we write to a string port and pop a result string? Yes, with `with-output-to-string' for instance. > "And then use string->utf8 to get at the string bits" > > And then convert the result string to a UTF-8 encoded bytevector? `string->utf8' returns a bytevector containing the UTF-8-encoded string it is passed. Thanks, Ludo'.