From: Linas Vepstas
Newsgroups: gmane.lisp.guile.devel,gmane.lisp.guile.bugs
Subject: UTF-8 regression in guile 1.9.5
Date: Sun, 6 Dec 2009 12:43:03 -0600
Message-ID: <3ae3aa420912061043y12a33f27ia3d2c298812ee358@mail.gmail.com>
Reply-To: linasvepstas@gmail.com
To: Guile Development, bug-guile@gnu.org

Hi,

I seem to see either a regression in guile-1.9.5 with regard to
UTF-8 strings, or at least some sort of incompatible change.

In guile-1.8.6, I am able to do the following:

    SCM new_node (SCM sname)
    {
        char * cname = scm_to_locale_string(sname);
        printf ("The name is %s\n", cname);
        free (cname);
        return SCM_EOL;
    }

    scm_c_define_gsubr("new-node", 1, 0, 0, new_node);

Then, from the guile prompt, I can evaluate the following:

    (new-node "てみました。")

and get the output

    "The name is てみました。"

However, in guile-1.9.5, the above gives me (non-printing characters
written here as \uXXXX escapes):

    "The name is ã\u0081¦ã\u0081¿ã\u0081¾ã\u0081\u0097ã\u0081\u009Fã\u0080"

Now, it is very possible that I've forgotten to say

    (use-modules some-new-utf8-module)

but I am unclear on what that module is (and why it's not
specified by default).

In both cases, my shell has:

    LANG=en_US.UTF-8

--linas
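
A note on the byte pattern in the 1.9.5 output: it looks like what you get when each byte of the UTF-8 input is treated as a separate Latin-1 character and then encoded as UTF-8 again. The sketch below is only an illustration of that byte transformation, not Guile code and not a claim about where inside Guile it happens; the helper name latin1_to_utf8 is made up for this example.

```c
#include <stddef.h>

/* Hypothetical helper, for illustration only: treat every input byte as a
   Latin-1 code point (U+0000..U+00FF) and write its UTF-8 encoding to out.
   out must have room for 2 * n bytes. Returns the number of bytes written. */
static size_t latin1_to_utf8(const unsigned char *in, size_t n,
                             unsigned char *out)
{
    size_t j = 0;
    for (size_t i = 0; i < n; i++) {
        if (in[i] < 0x80) {
            /* ASCII range: one byte, unchanged. */
            out[j++] = in[i];
        } else {
            /* U+0080..U+00FF: two-byte UTF-8 sequence. */
            out[j++] = (unsigned char)(0xC0 | (in[i] >> 6));
            out[j++] = (unsigned char)(0x80 | (in[i] & 0x3F));
        }
    }
    return j;
}
```

Feeding it the three bytes E3 81 A6 (the UTF-8 encoding of the first character of the test string) yields the six bytes C3 A3 C2 81 C2 A6, which is how the 1.9.5 output starts. That would be consistent with the string being decoded as Latin-1 somewhere on the way in and written out as UTF-8, but this is an inference from the bytes alone.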