From mboxrd@z Thu Jan 1 00:00:00 1970
From: Linas Vepstas
Newsgroups: gmane.lisp.guile.bugs
Subject: bug#25397: guile-2.2 regression in utf8 support in scm_puts scm_lfwrite scm_c_put_string
Date: Sun, 8 Jan 2017 12:16:23 -0600
Reply-To: linasvepstas@gmail.com
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
To: 25397@debbugs.gnu.org
X-GNU-PR-Message: report 25397
X-GNU-PR-Package: guile
List-Id: "Bug reports for GUILE, GNU's Ubiquitous Extension Language"
Xref: news.gmane.org gmane.lisp.guile.bugs:8500

There appears to be a regression in guile-2.2 with utf8 handling in the
scm_puts(), scm_lfwrite() and scm_c_put_string() functions. In guile-2.0,
one could give these utf8-encoded strings, and these would display just
fine. In 2.2 they get mangled.

The source of the mangling seems to be an assumption that these three
are being given latin1 strings, which they then attempt to convert to
utf8, thus wrecking the encoding. See, e.g., libguile/ports.c line 3526.

Presumably this change was intentional, but I don't understand why;
guile-2.0 seems utf-8 clean, correctly handling utf-8 in essentially all
cases. Why would one want to go back to the bad old days of latin1 and
iso-8859-1 for guile 2.2?

I could submit a patch for this, but would it be wanted?

Test case is straightforward:

    printf("duuude port-encoding is=%s\n",
           scm_to_utf8_string(scm_port_encoding(scm_current_output_port ())));
    scm_puts ("係 拉 丁 字 母", scm_current_output_port ());

which works in guile-2.0 but is garbled in 2.2.
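
To make the suspected mangling concrete, here is a minimal standalone sketch of what happens when bytes that are already utf8 are treated as latin1 and converted to utf8 again. This is not the code in libguile/ports.c; latin1_to_utf8 is a hypothetical helper written only for illustration, and it assumes the conversion is a plain byte-by-byte latin1 to utf8 re-encoding.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper, NOT part of libguile: treat every input byte as a
   latin1 code point and encode it as utf8.  Returns a malloc'd,
   NUL-terminated string that the caller must free, or NULL if the
   allocation fails. */
char *latin1_to_utf8(const char *in)
{
    size_t n = strlen(in);
    char *out = malloc(2 * n + 1);   /* each byte grows to at most 2 bytes */
    char *p = out;

    if (out == NULL)
        return NULL;

    for (size_t i = 0; i < n; i++) {
        unsigned char c = (unsigned char)in[i];
        if (c < 0x80) {
            /* ASCII passes through unchanged. */
            *p++ = (char)c;
        } else {
            /* 0x80..0xFF becomes a two-byte utf8 sequence. */
            *p++ = (char)(0xC0 | (c >> 6));
            *p++ = (char)(0x80 | (c & 0x3F));
        }
    }
    *p = '\0';
    return out;
}
```

Passing in the three utf8 bytes of 係 (E4 BF 82) gives back six bytes, C3 A4 C2 BF C2 82. A utf8 terminal renders those as "ä¿" followed by an invisible control character, which is the kind of garbling one would expect if the output functions assume latin1 input.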