From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.bugs Subject: bug#12598: 24.2; utf-8 codepoints in doc-strings and compression of .el and .elc files Date: Thu, 31 Jan 2013 13:15:20 -0500 Message-ID: References: <877gr2b2in.fsf@Rainer.invalid> <87391qavzw.fsf@Rainer.invalid> <87bogdy1rv.fsf@Rainer.invalid> <877gr1y16v.fsf@Rainer.invalid> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1359656188 19281 80.91.229.3 (31 Jan 2013 18:16:28 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 31 Jan 2013 18:16:28 +0000 (UTC) Cc: 12598@debbugs.gnu.org To: Achim Gratz Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu Jan 31 19:16:47 2013 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1U0ygn-0000tP-9p for geb-bug-gnu-emacs@m.gmane.org; Thu, 31 Jan 2013 19:16:45 +0100 Original-Received: from localhost ([::1]:40277 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U0ygV-0003Lp-5n for geb-bug-gnu-emacs@m.gmane.org; Thu, 31 Jan 2013 13:16:27 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:49107) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U0ygS-0003Lc-Ae for bug-gnu-emacs@gnu.org; Thu, 31 Jan 2013 13:16:25 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1U0ygM-0005Qb-RM for bug-gnu-emacs@gnu.org; Thu, 31 Jan 2013 13:16:24 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:52459) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U0ygM-0005QT-N7 for bug-gnu-emacs@gnu.org; Thu, 31 Jan 2013 13:16:18 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1U0yh3-00058j-LO for bug-gnu-emacs@gnu.org; Thu, 31 Jan 2013 13:17:01 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Stefan Monnier Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 31 Jan 2013 18:17:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 12598 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 12598-submit@debbugs.gnu.org id=B12598.135965616819686 (code B ref 12598); Thu, 31 Jan 2013 18:17:01 +0000 Original-Received: (at 12598) by debbugs.gnu.org; 31 Jan 2013 18:16:08 +0000 Original-Received: from localhost ([127.0.0.1]:57923 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1U0ygC-00057S-0X for submit@debbugs.gnu.org; Thu, 31 Jan 2013 13:16:08 -0500 Original-Received: from chene.dit.umontreal.ca ([132.204.246.20]:54789) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1U0yg9-00057K-QQ for 12598@debbugs.gnu.org; Thu, 31 Jan 2013 13:16:07 -0500 Original-Received: from faina.iro.umontreal.ca (lechon.iro.umontreal.ca [132.204.27.242]) by chene.dit.umontreal.ca (8.14.1/8.14.1) with ESMTP id r0VIFLSp023782; Thu, 31 Jan 2013 13:15:21 -0500 Original-Received: by faina.iro.umontreal.ca (Postfix, from userid 20848) id 1679AB4182; Thu, 31 Jan 2013 13:15:21 -0500 (EST) In-Reply-To: <877gr1y16v.fsf@Rainer.invalid> (Achim Gratz's message of "Mon, 08 Oct 2012 07:36:56 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (gnu/linux) X-NAI-Spam-Flag: NO X-NAI-Spam-Threshold: 5 X-NAI-Spam-Score: 0 X-NAI-Spam-Rules: 1 Rules triggered RV4478=0 X-NAI-Spam-Version: 2.2.0.9309 : core <4478> : streams <898187> : uri <1333068> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:70535 Archived-At: > I've just removed some utf-8 codepoints from docstrings in org-mode > because when I compress either the source (.el.gz) or the resulting > byte-compiled file (.elc.gz), the loader fails after the first function I can't reproduce this problem for the .el.gz case (indeed, I think it's specific to byte-compiled files). > So, any codepoint that is more than a single byte will throw the > byte-compiler off, not just any utf-8 codepoint. Since this has been in > Emacs likely ever since unicode strings have been introduced, I'd > suggest adding a *strong* warning in some prominent place in the > documentation about this even when it gets fixed in a newer version of > Emacs. Otherwise it's all too easy to produce libraries that have > mysterious failures depending on whatever Emacs was used to compile or > run them. I think the problem lies between load-with-code-conversion and eval-buffer, so it dates back to the introduction of load-with-code-conversion, which IIRC predates the internal use of Unicode. Fixing `eval-buffer' so that it skips bytes when it sees #@NN is tricky, so the best fix is probably to change load-with-code-conversion so that (if the file is byte-compiled) it saves the buffer to a temp file and passes that to `load'. Stefan