From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: etags test is broken on MS-Windows Date: Tue, 19 May 2015 18:27:44 +0300 Message-ID: <83h9r8egen.fsf@gnu.org> References: <83y4kmdjmj.fsf@gnu.org> <555A8E62.7060700@cs.ucla.edu> Reply-To: Eli Zaretskii NNTP-Posting-Host: plane.gmane.org X-Trace: ger.gmane.org 1432049297 31944 80.91.229.3 (19 May 2015 15:28:17 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 19 May 2015 15:28:17 +0000 (UTC) Cc: emacs-devel@gnu.org To: Paul Eggert Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Tue May 19 17:28:08 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YujR9-00007J-V9 for ged-emacs-devel@m.gmane.org; Tue, 19 May 2015 17:28:08 +0200 Original-Received: from localhost ([::1]:46802 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YujR8-00031B-QV for ged-emacs-devel@m.gmane.org; Tue, 19 May 2015 11:28:06 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:53486) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YujR2-00030X-3A for emacs-devel@gnu.org; Tue, 19 May 2015 11:28:04 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YujQy-0005Nc-2x for emacs-devel@gnu.org; Tue, 19 May 2015 11:28:00 -0400 Original-Received: from mtaout20.012.net.il ([80.179.55.166]:52519) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YujQx-0005NE-SD for emacs-devel@gnu.org; Tue, 19 May 2015 11:27:56 -0400 Original-Received: from conversion-daemon.a-mtaout20.012.net.il by a-mtaout20.012.net.il (HyperSendmail v2007.08) id <0NOL00100RXTW100@a-mtaout20.012.net.il> for emacs-devel@gnu.org; Tue, 19 May 2015 18:27:54 +0300 (IDT) Original-Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout20.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0NOL001MOSAGTY20@a-mtaout20.012.net.il>; Tue, 19 May 2015 18:27:52 +0300 (IDT) In-reply-to: <555A8E62.7060700@cs.ucla.edu> X-012-Sender: halo1@inter.net.il X-detected-operating-system: by eggs.gnu.org: Solaris 10 X-Received-From: 80.179.55.166 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:186623 Archived-At: [Sorry, I didn't mean to discuss this in private, I just forgot to CC the list. Adding it now, and repeating my original message.] I wrote: > > Commit e0117b1 changed the new etags test suite in a way that makes it > > always be skipped on MS-Windows (and in general on any platform that > > doesn't have the 'locale' command or doesn't have a UTF-8 locale > > installed). > > > > I don't understand why a test suite needs to use UTF-8, but I don't > > really mind as long as the tests can run on all supported platforms. > > Can we fix the test to not require these features, please? And Paul answered: > Date: Mon, 18 May 2015 18:14:10 -0700 > From: Paul Eggert > > > I don't > > really mind as long as the tests can run on all supported platforms. > > Without that patch, the tests failed on my GNU/Linux host due to encoding > problems. See attached file I don't think it's due to encoding problem. (AFAIK, etags doesn't regard its input as characters, but as a stream of bytes.) I think it's due to DOS CR-LF EOL format of some files in the test suite. For example, the first file whose tags were different in your testing is dostorture.c, which has DOS EOLs, the second file, c.C, has a lone ^M character at the end of one of its lines, and so on. Could you please verify that this is indeed the source of the problem? (There's also an unrelated problem with the gzip-compressed file in f-src, which seems to be some Windows-specific glitch; I will look into it separately.) > > Can we fix the test to not require these features, please? > > I don't know what will work on MS-Windows, but I checked in a stab > at it. Thanks, it works now, but I have the same problems due to EOL format, and in the same files, just in reverse. If we agree that the problem is due to EOL format, we could try thinking about a solution. The root cause for the problem is that on Windows, etags accounts for the stripped CR characters, while on Unix it treats them as part of the contents, so the byte counts are offset by the number of the preceding lines. > If this fails, I suggest removing all the non-ASCII characters from > these test files and then regenerating the "good" data to match. I don't see this as necessary, not yet. Thanks.