From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Paul Eggert Newsgroups: gmane.emacs.devel Subject: Re: etags test is broken on MS-Windows Date: Fri, 22 May 2015 14:26:27 -0700 Organization: UCLA Computer Science Department Message-ID: <555F9F03.7050506@cs.ucla.edu> References: <83y4kmdjmj.fsf@gnu.org> <555A8E62.7060700@cs.ucla.edu> <83h9r8egen.fsf@gnu.org> <83pp5t6gex.fsf@gnu.org> <555E09AE.9070208@cs.ucla.edu> <83lhgh6fb2.fsf@gnu.org> <555E2C10.4010501@cs.ucla.edu> <83h9r5670s.fsf@gnu.org> <555E6A15.8010404@cs.ucla.edu> <831ti957wp.fsf@gnu.org> <83pp5s4uml.fsf@gnu.org> <555F740D.4030304@cs.ucla.edu> <837fs04egz.fsf@gnu.org> <87oalco1mg.fsf@igel.home> <83y4kg2yjc.fsf@gnu.org> <83wq002yct.fsf@gnu.org> <87fv6oo0h3.fsf@igel.home> <83vbfk2x9l.fsf@gnu.org> <87bnhcnym6.fsf@igel.home> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1432330012 19996 80.91.229.3 (22 May 2015 21:26:52 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 22 May 2015 21:26:52 +0000 (UTC) Cc: pot@gnu.org, emacs-devel@gnu.org To: Andreas Schwab , Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri May 22 23:26:44 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YvuSo-0007FM-UK for ged-emacs-devel@m.gmane.org; Fri, 22 May 2015 23:26:43 +0200 Original-Received: from localhost ([::1]:36003 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YvuSn-0000ir-3Y for ged-emacs-devel@m.gmane.org; Fri, 22 May 2015 17:26:41 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:39427) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YvuSk-0000ih-NB for emacs-devel@gnu.org; Fri, 22 May 2015 17:26:39 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YvuSj-0001r6-P7 for emacs-devel@gnu.org; Fri, 22 May 2015 17:26:38 -0400 Original-Received: from smtp.cs.ucla.edu ([131.179.128.62]:34456) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YvuSf-0001pL-5V; Fri, 22 May 2015 17:26:33 -0400 Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp.cs.ucla.edu (Postfix) with ESMTP id F0BD0A60013; Fri, 22 May 2015 14:26:27 -0700 (PDT) X-Virus-Scanned: amavisd-new at smtp.cs.ucla.edu Original-Received: from smtp.cs.ucla.edu ([127.0.0.1]) by localhost (smtp.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 7OwTqzZWi6Ii; Fri, 22 May 2015 14:26:27 -0700 (PDT) Original-Received: from Penguin.CS.UCLA.EDU (Penguin.CS.UCLA.EDU [131.179.64.200]) by smtp.cs.ucla.edu (Postfix) with ESMTPSA id 4E9F8A60010; Fri, 22 May 2015 14:26:27 -0700 (PDT) User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 In-Reply-To: <87bnhcnym6.fsf@igel.home> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 131.179.128.62 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:186746 Archived-At: On 05/22/2015 01:30 PM, Andreas Schwab wrote: >> then those will also be wrong after decoding non-ASCII characters, >> >unless the file was encoded in UTF-8 to begin with, right? > Yes, of course. Emacs will have to cope. > Andreas is right, as usual. TAGS should contain hard info about file contents, not guesswork about what Emacs's internal encoding might be, as the latter depends on user input. If the input file is UTF-8 and isn't munged by CRLF removal etc., file byte offsets should equal buffer byte offsets. If not, it's up to Emacs to map the hard info to its internal representation.