From: Eli Zaretskii
To: Tassilo Horn
Cc: 19393@debbugs.gnu.org
Newsgroups: gmane.emacs.bugs
Subject: bug#19393: 25.0.50; Emacs cannot determine coding system of ISO-8859 encoded files
Date: Tue, 16 Dec 2014 18:05:38 +0200
Message-ID: <83oar3wpf1.fsf@gnu.org>
In-reply-to: <87sigfpqmx.fsf@thinkpad-t440p.tsdh.org>

> From: Tassilo Horn
> Date: Tue, 16 Dec 2014 16:21:10 +0100
>
>   ftp://ftp.fu-berlin.de/pub/misc/movies/database/movies.list.gz
>
> which contains all movies known to the international movie database
> (IMDb.com).  When I open that file using "emacs -Q movies.list.gz" (or
> unzip it first) and then do M-x describe-coding-system I can see that it
> is "t -- raw-text-unix".  As a result of this, the last movie in that
> file is displayed as "\374\347 (2012) 2012".
>
> However, according to the `file' command, the file is plain ISO-8859.

Looks like some kind of bug, although with such a large file, it's not
easy to be sure.

> I also can't force Emacs to use ISO-8859 for that or the original file.
> `C-x RET f iso-8859-15 RET' results in a query that certain characters
> cannot be encoded using latin-9, e.g., \374 and \347, and I'm expected
> to choose another encoding.

That's not how you force Emacs to use a specific encoding when visiting
a file.  You should do this instead:

  C-x RET c iso-8859-15 RET C-x C-f movies.list RET

IOW, revisit the file, forcing Emacs to decode it as ISO-8859-15.  (The
same works with the original compressed file.)
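
To illustrate the difference between decoding at visit time and encoding
at save time, here is a small sketch in Python rather than Emacs Lisp.  It
is not part of the original report: it assumes only that the two octal
escapes quoted above, \374 and \347, are the raw bytes of the movie title,
and the variable names are made up for the example.

```python
# The two raw bytes that the raw-text buffer showed as \374 and \347.
raw = bytes([0o374, 0o347])

# Decoding: interpret the bytes as ISO-8859-15 (Latin-9) characters.
# This is the direction that matters when a file is visited.
decoded = raw.decode("iso-8859-15")
print(decoded)  # prints: üç

# Encoding: turn characters back into bytes.
# This is the direction that matters when a buffer is saved.
reencoded = decoded.encode("iso-8859-15")
print(reencoded == raw)  # prints: True
```

Setting only the save-time encoding, as `C-x RET f' does, leaves the text
in the buffer as undecoded raw bytes, which would explain the query about
characters that cannot be encoded in latin-9.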