From: Marko Rauhamaa
Newsgroups: gmane.lisp.guile.user
Subject: Re: guile can't find a chinese named file
Date: Thu, 16 Feb 2017 13:04:42 +0200
Message-ID: <87h93uv1kl.fsf@elektro.pacujo.net>
In-Reply-To: <8760ka4drd.fsf@fencepost.gnu.org> (David Kastrup's message of "Thu, 16 Feb 2017 11:43:34 +0100")
To: David Kastrup
Cc: guile-user@gnu.org

David Kastrup:

> Marko Rauhamaa writes:
>> You probably cannot produce valid UTF-8 out of invalid UTF-8 snippets
>> with split(1). However split(1) does form filenames out of its
>> arguments by concatenation:
>>
>>    split --additional-suffix=suffix file prefix
>>
>> produces these kinds of filenames:
>>
>>
>
> I don't really get your point here. Why would you start with invalid
> UTF-8 sequences in the filenames?

There's nothing preventing such filenames from appearing on a Linux
system. They might come from a zip file with Latin-1-encoded names, for
example. I have files older than UTF-8 on my Linux system. I have files
encoded in Latin-3, for example.
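
To make the Latin-1 case concrete, here is a rough sketch in Python (not
Guile, and not taken from this thread). The name café.txt and the helper
functions are hypothetical, chosen only for illustration. It shows that
the bytes of a Latin-1-encoded name are not valid UTF-8, and that a
lenient decoding (Python's "surrogateescape" error handler) can still
carry such a name through a program without losing the original bytes.

```python
# Hypothetical file name, encoded the way a pre-UTF-8 (Latin-1) system
# would have stored it on disk: b'caf\xe9.txt'.
latin1_name = "café.txt".encode("latin-1")


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def to_text(raw: bytes) -> str:
    """Decode name bytes leniently.

    Bytes that are not valid UTF-8 are mapped to lone surrogate code
    points instead of raising an exception.
    """
    return raw.decode("utf-8", errors="surrogateescape")


def to_bytes(text: str) -> bytes:
    """Inverse of to_text: restore the exact original bytes."""
    return text.encode("utf-8", errors="surrogateescape")


if __name__ == "__main__":
    # The Latin-1 byte 0xE9 is not followed by UTF-8 continuation bytes.
    print(is_valid_utf8(latin1_name))
    # The lenient round trip preserves the name byte for byte.
    print(to_bytes(to_text(latin1_name)) == latin1_name)
```

A program that decodes names strictly would raise an error on the first
such file it meets; one that decodes leniently can still list, open, or
remove the file because the original bytes survive the round trip.
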
Worst of all, they might be part of an attack on your system. For
example, files whose names contain invalid UTF-8 could evade file
listing altogether, might make your program crash in unexpected ways,
or might be impossible for you to remove.


Marko