From: Klaus-Dieter Bauer
Newsgroups: gmane.emacs.devel
Subject: Re: Passing unicode filenames to start-process on Windows?
Date: Fri, 8 Jan 2016 21:01:23 +0100
References: <83si2a3cuo.fsf@gnu.org> <83h9ip2xdg.fsf@gnu.org> <834meoqvkj.fsf@gnu.org>
In-Reply-To: <834meoqvkj.fsf@gnu.org>
To: Eli Zaretskii
Cc: emacs-devel@gnu.org

2016-01-08 10:17 GMT+01:00 Eli Zaretskii <eliz@gnu.org>:

> > From: Klaus-Dieter Bauer <bauer.klaus.dieter@gmail.com>
> > Date: Fri, 8 Jan 2016 00:31:38 +0100
> > Cc: emacs-devel@gnu.org
> >
> > - I want at some point to write an incremental backup utility
> >   that uses md5sum to identify renamed files. Since precompiled
> >   Windows binaries are 32-bit, only the first 512MB of any given
> >   file are accessible to elisp however, so I wanted to use
> >   GnuWin32's md5sum.exe (but it turns out that it doesn't
> >   support Unicode filenames anyway).
>
> Emacs 25 can be built --with-wide-int, in which case the 512MB limit
> goes up to almost 2GB.  I believe the precompiled binaries of the next
> version will use this configure-time option.  So maybe this is still a
> relevant alternative for you.
>
> > - I want to verify a convention where filenames should mirror
> >   the metadata in my music library. Here I intended to write
> >   an elisp tool (for easy interactive processing in Emacs)
> >   and tried to use ffmpeg (which does support Unicode filenames
> >   in cmd.exe).
>
> You could have Emacs write a batch file that invokes ffmpeg with those
> Unicode file names (encoded in UTF-16, of course, not UTF-8!), and
> then run the batch file as the sub-process.  Will that work for you?
>

It will, though I'll probably stick with the pipe version where it's
possible. It is the more portable solution (in case I use the script
on a Linux system).

> > I checked and both tools allow reading the input data from
> > a pipe (`type UNICODE.mp3 | ffmpeg -i - ...` or `md5sum`
> > respectively), so that workaround is applicable to all my use cases.
>
> Yes, that's another possibility.
>
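To make the pipe workaround concrete, here is a minimal sketch of the idea, written in Python rather than elisp so it can be run outside Emacs. It is an illustration only and not taken from the thread: the function name `md5_via_pipe` and the stand-in command `HASH_FROM_STDIN` are hypothetical. The parent process opens the file itself and feeds the bytes to the child's standard input, so the file name never appears on the child's command line.

```python
import hashlib
import subprocess
import sys

# Stand-in for an external tool that reads its input from stdin, such as
# md5sum invoked without a file argument. The Python interpreter is used
# here only so the sketch runs on systems without md5sum installed
# (an assumption made for illustration).
HASH_FROM_STDIN = [
    sys.executable,
    "-c",
    "import hashlib, sys; "
    "print(hashlib.md5(sys.stdin.buffer.read()).hexdigest())",
]


def md5_via_pipe(path, command=HASH_FROM_STDIN):
    """Hash the file at `path` by piping its contents to `command`.

    Only the parent process ever sees the file name, so the child does
    not need to cope with non-ASCII names in its argument list.
    """
    with open(path, "rb") as source:
        result = subprocess.run(
            command, stdin=source, stdout=subprocess.PIPE, check=True
        )
    # Tools like md5sum print "<digest>  -"; keep only the first field.
    return result.stdout.decode("ascii").split()[0]
```

With a real tool, `command` would presumably be something like `["md5sum"]`, or an ffmpeg invocation using `-i -` as in the quoted message. The trade-off is that the child reads a stream rather than a seekable file, which is fine for hashing but may matter for tools that need to seek in their input.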

