From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Yuri Khan Newsgroups: gmane.emacs.devel Subject: Re: Human-readable file sorting Date: Mon, 22 Feb 2016 01:27:23 +0600 Message-ID: References: <87povs41xg.fsf@gnus.org> <87bn7c3yms.fsf@gnus.org> <83si0npxtn.fsf@gnu.org> <87si0nlirx.fsf@gnus.org> <8360xjpq91.fsf@gnu.org> <87oabbli5g.fsf@gnus.org> <87k2lzpijs.fsf@gmail.com> <878u2ekdpq.fsf@gnus.org> <87h9h2pg0b.fsf@gmail.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1456082881 32636 80.91.229.3 (21 Feb 2016 19:28:01 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 21 Feb 2016 19:28:01 +0000 (UTC) Cc: Lars Ingebrigtsen , Eli Zaretskii , Emacs developers To: Alexis Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Feb 21 20:28:01 2016 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1aXZfk-0000Qr-IN for ged-emacs-devel@m.gmane.org; Sun, 21 Feb 2016 20:28:00 +0100 Original-Received: from localhost ([::1]:43546 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aXZfj-0005zv-Od for ged-emacs-devel@m.gmane.org; Sun, 21 Feb 2016 14:27:59 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:47680) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aXZfV-0005za-ET for emacs-devel@gnu.org; Sun, 21 Feb 2016 14:27:46 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aXZfU-0004Il-JP for emacs-devel@gnu.org; Sun, 21 Feb 2016 14:27:45 -0500 Original-Received: from mail-lf0-x22a.google.com ([2a00:1450:4010:c07::22a]:35327) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aXZfU-0004Ih-Al; Sun, 21 Feb 2016 14:27:44 -0500 Original-Received: by mail-lf0-x22a.google.com with SMTP id l143so81884889lfe.2; Sun, 21 Feb 2016 11:27:44 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc:content-type:content-transfer-encoding; bh=MvM2lkZMr9ErEaDXL/IIs1ji+H/ZGn2TOQ0t9RovdFo=; b=CFQZ7cwPvZZrPLHmaVMnhtOqYy01xo8WMEz3Gd1nmbh1xqIUd+J2fv+C3cmp4ByzmZ c9tAn/2+tRH6WanM6OfECCt5447vwptebuxUVP0GapkTk0Ytul8XIJnRD+dJt0xxKMno fJFFciRvDNRrotrZgXQT6tJcUY4IV3kGok/dzxbHxk5mOnDf5FfMfkdvgWe3JEG5jAFc 8uxtS0eKMGLviCjK+GrREHSN9NiR5OLX8bPmDlcikMir+MqIxn+tL4hGsgpUdvoGVWGB 0WGO4SAJruWlnbGVSGqHRgohsxCneUqR/rEQkZcfoHJIPoG9xGlRJKOsTLGoTn4kh9GK c19A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to:cc:content-type :content-transfer-encoding; bh=MvM2lkZMr9ErEaDXL/IIs1ji+H/ZGn2TOQ0t9RovdFo=; b=RjUSxi231O0v8Ca1HFgayDDFHl3qL0Y2eKNuGgF5kIJhEXdl37bkONSa/aJmZIQax6 NuwVHU6DaYc/90XtiBvsoLJuzwUcucRcmGwJt6Te3mDpNt8Ms3Y9QgaLfFLMue7qMd7J rGSM7fMfGhffGzJiHEiP+IqwAAiBXR0MylFde+r2+Sx/9eQFqHxIwekbYGDG4ywjO3GG /LjqyYfyjQYjpR+AhlcoaYdX4FTRJseQVzsFfXWqWXpw80IVepG10/ZNJeBnmH34X6Ps +yIwjRvg/5RgCaUpzZel7WX3e+mdXFRqA5fnk+go92l1+FvhuroCyI16SWeTzW5dWOd3 CXHQ== X-Gm-Message-State: AG10YOTWkO49vocqOxRC0ACErSe4Wl5gio3yZX3z/AaMWChAtKViSBjPSOH1gXMdk1YInyHVYWnV0CCpQyWD4g== X-Received: by 10.25.21.90 with SMTP id l87mr8848891lfi.64.1456082863469; Sun, 21 Feb 2016 11:27:43 -0800 (PST) Original-Received: by 10.112.239.42 with HTTP; Sun, 21 Feb 2016 11:27:23 -0800 (PST) In-Reply-To: <87h9h2pg0b.fsf@gmail.com> X-Google-Sender-Auth: d5mtfhdUr4s1eaF5PJLVQZKg5og X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2a00:1450:4010:c07::22a X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:200398 Archived-At: On Sun, Feb 21, 2016 at 3:30 PM, Alexis wrote: > Off the top of my head, that sounds like a good option to me (and i assum= e > by 'unicode' you mean "sort by Unicode codepoint"?); but perhaps there ar= e a > number of possible issues with this approach that i'm not aware of .... Sorting lexicographically by codepoint has the issue that it is not consistent with composition equivalence. E.g. U+00E0 =E2=80=9CLatin small letter a with acute=E2=80=9D is equivalent to U+0061 U+0301 (=E2=80=9CLatin= small letter a=E2=80=9D followed by =E2=80=9CCombining acute accent=E2=80=9D), bu= t codepoint sorting will sort them far away.