From: Uwe Brauer
Newsgroups: gmane.emacs.devel
Subject: Re: sort-lines including non ASCII
Date: Thu, 07 Jul 2016 07:41:03 +0000
Message-ID: <87wpkxx5dc.fsf@mat.ucm.es>
References: <87bn2b6buh.fsf@mat.ucm.es> <83zipun8cf.fsf@gnu.org>
To: emacs-devel@gnu.org
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux)

> Because you are thinking Spanish, I presume. Emacs by default is not
> sensitive to the current locale or language, when it compares strings,
> and instead does that in binary order of the characters' Unicode
> codepoints. The advantage is that the order comes out the same in any
> locale.

Hm, I just ran an experiment with Hebrew, with and without niqqud, and indeed

בית אבא אוויר

is sorted correctly, and so is

אוויר בית אַבָא

So the niqqud does not influence the sorting, but the accent in Spanish does.
Most likely Unicode is the culprit here, but it is counterintuitive.
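
To make the codepoint comparison from the quoted paragraph concrete, here is a small sketch in Python, whose default string comparison is also by code point. It is not Emacs code, and it does not show how sort-lines is implemented. The Spanish words are made-up samples, and I am assuming the pointed word is spelled alef, patah, bet, qamats, alef. The helper strip_marks is a hypothetical name for an accent-insensitive sort key; it is a different technique from locale-aware collation.

```python
import unicodedata

# Hebrew test words written as escapes so the code points are explicit.
BAYIT = "\u05d1\u05d9\u05ea"                    # bet, yod, tav
ABA = "\u05d0\u05d1\u05d0"                      # alef, bet, alef
AVIR = "\u05d0\u05d5\u05d5\u05d9\u05e8"         # alef, vav, vav, yod, resh
# Assumed spelling of the pointed word: alef, patah, bet, qamats, alef.
ABA_POINTED = "\u05d0\u05b7\u05d1\u05b8\u05d0"


def codepoint_sort(words):
    """Sort by raw code point order (Python's default for str)."""
    return sorted(words)


def strip_marks(word):
    """Hypothetical sort key: drop combining marks after NFD decomposition."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Made-up Spanish sample; the accented a is U+00E1, which is above "z" (U+007A).
spanish = ["zorro", "\u00e1rbol", "casa"]

by_codepoint = codepoint_sort(spanish)           # accented word lands last
by_base_letter = sorted(spanish, key=strip_marks)  # accented word lands first

# Hebrew points (U+05B0..U+05C7) lie below the letters (U+05D0..U+05EA),
# so in this sample the pointed word still comes out first.
hebrew_plain = codepoint_sort([BAYIT, ABA, AVIR])
hebrew_pointed = codepoint_sort([AVIR, BAYIT, ABA_POINTED])
```

In this sample the niqqud makes no visible difference because the points have lower code points than the letters, while the precomposed accented Latin letters sit above "z". With other words a point could still change the order, so this only explains the three words tested here.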