From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: bidi, hebrew: files may need to be utf-8 with a byte order mark Date: Fri, 04 Jan 2019 22:26:29 +0200 Message-ID: <831s5s2mq2.fsf@gnu.org> References: <87pntcuz0l.fsf@mat.ucm.es> <834lao2qyy.fsf@gnu.org> <87h8eociej.fsf@mat.ucm.es> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: blaine.gmane.org 1546633532 10017 195.159.176.226 (4 Jan 2019 20:25:32 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Fri, 4 Jan 2019 20:25:32 +0000 (UTC) Cc: emacs-devel@gnu.org To: Uwe Brauer Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Jan 04 21:25:28 2019 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gfW24-0002VR-7b for ged-emacs-devel@m.gmane.org; Fri, 04 Jan 2019 21:25:28 +0100 Original-Received: from localhost ([127.0.0.1]:38721 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gfW4B-0001EV-5n for ged-emacs-devel@m.gmane.org; Fri, 04 Jan 2019 15:27:39 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:37347) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gfW3Q-0001A8-NB for emacs-devel@gnu.org; Fri, 04 Jan 2019 15:26:53 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gfW3N-0004YQ-3w for emacs-devel@gnu.org; Fri, 04 Jan 2019 15:26:52 -0500 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:38039) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gfW3M-0004XV-VY; Fri, 04 Jan 2019 15:26:49 -0500 Original-Received: from [176.228.60.248] (port=3002 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1gfW3L-0002qW-AF; Fri, 04 Jan 2019 15:26:48 -0500 In-reply-to: <87h8eociej.fsf@mat.ucm.es> (message from Uwe Brauer on Fri, 04 Jan 2019 20:49:40 +0100) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:232174 Archived-At: > From: Uwe Brauer > Date: Fri, 04 Jan 2019 20:49:40 +0100 > > > I don't think I understand the question. If you ask whether Emacs > > supports UTF-8 encoding with BOM, then the answer is yes: see > > coding-system utf-8-with-signature. Emacs has supported this since > > long ago, not just on master. But this has no direct relation to > > bidi. > > Well I just included the remark from the keyman project. So BOM means > byte order mark, I presume and as I understand GNU emacs supports it. > > I recall some time ago some specific issue of the hebrew or better bidi > implementation, like some missing feature which was only included > recently (by this I mean ~25 or later. I call it recently since Ubuntu > still ships officially GNU emacs 24). If you mean support for changes introduced in Unicode 6.3, then Emacs supports that since v25.1. But again, what does that have to do with keyboard layouts? > I also thought that BOM is related to bidi No, it is related to automatic detection of the platform's endianness i.e. big-endian vs little-endian. > BTW is that a BOM ‏? > (insert-char 8207 1 t) No, the BOM is 0xFEFF, which is 65279 decimal.