From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: Usage of standard-display-table in MSDOS Date: Thu, 02 Sep 2010 21:32:21 +0900 Message-ID: References: <83aao8mjzx.fsf@gnu.org> <837hjcm9cw.fsf@gnu.org> <83y6brkxqe.fsf@gnu.org> <201009012333.o81NXrRq016732@beta.mvs.co.il> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: dough.gmane.org 1283430759 31864 80.91.229.12 (2 Sep 2010 12:32:39 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Thu, 2 Sep 2010 12:32:39 +0000 (UTC) Cc: eliz@gnu.org, emacs-devel@gnu.org To: ehud@unix.mvs.co.il Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Sep 02 14:32:37 2010 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Or8y5-0003q6-JT for ged-emacs-devel@m.gmane.org; Thu, 02 Sep 2010 14:32:37 +0200 Original-Received: from localhost ([127.0.0.1]:35055 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Or8y4-0001lK-RN for ged-emacs-devel@m.gmane.org; Thu, 02 Sep 2010 08:32:36 -0400 Original-Received: from [140.186.70.92] (port=60098 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Or8xy-0001i6-KR for emacs-devel@gnu.org; Thu, 02 Sep 2010 08:32:31 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1Or8xx-0005VZ-Ha for emacs-devel@gnu.org; Thu, 02 Sep 2010 08:32:30 -0400 Original-Received: from mx1.aist.go.jp ([150.29.246.133]:44642) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or8xv-0005UO-DY; Thu, 02 Sep 2010 08:32:27 -0400 Original-Received: from rqsmtp2.aist.go.jp (rqsmtp2.aist.go.jp [150.29.254.123]) by mx1.aist.go.jp with ESMTP id o82CWMKd012020; Thu, 2 Sep 2010 21:32:23 +0900 (JST) env-from (handa@m17n.org) Original-Received: from smtp4.aist.go.jp by rqsmtp2.aist.go.jp with ESMTP id o82CWMMS017711; Thu, 2 Sep 2010 21:32:22 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp4.aist.go.jp with ESMTP id o82CWLRa025872; Thu, 2 Sep 2010 21:32:21 +0900 (JST) env-from (handa@m17n.org) Original-Received: from handa by etlken with local (Exim 4.71) (envelope-from ) id 1Or8xp-0001U3-EP; Thu, 02 Sep 2010 21:32:21 +0900 In-Reply-To: <201009012333.o81NXrRq016732@beta.mvs.co.il> (ehud@unix.mvs.co.il) X-detected-operating-system: by eggs.gnu.org: Solaris 9 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:129585 Archived-At: In article <201009012333.o81NXrRq016732@beta.mvs.co.il>, "Ehud Karni" writes: > Problem 1: > On text terminal the language environment has great influence on the > use of the display table - characters not it the language - are > always displayed as ? . So in the "C" locale, all characters > 127 > are displayed as ?. > In the "he_IL" locale (= ISO-8859-8) characters in the range > 191-223 and 251-255 are displayed as ?. > In the "en_GB" locale (= ISO-8859-1) the Hebrew characters (#x5D0- > #x5EA) are displayed as ?. > I really must use the "he_IL" because most of the file my users view > are in ISO-8859-8 and a small part have MSDOS Hebrew (#x80-#x9A), but > I want to see all the characters (#xB0-#xDF) literally (i.e. when a > byte in this range is displayed, its 8 bit value should be sent to > the terminal. I've thought that you are reading files by find-file-literally (thus buffers are unibyte) because you wrote below at first: > I had this code in Emacs 21.3: > > (defun set-standard-display-table () > (setq standard-display-table (make-display-table)) > (standard-display-8bit 127 254)) > > I then set the DOS Hebrew chars (128-144) each to a vector: > [ 169 ] > > Then visit a file (literally). But, as you wrote "Problem 2: When I use `find-file-literally' ...", the "Problem 1" is the case that you don't use `find-file-literally', and a file is read into a multibyte buffer decoded by some coding-system, right? Then, in he_IL locale, by which coding-system your file is decoded? C-h C RET shows that coding-system near the top under the line "Coding system for saving this buffer:". And I don't understand this part. > I then set the DOS Hebrew chars (128-144) each to a vector: > [ 169 ] 169 is not a "UNIX Hebrew char", i.e. not a Unicode character code of a Hebrew char, nor a code-point of a Hebrew character in iso-8859-8 character set. In your mails, you mixup: (1) a code-point in a specific character set, (2) a character code in Emacs (that is Unicode character code), (3) a byte represented by Emacs' 8-bit characters, and that makes it difficult to understand what exactly you are saying. Please write: For (1), "a character of code #xXX in XXX charset". For (2), just "U+XXXX". For (3), just "byte #xXX". And, you wrote "a small part have MSDOS Hebrew (#x80-#x9A)", but #x9a is 154, not 144. Is "144" above just a typo? Perhaps, the following is the best way to understand what you want: (1) You at first make sample files and give me them. (2) Tell me how you want read that file exactly. Just C-x C-f FILENAME RET, or M-x find-file-literally ...., or C-x C-m c no-convesion RET C-x C-f FILENAME RET, or ... (3) Show me how it should be displayed on a terminal by an image. --- Kenichi Handa handa@m17n.org