unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
From: "Marc Wilhelm Küster" <kuester@saphor.net>
Cc: bug-gnu-emacs@gnu.org
Subject: Re: UTF-8 related display problem
Date: Mon, 07 Oct 2002 09:28:32 +0200	[thread overview]
Message-ID: <5.1.0.14.2.20021007091714.03334510@pop.puretec.de> (raw)
In-Reply-To: <7263-Sun06Oct2002212601+0200-eliz@is.elta.co.il>


> > Opening a largish UTF-8-encoded text file (ca. 800 kb) with Latin, Greek
> > and Hebrew passages in it causes emacs to stop displaying the text about
> > halfway through the text. It is impossible to navigate beyond that break.
> > Shortening or lengthening the text does only move slighlty the point where
> > the text display stops.
> >
> > The break seems always to be in non-Latin text.
> >
> > The file displays without problem in other UTF-8-aware applications, so 
> the
> > UTF-8 itself should be correct.
>
>Are you sure you have the necessary fonts installed?  The list of
>places where you can download Unicode fonts can be found in the file
>INSTALL in the Emacs distribution.

Thanks for the reply!

Yes, the necessary fonts are installed and the text, when extracted into 
another buffer, even displays correctly. Furthermore, saving the file 
actually shortens it to the point where the display ended, something that 
should never happen with pure display problems. It looks to me rather like 
an input stream problem of sorts (though a strange one, since splitting the 
file into parts and work with those parts is a way to get around the problem).

I have checked the UTF-8 by parsing it with Java's InputStreamReader in 
UTF-8 mode, but no problems whatsoever.

However, I cannot reconstruct the problem with any other file. I generated 
for this purpose a list of all existing Unicode characters, all in 
combination with a combining acute, and, except for the documented issue of 
characters bigger than U33FF and smaller than UE200, I could not spot anything.

The file in question contains data that should not be widely circulated. Is 
it possible that you can have a look at the problem and then delete the 
file afterwards?

Best regards,

Marc Küster


*************************
Marc Wilhelm Küster
Saphor GmbH

Fronländer 22
D-72072 Tübingen

Tel.: (+49) / (0)7472 / 949 100
Fax: (+49) / (0)7472 / 949 114

  reply	other threads:[~2002-10-07  7:28 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-10-05 20:36 UTF-8 related display problem Marc Wilhelm Küster
2002-10-06 18:26 ` Eli Zaretskii
2002-10-07  7:28   ` Marc Wilhelm Küster [this message]
2002-10-07 14:32     ` Eli Zaretskii
     [not found]     ` <5.1.0.14.2.20021007210058.031a7818@pop.puretec.de>
2002-10-08  1:09       ` Kenichi Handa
2002-10-10  9:12         ` Marc Wilhelm Küster

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5.1.0.14.2.20021007091714.03334510@pop.puretec.de \
    --to=kuester@saphor.net \
    --cc=bug-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).