From: "Marc Wilhelm Küster" <kuester@saphor.net>
Cc: bug-gnu-emacs@gnu.org
Subject: Re: UTF-8 related display problem
Date: Mon, 07 Oct 2002 09:28:32 +0200
Message-ID: <5.1.0.14.2.20021007091714.03334510@pop.puretec.de>
In-Reply-To: <7263-Sun06Oct2002212601+0200-eliz@is.elta.co.il>
> > Opening a largish UTF-8-encoded text file (ca. 800 kb) with Latin, Greek
> > and Hebrew passages in it causes emacs to stop displaying the text about
> > halfway through the text. It is impossible to navigate beyond that break.
> > Shortening or lengthening the text only slightly moves the point where
> > the text display stops.
> >
> > The break seems always to be in non-Latin text.
> >
> > The file displays without problem in other UTF-8-aware applications, so the
> > UTF-8 itself should be correct.
>
>Are you sure you have the necessary fonts installed? The list of
>places where you can download Unicode fonts can be found in the file
>INSTALL in the Emacs distribution.
Thanks for the reply!
Yes, the necessary fonts are installed, and the text even displays correctly
when extracted into another buffer. Furthermore, saving the file actually
truncates it at the point where the display ended, something that should
never happen with a pure display problem. It looks to me more like an input
stream problem of some sort (though a strange one, since splitting the file
into parts and working with those parts gets around the problem).
I have checked the UTF-8 by parsing it with Java's InputStreamReader in
UTF-8 mode, and it reported no problems whatsoever.
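For reference, a strict check of this kind can be sketched as follows. This is only a minimal sketch, not the code that was actually run against the file: the class name Utf8Check and the method isValidUtf8 are made up for illustration, and it uses the current java.nio.charset API. One caveat: an InputStreamReader created from a charset name silently replaces malformed byte sequences with U+FFFD instead of failing, so the sketch passes an explicit CharsetDecoder configured with CodingErrorAction.REPORT.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper for illustration; not part of Emacs or of the original report.
class Utf8Check {

    /** Returns true if the whole stream decodes as UTF-8 without a malformed sequence. */
    static boolean isValidUtf8(InputStream in) throws IOException {
        // REPORT makes the decoder raise an error instead of substituting U+FFFD.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try (Reader reader = new InputStreamReader(in, decoder)) {
            char[] buf = new char[8192];
            while (reader.read(buf) != -1) {
                // Decoded characters are discarded; only decoding errors matter here.
            }
            return true;
        } catch (CharacterCodingException e) {
            // Thrown for malformed input because of the REPORT setting above.
            return false;
        }
    }

    public static void main(String[] args) throws IOException {
        // args[0] is the path of the file to check.
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(isValidUtf8(in) ? "valid UTF-8" : "malformed UTF-8");
        }
    }
}
```

A file that passes such a strict check contains well-formed UTF-8, so any remaining problem would lie in how the application reads or decodes it, not in the bytes themselves.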
However, I cannot reproduce the problem with any other file. As a test, I
generated a list of all existing Unicode characters, each followed by a
combining acute, and, apart from the documented issue with characters above
U+33FF and below U+E200, I could not spot anything wrong.
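A test file of that kind could be generated roughly as follows. This is a sketch under assumptions, not the generator that was actually used: the class name AcuteTestFile and the method writeAll are hypothetical, "existing" is taken to mean every code point the running JVM's Unicode tables consider assigned, control characters and surrogates are skipped, and each character is written on its own line followed by U+0301 COMBINING ACUTE ACCENT.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical generator for illustration; the one-pair-per-line layout is an assumption.
class AcuteTestFile {

    static final int COMBINING_ACUTE = 0x0301;

    /** Writes "<char><U+0301>\n" for each assigned code point; returns the number of lines. */
    static int writeAll(OutputStream out) throws IOException {
        Writer w = new OutputStreamWriter(out, StandardCharsets.UTF_8);
        int count = 0;
        for (int cp = 0x20; cp <= Character.MAX_CODE_POINT; cp++) {
            int type = Character.getType(cp);
            // Skip code points that are unassigned, lone surrogates, or control characters.
            if (type == Character.UNASSIGNED
                    || type == Character.SURROGATE
                    || type == Character.CONTROL) {
                continue;
            }
            w.write(Character.toChars(cp)); // one or two UTF-16 units
            w.write(COMBINING_ACUTE);
            w.write('\n');
            count++;
        }
        w.flush();
        return count;
    }

    public static void main(String[] args) throws IOException {
        // args[0] is the path of the file to create.
        try (OutputStream out = Files.newOutputStream(Paths.get(args[0]))) {
            System.out.println(writeAll(out) + " lines written");
        }
    }
}
```

The set of characters produced depends on the Unicode version supported by the JVM, so the output will differ from a list generated in 2002.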
The file in question contains data that should not be widely circulated.
Would it be possible for you to have a look at the problem and then delete
the file afterwards?
Best regards,
Marc Küster
*************************
Marc Wilhelm Küster
Saphor GmbH
Fronländer 22
D-72072 Tübingen
Tel.: (+49) / (0)7472 / 949 100
Fax: (+49) / (0)7472 / 949 114