From: Ilya Zakharevich <ilya@math.berkeley.edu>
To: Eli Zaretskii <eliz@gnu.org>
Cc: 19993@debbugs.gnu.org
Subject: bug#19993: 25.0.50; Unicode fonts defective on Windows
Date: Sat, 7 Mar 2015 23:41:58 -0800
Message-ID: <20150308074158.GA885@math.berkeley.edu>
In-Reply-To: <83lhj9mdgn.fsf@gnu.org>
On Sat, Mar 07, 2015 at 10:14:16AM +0200, Eli Zaretskii wrote:
> > What can it mean that a font “supports a script”?
> >
> > Theoretically, it may mean that
> > • it “knows” all the characters in the script, and
> > • has enough extra infrastructure to shape these characters
> > into a correct glyphic representation.
> >
> > I can see that the second part could be described by one bit per
> > script. But what about the first one? The repertoire of a script
> > changes every year (sometimes several times per year). How can this
> > be encapsulated into a bit?
>
> All I know about this is what the MSDN documentation says:
>
> FONTSIGNATURE structure
>
> Contains information identifying the code pages and Unicode subranges
> for which a given font provides glyphs.
> [...]
> Members
>
> fsUsb
>
> A 128-bit Unicode subset bitfield (USB) identifying up to 126
> Unicode subranges. Each bit, except the two most significant bits,
> represents a single subrange. The most significant bit is always 1
> and identifies the bitfield as a font signature; the second most
> significant bit is reserved and must be 0. Unicode subranges are
> numbered in accordance with the ISO 10646 standard. For more
> information, see Unicode Subset Bitfields.
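To make the quoted layout concrete, here is a minimal Python sketch of testing one subrange bit in such a 128-bit field. It assumes the field is given as four 32-bit words with bit n stored in word n // 32 at position n % 32, least significant first. The helper name usb_bit_set and the example value are hypothetical, used only for illustration.

```python
def usb_bit_set(fs_usb, bit):
    """Return True if subrange bit `bit` (0..127) is set.

    `fs_usb` is a sequence of four 32-bit integers; the word order
    (bit n in word n // 32) is an assumption of this sketch.
    """
    if not 0 <= bit < 128:
        raise ValueError("bit must be in 0..127")
    word, offset = divmod(bit, 32)
    return bool((fs_usb[word] >> offset) & 1)


# Hypothetical signature: only bit 9 (Cyrillic) plus the most
# significant "font signature" bit (bit 127) are set.
example = [1 << 9, 0, 0, 1 << 31]

print(usb_bit_set(example, 9))    # the Cyrillic subrange bit
print(usb_bit_set(example, 10))   # a neighbouring bit that is clear
print(usb_bit_set(example, 127))  # the most significant bit
```

Note that the check only says whether the bit is set; it says nothing about which characters of the subrange actually have glyphs.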
So these bits “identify” a subrange. Of course, nothing is said about
what this actually MEANS. So I ran an experiment on Cour.ttf.
The following subrange (bit 9) is “identified”:
  0400 - 04FF  Cyrillic
  0500 - 052F  Cyrillic Supplement
  2DE0 - 2DFF  Cyrillic Extended-A
  A640 - A69F  Cyrillic Extended-B
What is actually supported:
  0400 - 04FF  Everything but 04D8, 04D9 (Schwa, used in Cyrillic Azeri, although contemporary Azeri is written in Latin)
  0500 - 052F  Only 0500 - 0513 and 051A - 051D supported
  2DE0 - 2DFF  None supported (block added in Unicode 5.1)
  A640 - A69F  None supported (block added in Unicode 5.1 and later)
Looking in DerivedAge.txt:
04D0..04EB ; 1.1 # [28] CYRILLIC CAPITAL LETTER A WITH BREVE..CYRILLIC SMALL LETTER BARRED O WITH DIAERESIS
0500..050F ; 3.2 # [16] CYRILLIC CAPITAL LETTER KOMI DE..CYRILLIC SMALL LETTER KOMI TJE
0510..0513 ; 5.0 # [4] CYRILLIC CAPITAL LETTER REVERSED ZE..CYRILLIC SMALL LETTER EL WITH HOOK
0514..0523 ; 5.1 # [16] CYRILLIC CAPITAL LETTER LHA..CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK
So two characters of 1.1 are not supported, all characters of 3.2 and 5.0 are
supported, and part of 5.1 is supported.
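The tally above can be reproduced with a short Python sketch. It parses only the four DerivedAge.txt lines quoted here (with their comments shortened) and checks them against the supported code points reported in the experiment. The names parse_ranges, coverage_by_age and SUPPORTED are hypothetical, and the supported set is typed in from the list above rather than read from the font.

```python
# The DerivedAge.txt lines quoted above, with the trailing comments shortened.
DERIVED_AGE = """\
04D0..04EB ; 1.1 # [28]
0500..050F ; 3.2 # [16]
0510..0513 ; 5.0 # [4]
0514..0523 ; 5.1 # [16]
"""

# Code points reported as supported in the experiment above.
SUPPORTED = (
    (set(range(0x0400, 0x0500)) - {0x04D8, 0x04D9})
    | set(range(0x0500, 0x0514))   # 0500 - 0513
    | set(range(0x051A, 0x051E))   # 051A - 051D
)


def parse_ranges(text):
    """Yield (age, range_of_code_points) for each data line."""
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop the comment part
        if not line:
            continue
        span, age = (part.strip() for part in line.split(";"))
        first, _, last = span.partition("..")
        last = last or first  # a line may name a single code point
        yield age, range(int(first, 16), int(last, 16) + 1)


def coverage_by_age(text, supported):
    """Map each Unicode version to (supported_count, total_count)."""
    totals = {}
    for age, cps in parse_ranges(text):
        have, total = totals.get(age, (0, 0))
        have += sum(cp in supported for cp in cps)
        totals[age] = (have, total + len(cps))
    return totals


for age, (have, total) in coverage_by_age(DERIVED_AGE, SUPPORTED).items():
    print(f"Unicode {age}: {have}/{total} supported")
```

With a real font, the supported set would have to come from the font's character map instead of being typed in by hand.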
Does this look like a good indication of anything? I would say no… Do
you know of any other tool that looks at this bitfield when choosing
which font to pick for a particular character?
Ilya