Not able to display \u110BD and \u110CD in Emacs

unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed

* Not able to display \u110BD and \u110CD in Emacs
@ 2022-04-28 23:17 समीर सिंह Sameer Singh
  2022-04-29  5:52 ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-28 23:17 UTC (permalink / raw)
  To: emacs-devel


[-- Attachment #1.1: Type: text/plain, Size: 2055 bytes --]

Hi! I was trying to implement the Kaithi script in Emacs. Everything seems
to be working fine except for two characters: Kaithi Number Sign (\u110BD)
and Kaithi Number Sign Above (\u110CD), they do not appear when inserted in
a buffer, instead it is just whitespace.
According to the Unicode standard, they behave like the Arabic Number Sign
(\u0600) which spans numbers. I have attached a render from hb-view

This is what I have written in lisp/language/indian.el
Please tell me what I am doing wrong.

;; Kaithi composition rules
(let ((consonant "[\x1108D-\x110AF]")
      (non-consonant "[^\x1108D-\x110AF\x110B9]")
      (vowel "[\x110B0-\x110B8\x110C2]")
      (anusvara-candrabindu "[\x11080\x11081]")
      (virama "\x110B9")
      (number-sign "\x110BD")
      (number-sign-above "\x110CD")
      (numerals "\x966-\x96F"))
  (set-char-table-range composition-function-table
                        '(#x110B0 . #x110B8)
                        (list (vector
                               (concat consonant vowel anusvara-candrabindu
"?")
                               1 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110B9 . #x110B9)
                        (list (vector
                               (concat consonant "\\(?:" virama consonant
"\\)+"
                                       vowel "*\\|" consonant virama)
                               1 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110BD . #x110BD)
                        (list (vector
                               (concat number-sign numerals)
                               1 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110CD . #x110CD)
                        (list (vector
                               (concat number-sign-above numerals)
                               1 'font-shape-gstring))))

In lisp/internation/fontset.el
(kaithi ,(font-spec :registry "iso10646-1" :otf '(kthi nil (rphf))))

[-- Attachment #1.2: Type: text/html, Size: 2723 bytes --]

[-- Attachment #2: hb-view.png --]
[-- Type: image/png, Size: 12978 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-28 23:17 Not able to display \u110BD and \u110CD in Emacs समीर सिंह Sameer Singh
@ 2022-04-29  5:52 ` Eli Zaretskii
  2022-04-29  6:53   ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-29  5:52 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh
>  <lumarzeli30@gmail.com>
> Date: Fri, 29 Apr 2022 04:47:08 +0530
> 
> Hi! I was trying to implement the Kaithi script in Emacs. Everything seems
> to be working fine except for two characters: Kaithi Number Sign (\u110BD)
> and Kaithi Number Sign Above (\u110CD), they do not appear when inserted in
> a buffer, instead it is just whitespace.
> According to the Unicode standard, they behave like the Arabic Number Sign
> (\u0600) which spans numbers. I have attached a render from hb-view
> 
> This is what I have written in lisp/language/indian.el
> Please tell me what I am doing wrong.

You didn't show any Kaithi text that you think is not being displayed
correctly by Emacs.  Without that, it's hard to look into the problem.

>       (numerals "\x966-\x96F"))

I believe this should be

       (numerals "[\x966-\x96F]")

IOW, the "[..]" brackets are missing.  Maybe that is the cause of your
problem.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29  5:52 ` Eli Zaretskii
@ 2022-04-29  6:53   ` Eli Zaretskii
  2022-04-29  9:45     ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-29  6:53 UTC (permalink / raw)
  To: lumarzeli30; +Cc: emacs-devel

> Date: Fri, 29 Apr 2022 08:52:18 +0300
> From: Eli Zaretskii <eliz@gnu.org>
> Cc: emacs-devel@gnu.org
> 
> >       (numerals "\x966-\x96F"))
> 
> I believe this should be
> 
>        (numerals "[\x966-\x96F]")
> 
> IOW, the "[..]" brackets are missing.  Maybe that is the cause of your
> problem.

And one more potential problem.  Since the rules for the number signs
are:

  (set-char-table-range composition-function-table
                        '(#x110BD . #x110BD)
                        (list (vector
                               (concat number-sign numerals)
                               1 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110CD . #x110CD)
                        (list (vector
                               (concat number-sign-above numerals)
                               1 'font-shape-gstring))))

which means the number signs always come _before_ the character with
which it should be composed, you should use 0 in the rule, not 1.
That number means how many characters to look back for finding the
beginning of a composable sequence, and in these two rules the
sequence _begins_ with the character which triggers composition, so
the look-back is zero, not 1.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29  6:53   ` Eli Zaretskii
@ 2022-04-29  9:45     ` समीर सिंह Sameer Singh
  2022-04-29 10:50       ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-29  9:45 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel


[-- Attachment #1.1: Type: text/plain, Size: 3015 bytes --]

I did the changes mentioned.
1. Added the square brackets
2. Changed the 1 to a 0

;; Kaithi composition rules
(let ((consonant "[\x1108D-\x110AF]")
      (non-consonant "[^\x1108D-\x110AF\x110B9]")
      (vowel "[\x110B0-\x110B8\x110C2]")
      (anusvara-candrabindu "[\x11080\x11081]")
      (virama "\x110B9")
      (number-sign "\x110BD")
      (number-sign-above "\x110CD")
      (numerals "[\x966-\x96F]"))
  (set-char-table-range composition-function-table
                        '(#x110B0 . #x110B8)
                        (list (vector
                               (concat consonant vowel anusvara-candrabindu
"?")
                               1 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110B9 . #x110B9)
                        (list (vector
                               (concat consonant "\\(?:" virama consonant
"\\)+"
                                       vowel "*\\|" consonant virama)
                               1 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110BD . #x110BD)
                        (list (vector
                               (concat number-sign numerals)
                               0 'font-shape-gstring)))
  (set-char-table-range composition-function-table
                        '(#x110CD . #x110CD)
                        (list (vector
                               (concat number-sign-above numerals)
                               0 'font-shape-gstring))))

Now when typed alone they do not appear but when written with numbers, they
appear as NOTDEF glyphs

On Fri, Apr 29, 2022 at 12:23 PM Eli Zaretskii <eliz@gnu.org> wrote:

> > Date: Fri, 29 Apr 2022 08:52:18 +0300
> > From: Eli Zaretskii <eliz@gnu.org>
> > Cc: emacs-devel@gnu.org
> >
> > >       (numerals "\x966-\x96F"))
> >
> > I believe this should be
> >
> >        (numerals "[\x966-\x96F]")
> >
> > IOW, the "[..]" brackets are missing.  Maybe that is the cause of your
> > problem.
>
> And one more potential problem.  Since the rules for the number signs
> are:
>
>   (set-char-table-range composition-function-table
>                         '(#x110BD . #x110BD)
>                         (list (vector
>                                (concat number-sign numerals)
>                                1 'font-shape-gstring)))
>   (set-char-table-range composition-function-table
>                         '(#x110CD . #x110CD)
>                         (list (vector
>                                (concat number-sign-above numerals)
>                                1 'font-shape-gstring))))
>
> which means the number signs always come _before_ the character with
> which it should be composed, you should use 0 in the rule, not 1.
> That number means how many characters to look back for finding the
> beginning of a composable sequence, and in these two rules the
> sequence _begins_ with the character which triggers composition, so
> the look-back is zero, not 1.
>

[-- Attachment #1.2: Type: text/html, Size: 4316 bytes --]

[-- Attachment #2: emacs-kaithi-number-signs.png --]
[-- Type: image/png, Size: 52850 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29  9:45     ` समीर सिंह Sameer Singh
@ 2022-04-29 10:50       ` Eli Zaretskii
  2022-04-29 11:17         ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-29 10:50 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Fri, 29 Apr 2022 15:15:41 +0530
> Cc: emacs-devel@gnu.org
> 
> I did the changes mentioned.
> 1. Added the square brackets
> 2. Changed the 1 to a 0
> 
> ;; Kaithi composition rules
> (let ((consonant "[\x1108D-\x110AF]")
>       (non-consonant "[^\x1108D-\x110AF\x110B9]")
>       (vowel "[\x110B0-\x110B8\x110C2]")
>       (anusvara-candrabindu "[\x11080\x11081]")
>       (virama "\x110B9")
>       (number-sign "\x110BD")
>       (number-sign-above "\x110CD")
>       (numerals "[\x966-\x96F]"))
>   (set-char-table-range composition-function-table
>                         '(#x110B0 . #x110B8)
>                         (list (vector
>                                (concat consonant vowel anusvara-candrabindu "?")
>                                1 'font-shape-gstring)))
>   (set-char-table-range composition-function-table
>                         '(#x110B9 . #x110B9)
>                         (list (vector
>                                (concat consonant "\\(?:" virama consonant "\\)+"
>                                        vowel "*\\|" consonant virama)
>                                1 'font-shape-gstring)))
>   (set-char-table-range composition-function-table
>                         '(#x110BD . #x110BD)
>                         (list (vector
>                                (concat number-sign numerals)
>                                0 'font-shape-gstring)))
>   (set-char-table-range composition-function-table
>                         '(#x110CD . #x110CD)
>                         (list (vector
>                                (concat number-sign-above numerals)
>                                0 'font-shape-gstring))))
> 
> Now when typed alone they do not appear but when written with numbers, they appear as NOTDEF glyphs

Are you sure the font supports all of those characters?  What font is
used for them in each case.

And again, you don't show the text that doesn't render correctly (as
characters), so it's impossible to investigate the cases you tried.
Images help to see how stuff is displayed, but they cannot be used to
reliably figure out what text was displayed.  I asked you to show the
actual text many times.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29 10:50       ` Eli Zaretskii
@ 2022-04-29 11:17         ` समीर सिंह Sameer Singh
  2022-04-29 12:18           ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-29 11:17 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 3090 bytes --]

They seem to be working after changing the devanagari font to Noto Sans
Kaithi but only for one digit though, also they do not appear when alone.
Since the kaithi unicode does not have its own numerals and uses the
devanagari ones, is there any way to only change the devanagari font when
it is near the kaithi number signs?

Here is the Text I entered, sorry for not including them earlier, I thought
that mentioning their codepoints should be enough.
KAITHI NUMBER SIGN: 𑂽
KAITHI NUMBER SIGN + DEVANAGARI 0: 𑂽१

KAITHI NUMBER SIGN ABOVE: 𑃍
KAITHI NUMBER SIGN ABOVE + DEVANAGARI 0: 𑃍०

On Fri, Apr 29, 2022 at 4:20 PM Eli Zaretskii <eliz@gnu.org> wrote:

> > From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> > Date: Fri, 29 Apr 2022 15:15:41 +0530
> > Cc: emacs-devel@gnu.org
> >
> > I did the changes mentioned.
> > 1. Added the square brackets
> > 2. Changed the 1 to a 0
> >
> > ;; Kaithi composition rules
> > (let ((consonant "[\x1108D-\x110AF]")
> >       (non-consonant "[^\x1108D-\x110AF\x110B9]")
> >       (vowel "[\x110B0-\x110B8\x110C2]")
> >       (anusvara-candrabindu "[\x11080\x11081]")
> >       (virama "\x110B9")
> >       (number-sign "\x110BD")
> >       (number-sign-above "\x110CD")
> >       (numerals "[\x966-\x96F]"))
> >   (set-char-table-range composition-function-table
> >                         '(#x110B0 . #x110B8)
> >                         (list (vector
> >                                (concat consonant vowel
> anusvara-candrabindu "?")
> >                                1 'font-shape-gstring)))
> >   (set-char-table-range composition-function-table
> >                         '(#x110B9 . #x110B9)
> >                         (list (vector
> >                                (concat consonant "\\(?:" virama
> consonant "\\)+"
> >                                        vowel "*\\|" consonant virama)
> >                                1 'font-shape-gstring)))
> >   (set-char-table-range composition-function-table
> >                         '(#x110BD . #x110BD)
> >                         (list (vector
> >                                (concat number-sign numerals)
> >                                0 'font-shape-gstring)))
> >   (set-char-table-range composition-function-table
> >                         '(#x110CD . #x110CD)
> >                         (list (vector
> >                                (concat number-sign-above numerals)
> >                                0 'font-shape-gstring))))
> >
> > Now when typed alone they do not appear but when written with numbers,
> they appear as NOTDEF glyphs
>
> Are you sure the font supports all of those characters?  What font is
> used for them in each case.
>
> And again, you don't show the text that doesn't render correctly (as
> characters), so it's impossible to investigate the cases you tried.
> Images help to see how stuff is displayed, but they cannot be used to
> reliably figure out what text was displayed.  I asked you to show the
> actual text many times.
>

[-- Attachment #2: Type: text/html, Size: 4204 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29 11:17         ` समीर सिंह Sameer Singh
@ 2022-04-29 12:18           ` Eli Zaretskii
  2022-04-29 15:26             ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-29 12:18 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Fri, 29 Apr 2022 16:47:22 +0530
> Cc: emacs-devel@gnu.org
> 
> They seem to be working after changing the devanagari font to Noto Sans Kaithi but only for one digit
> though

But that's what the rule you wrote says:

      (numerals "[\x966-\x96F]"))
  [...]
  (set-char-table-range composition-function-table
                        '(#x110BD . #x110BD)
                        (list (vector
                               (concat number-sign numerals)
                               0 'font-shape-gstring)))

The regexp in 'numerals' will match only a single character, so just
one character after number-sign will be composed.  if you want to
compose several ones, you need to say this instead:

      (numerals "[\x966-\x96F]+"))

> also they do not appear when alone.

Do they have any glyphs in the font?  Did you try to use
font-get-glyphs to see if the font can display those characters when
they are alone?

> Since the kaithi unicode does not have its own numerals and uses the devanagari ones, is there any way to
> only change the devanagari font when it is near the kaithi number signs?

That's what your composition rules already do: they are only triggered
when the character preceding the numerals is a number sign.  So I
don't think I understand the problem.

> 
> Here is the Text I entered, sorry for not including them earlier, I thought that mentioning their codepoints
> should be enough.
> KAITHI NUMBER SIGN: 𑂽
> KAITHI NUMBER SIGN + DEVANAGARI 0: 𑂽१
> 
> KAITHI NUMBER SIGN ABOVE: 𑃍
> KAITHI NUMBER SIGN ABOVE + DEVANAGARI 0: 𑃍०

When you put the cursor at the number sign character, don't you see a
thin 1-pixel space there?



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29 12:18           ` Eli Zaretskii
@ 2022-04-29 15:26             ` समीर सिंह Sameer Singh
  2022-04-29 16:13               ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-29 15:26 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 3827 bytes --]

>
> The regexp in 'numerals' will match only a single character, so just
> one character after number-sign will be composed.  if you want to
> compose several ones, you need to say this instead:
>
>       (numerals "[\x966-\x96F]+"))


It is working fine now, Thank You!

 Do they have any glyphs in the font?  Did you try to use
> font-get-glyphs to see if the font can display those characters when
> they are alone?
>

I am an extreme novice, so please bear with me, but I cannot get the
function to work. I tried to enter the following as the argument FONT
OBJECT but none worked:
#<font-object "-GOOG-Noto Sans
Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1">
"-GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1"
"#<font-object -GOOG-Noto Sans
Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1>"
"Noto Sans Kaithi"
"/usr/share/fonts/noto/NotoSansKaithi-Regular.ttf"

Though they do appear alone in Firefox and LibreOffice Writer.

That's what your composition rules already do: they are only triggered
> when the character preceding the numerals is a number sign.  So I
> don't think I understand the problem.
>

I want the font of devanagari and kaithi to be different, but since kaithi
uses devanagari numerals and a devanagari font other than Noto Sans Kaithi
does not render the number signs, I was asking that is it possible to only
change the devanagari font to Noto Sans Kaithi if it is around a number
sign.
Though this is a very specific problem of no significance, so it is fine if
you don't answer. I will not even use the number signs very much.

When you put the cursor at the number sign character, don't you see a
> thin 1-pixel space there?
>

Yes, when the character is not visible in Emacs and whenI put the cursor in
its place there is a thin 1-pixel space there.

On Fri, Apr 29, 2022 at 5:48 PM Eli Zaretskii <eliz@gnu.org> wrote:

> > From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> > Date: Fri, 29 Apr 2022 16:47:22 +0530
> > Cc: emacs-devel@gnu.org
> >
> > They seem to be working after changing the devanagari font to Noto Sans
> Kaithi but only for one digit
> > though
>
> But that's what the rule you wrote says:
>
>       (numerals "[\x966-\x96F]"))
>   [...]
>   (set-char-table-range composition-function-table
>                         '(#x110BD . #x110BD)
>                         (list (vector
>                                (concat number-sign numerals)
>                                0 'font-shape-gstring)))
>
> The regexp in 'numerals' will match only a single character, so just
> one character after number-sign will be composed.  if you want to
> compose several ones, you need to say this instead:
>
>       (numerals "[\x966-\x96F]+"))
>
> > also they do not appear when alone.
>
> Do they have any glyphs in the font?  Did you try to use
> font-get-glyphs to see if the font can display those characters when
> they are alone?
>
> > Since the kaithi unicode does not have its own numerals and uses the
> devanagari ones, is there any way to
> > only change the devanagari font when it is near the kaithi number signs?
>
> That's what your composition rules already do: they are only triggered
> when the character preceding the numerals is a number sign.  So I
> don't think I understand the problem.
>
> >
> > Here is the Text I entered, sorry for not including them earlier, I
> thought that mentioning their codepoints
> > should be enough.
> > KAITHI NUMBER SIGN: 𑂽
> > KAITHI NUMBER SIGN + DEVANAGARI 0: 𑂽१
> >
> > KAITHI NUMBER SIGN ABOVE: 𑃍
> > KAITHI NUMBER SIGN ABOVE + DEVANAGARI 0: 𑃍०
>
> When you put the cursor at the number sign character, don't you see a
> thin 1-pixel space there?
>

[-- Attachment #2: Type: text/html, Size: 5419 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29 15:26             ` समीर सिंह Sameer Singh
@ 2022-04-29 16:13               ` Eli Zaretskii
  2022-04-30  4:52                 ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-29 16:13 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Fri, 29 Apr 2022 20:56:56 +0530
> Cc: emacs-devel@gnu.org
> 
>   Do they have any glyphs in the font?  Did you try to use
>  font-get-glyphs to see if the font can display those characters when
>  they are alone?
> 
>  
> I am an extreme novice, so please bear with me, but I cannot get the function to work. I tried to enter the
> following as the argument FONT OBJECT but none worked:
> #<font-object "-GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1">
> "-GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1" 
> "#<font-object -GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1>"
> "Noto Sans Kaithi"
> "/usr/share/fonts/noto/NotoSansKaithi-Regular.ttf"

I suggest to use font-at to get the font-object you need for
font-get-glyphs.

>  That's what your composition rules already do: they are only triggered
>  when the character preceding the numerals is a number sign.  So I
>  don't think I understand the problem.
>  
> I want the font of devanagari and kaithi to be different, but since kaithi uses devanagari numerals and a
> devanagari font other than Noto Sans Kaithi does not render the number signs, I was asking that is it
> possible to only change the devanagari font to Noto Sans Kaithi if it is around a number sign.

No, that's not possible, sorry.

>  When you put the cursor at the number sign character, don't you see a
>  thin 1-pixel space there?
> 
> Yes, when the character is not visible in Emacs and whenI put the cursor in its place there is a thin 1-pixel
> space there.

So the character is actually visible, it is just displayed as a thin
space.  Which means that either its glyph in the font is like that, or
that the font lacks a glyph for it.  What does "C-u C-x =" say when
the cursor is on that thin 1-pixel space?



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-29 16:13               ` Eli Zaretskii
@ 2022-04-30  4:52                 ` समीर सिंह Sameer Singh
  2022-04-30  6:03                   ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-30  4:52 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel


[-- Attachment #1.1: Type: text/plain, Size: 3928 bytes --]

>
> I suggest to use font-at to get the font-object you need for
> font-get-glyphs.

I had already used that and got
#<font-object "-GOOG-Noto Sans
Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1">
but font-get-glyphs was not accepting it.


> So the character is actually visible, it is just displayed as a thin
> space.  Which means that either its glyph in the font is like that, or
> that the font lacks a glyph for it.  What does "C-u C-x =" say when
> the cursor is on that thin 1-pixel space?
>

position: 89 of 89 (99%), column: 0
            character: 𑂽 (displayed as 𑂽) (codepoint 69821, #o210275,
#x110bd)
              charset: unicode (Unicode (ISO10646))
code point in charset: 0x110BD
               script: kaithi
               syntax: w which means: word
             category: L:Strong L2R
             to input: type "C-x 8 RET 110bd" or "C-x 8 RET KAITHI NUMBER
SIGN"
          buffer code: #xF0 #x91 #x82 #xBD
            file code: #xF0 #x91 #x82 #xBD (encoded by coding system utf-8)
              display: by this font (glyph code):
    ftcrhb:-GOOG-Noto Sans
Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1 (#x48)

Character code properties: customize what to show
  name: KAITHI NUMBER SIGN
  general-category: Cf (Other, Format)
  decomposition: (69821) ('𑂽')

There is an overlay here:
 From 89 to 90
  face                 hl-line
  priority             -50
  window               #<window 3 on *scratch*>


There are text properties here:
  fontified            t
  rear-nonsticky       t

In the character section the character is not displayed in emacs but
displayed in firefox, but it is displayed in the decomposition section in
emacs. I have attached the images
Opening the font file in font forge also shows their glyphs

On Fri, Apr 29, 2022 at 9:43 PM Eli Zaretskii <eliz@gnu.org> wrote:

> > From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> > Date: Fri, 29 Apr 2022 20:56:56 +0530
> > Cc: emacs-devel@gnu.org
> >
> >   Do they have any glyphs in the font?  Did you try to use
> >  font-get-glyphs to see if the font can display those characters when
> >  they are alone?
> >
> >
> > I am an extreme novice, so please bear with me, but I cannot get the
> function to work. I tried to enter the
> > following as the argument FONT OBJECT but none worked:
> > #<font-object "-GOOG-Noto Sans
> Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1">
> > "-GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1"
> > "#<font-object -GOOG-Noto Sans
> Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1>"
> > "Noto Sans Kaithi"
> > "/usr/share/fonts/noto/NotoSansKaithi-Regular.ttf"
>
> I suggest to use font-at to get the font-object you need for
> font-get-glyphs.
>
> >  That's what your composition rules already do: they are only triggered
> >  when the character preceding the numerals is a number sign.  So I
> >  don't think I understand the problem.
> >
> > I want the font of devanagari and kaithi to be different, but since
> kaithi uses devanagari numerals and a
> > devanagari font other than Noto Sans Kaithi does not render the number
> signs, I was asking that is it
> > possible to only change the devanagari font to Noto Sans Kaithi if it is
> around a number sign.
>
> No, that's not possible, sorry.
>
> >  When you put the cursor at the number sign character, don't you see a
> >  thin 1-pixel space there?
> >
> > Yes, when the character is not visible in Emacs and whenI put the cursor
> in its place there is a thin 1-pixel
> > space there.
>
> So the character is actually visible, it is just displayed as a thin
> space.  Which means that either its glyph in the font is like that, or
> that the font lacks a glyph for it.  What does "C-u C-x =" say when
> the cursor is on that thin 1-pixel space?
>

[-- Attachment #1.2: Type: text/html, Size: 5186 bytes --]

[-- Attachment #2: firefox-gmail.png --]
[-- Type: image/png, Size: 6935 bytes --]

[-- Attachment #3: emacs.png --]
[-- Type: image/png, Size: 9979 bytes --]

[-- Attachment #4: emacs-decomposition.png --]
[-- Type: image/png, Size: 6317 bytes --]

[-- Attachment #5: font-forge.png --]
[-- Type: image/png, Size: 2728 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  4:52                 ` समीर सिंह Sameer Singh
@ 2022-04-30  6:03                   ` Eli Zaretskii
  2022-04-30  6:10                     ` समीर सिंह Sameer Singh
  2022-05-05 15:53                     ` Madhu
  0 siblings, 2 replies; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-30  6:03 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Sat, 30 Apr 2022 10:22:00 +0530
> Cc: emacs-devel@gnu.org
> 
>  I suggest to use font-at to get the font-object you need for
>  font-get-glyphs.
> 
> I had already used that and got
> #<font-object "-GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1">
> but font-get-glyphs was not accepting it.

"Not accepting" how?  It works for me, so I don't understand what goes
wrong in your case.

>  So the character is actually visible, it is just displayed as a thin
>  space.  Which means that either its glyph in the font is like that, or
>  that the font lacks a glyph for it.  What does "C-u C-x =" say when
>  the cursor is on that thin 1-pixel space?
> 
> position: 89 of 89 (99%), column: 0
>             character: 𑂽 (displayed as 𑂽) (codepoint 69821, #o210275, #x110bd)
>               charset: unicode (Unicode (ISO10646))
> code point in charset: 0x110BD
>                script: kaithi
>                syntax: w which means: word
>              category: L:Strong L2R
>              to input: type "C-x 8 RET 110bd" or "C-x 8 RET KAITHI NUMBER SIGN"
>           buffer code: #xF0 #x91 #x82 #xBD
>             file code: #xF0 #x91 #x82 #xBD (encoded by coding system utf-8)
>               display: by this font (glyph code):
>     ftcrhb:-GOOG-Noto Sans Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1 (#x48)

Since this shows the "by this font" part, it means the character _is_
displayed according to the font's glyph for it, and according to
Emacs's rules for displaying such "format-control" characters.  So now
I don't think I understand why you say this character is not shown by
Emacs, when the above clearly says it is displayed.

> In the character section the character is not displayed in emacs but displayed in firefox, but it is displayed in
> the decomposition section in emacs. I have attached the images
> Opening the font file in font forge also shows their glyphs

You are confusing the display of a lone codepoint with what Font Forge
does and what Emacs does in the "decomposition" display.  Those do not
show the lone character, they show it with special characters before
or after, to show how the character will look when combined with
others.

IOW, I think your expectations from how this character should be
displayed as a lone character are incorrect.

Can you tell why you want this to be displayed differently when it is
a lone character?  AFAIU, the only meaningful display of this
character is when it precedes numerals.

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  6:03                   ` Eli Zaretskii
@ 2022-04-30  6:10                     ` समीर सिंह Sameer Singh
  2022-04-30  7:00                       ` Eli Zaretskii
  2022-05-05 15:53                     ` Madhu
  1 sibling, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-30  6:10 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 3429 bytes --]

>
> Can you tell why you want this to be displayed differently when it is
> a lone character?  AFAIU, the only meaningful display of this
> character is when it precedes numerals.

Yes you are correct, I was just confused as to why it was not being
displayed, but this works too.

btw, is the way how I have written the composition table correct? I have
copied it from the Brahmi composition one, or is it more apporpriate to do
it in the way of the devanagari, bengali, malayalam ones?

Also I plan to include more writing scripts in emacs, so should I send the
patches one by one or in bulk?



On Sat, Apr 30, 2022 at 11:33 AM Eli Zaretskii <eliz@gnu.org> wrote:

> > From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> > Date: Sat, 30 Apr 2022 10:22:00 +0530
> > Cc: emacs-devel@gnu.org
> >
> >  I suggest to use font-at to get the font-object you need for
> >  font-get-glyphs.
> >
> > I had already used that and got
> > #<font-object "-GOOG-Noto Sans
> Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1">
> > but font-get-glyphs was not accepting it.
>
> "Not accepting" how?  It works for me, so I don't understand what goes
> wrong in your case.
>
> >  So the character is actually visible, it is just displayed as a thin
> >  space.  Which means that either its glyph in the font is like that, or
> >  that the font lacks a glyph for it.  What does "C-u C-x =" say when
> >  the cursor is on that thin 1-pixel space?
> >
> > position: 89 of 89 (99%), column: 0
> >             character: 𑂽 (displayed as 𑂽) (codepoint 69821, #o210275,
> #x110bd)
> >               charset: unicode (Unicode (ISO10646))
> > code point in charset: 0x110BD
> >                script: kaithi
> >                syntax: w which means: word
> >              category: L:Strong L2R
> >              to input: type "C-x 8 RET 110bd" or "C-x 8 RET KAITHI
> NUMBER SIGN"
> >           buffer code: #xF0 #x91 #x82 #xBD
> >             file code: #xF0 #x91 #x82 #xBD (encoded by coding system
> utf-8)
> >               display: by this font (glyph code):
> >     ftcrhb:-GOOG-Noto Sans
> Kaithi-regular-normal-normal-*-23-*-*-*-*-0-iso10646-1 (#x48)
>
> Since this shows the "by this font" part, it means the character _is_
> displayed according to the font's glyph for it, and according to
> Emacs's rules for displaying such "format-control" characters.  So now
> I don't think I understand why you say this character is not shown by
> Emacs, when the above clearly says it is displayed.
>
> > In the character section the character is not displayed in emacs but
> displayed in firefox, but it is displayed in
> > the decomposition section in emacs. I have attached the images
> > Opening the font file in font forge also shows their glyphs
>
> You are confusing the display of a lone codepoint with what Font Forge
> does and what Emacs does in the "decomposition" display.  Those do not
> show the lone character, they show it with special characters before
> or after, to show how the character will look when combined with
> others.
>
> IOW, I think your expectations from how this character should be
> displayed as a lone character are incorrect.
>
> Can you tell why you want this to be displayed differently when it is
> a lone character?  AFAIU, the only meaningful display of this
> character is when it precedes numerals.
>

[-- Attachment #2: Type: text/html, Size: 4425 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  6:10                     ` समीर सिंह Sameer Singh
@ 2022-04-30  7:00                       ` Eli Zaretskii
  2022-04-30  7:03                         ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-30  7:00 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Sat, 30 Apr 2022 11:40:03 +0530
> Cc: emacs-devel@gnu.org
> 
> btw, is the way how I have written the composition table correct? I have copied it from the Brahmi
> composition one, or is it more apporpriate to do it in the way of the devanagari, bengali, malayalam ones?

It doesn't matter.  Whatever is more convenient for reading and
understanding the rules is preferable.

> Also I plan to include more writing scripts in emacs, so should I send the patches one by one or in bulk?

Separate patches, please.

Please also include with the patches the relevant additions to the
etc/HELLO file, and also don't forget to make sure
script-representative-chars has representative characters for the
script -- this is important for selecting fonts for those characters.

However, if you intend to submit more patches, I suggest to start your
legal paperwork of assigning the copyright to the FSF, so that we
could accept those future contributions from you.  If you agree, I
will send you the form now.

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  7:00                       ` Eli Zaretskii
@ 2022-04-30  7:03                         ` समीर सिंह Sameer Singh
  2022-04-30  7:19                           ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-30  7:03 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 1291 bytes --]

Yes, please send the form.

शनि, 30 अप्रैल 2022, 12:30 pm को Eli Zaretskii <eliz@gnu.org> ने लिखा:

> > From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> > Date: Sat, 30 Apr 2022 11:40:03 +0530
> > Cc: emacs-devel@gnu.org
> >
> > btw, is the way how I have written the composition table correct? I have
> copied it from the Brahmi
> > composition one, or is it more apporpriate to do it in the way of the
> devanagari, bengali, malayalam ones?
>
> It doesn't matter.  Whatever is more convenient for reading and
> understanding the rules is preferable.
>
> > Also I plan to include more writing scripts in emacs, so should I send
> the patches one by one or in bulk?
>
> Separate patches, please.
>
> Please also include with the patches the relevant additions to the
> etc/HELLO file, and also don't forget to make sure
> script-representative-chars has representative characters for the
> script -- this is important for selecting fonts for those characters.
>
> However, if you intend to submit more patches, I suggest to start your
> legal paperwork of assigning the copyright to the FSF, so that we
> could accept those future contributions from you.  If you agree, I
> will send you the form now.
>

[-- Attachment #2: Type: text/html, Size: 1768 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  7:03                         ` समीर सिंह Sameer Singh
@ 2022-04-30  7:19                           ` Eli Zaretskii
  2022-04-30  8:01                             ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-04-30  7:19 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Sat, 30 Apr 2022 12:33:11 +0530
> Cc: emacs-devel@gnu.org
> 
> Yes, please send the form.

Thanks, form sent off-list.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  7:19                           ` Eli Zaretskii
@ 2022-04-30  8:01                             ` समीर सिंह Sameer Singh
  0 siblings, 0 replies; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-04-30  8:01 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 556 bytes --]

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Sat, 30 Apr 2022 12:33:11 +0530
> Cc: emacs-devel@gnu.org
>
> Yes, please send the form.

Thanks, form sent off-list.

Thanks a lot! I have sent the form to assign@gnu.org


On Sat, Apr 30, 2022 at 12:49 PM Eli Zaretskii <eliz@gnu.org> wrote:

> > From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> > Date: Sat, 30 Apr 2022 12:33:11 +0530
> > Cc: emacs-devel@gnu.org
> >
> > Yes, please send the form.
>
> Thanks, form sent off-list.
>

[-- Attachment #2: Type: text/html, Size: 1244 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-04-30  6:03                   ` Eli Zaretskii
  2022-04-30  6:10                     ` समीर सिंह Sameer Singh
@ 2022-05-05 15:53                     ` Madhu
  2022-05-05 16:09                       ` समीर सिंह Sameer Singh
                                         ` (2 more replies)
  1 sibling, 3 replies; 39+ messages in thread
From: Madhu @ 2022-05-05 15:53 UTC (permalink / raw)
  To: emacs-devel

* Eli Zaretskii <837d779kgx.fsf @gnu.org> :
Wrote on Sat, 30 Apr 2022 09:03:10 +0300:

> Since this shows the "by this font" part, it means the character _is_
> displayed according to the font's glyph for it, and according to
> Emacs's rules for displaying such "format-control" characters.  So now
> I don't think I understand why you say this character is not shown by
> Emacs, when the above clearly says it is displayed.
>
>> In the character section the character is not displayed in emacs but
>> displayed in firefox, but it is displayed in
>> the decomposition section in emacs. I have attached the images
>> Opening the font file in font forge also shows their glyphs
>
> You are confusing the display of a lone codepoint with what Font Forge
> does and what Emacs does in the "decomposition" display.  Those do not
> show the lone character, they show it with special characters before
> or after, to show how the character will look when combined with
> others.
>
> IOW, I think your expectations from how this character should be
> displayed as a lone character are incorrect.
>
> Can you tell why you want this to be displayed differently when it is
> a lone character?  AFAIU, the only meaningful display of this
> character is when it precedes numerals.

I believe there is a similar problen with composition of devanagari
stress accents with devanagari numerals: The devanagari numerals 1
(#x967) & 3 (#x969) can be combined with the svarita (#x951, incorrectly
named in unicode as DEVANAGARI STRESS SIGN UDATTA) and the anudatta
(#x952) to form the 4 different accents.

numeral one + anudatta + svarita
preceding anudatta + numeral three + anudatta + svarita
numeral one +  anudatta
numeral three + anudatta + svarita

e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
should have an anudatta ("line below") and svarita ("vertical line
above"), but it ends up rendering the numeral three, and two then two
thin empty vertical boxes. If the boxes render the stress accents I
can't see them, they just appear as boxes.

  name: DEVANAGARI STRESS SIGN ANUDATTA
          buffer code: #xE0 #xA5 #x92
              display: by this font (glyph code):
    ftcrhb:-IITB-Shobhika-regular-normal-normal-*-17-*-*-*-*-0-iso10646-1 (#x312)




^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-05 15:53                     ` Madhu
@ 2022-05-05 16:09                       ` समीर सिंह Sameer Singh
  2022-05-05 16:43                         ` Madhu
  2022-05-05 16:47                       ` Eli Zaretskii
  2022-05-07  7:01                       ` Eli Zaretskii
  2 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-05-05 16:09 UTC (permalink / raw)
  To: Madhu; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 2710 bytes --]

>
> I believe there is a similar problen with composition of devanagari
> stress accents with devanagari numerals


Does it even make sense for the stress accents to appear with the numerals?
Shouldn't they only appear with the consonants and vowels?
Please correct me if I am wrong.

On Thu, May 5, 2022 at 9:26 PM Madhu <enometh@meer.net> wrote:

> * Eli Zaretskii <837d779kgx.fsf @gnu.org> :
> Wrote on Sat, 30 Apr 2022 09:03:10 +0300:
>
> > Since this shows the "by this font" part, it means the character _is_
> > displayed according to the font's glyph for it, and according to
> > Emacs's rules for displaying such "format-control" characters.  So now
> > I don't think I understand why you say this character is not shown by
> > Emacs, when the above clearly says it is displayed.
> >
> >> In the character section the character is not displayed in emacs but
> >> displayed in firefox, but it is displayed in
> >> the decomposition section in emacs. I have attached the images
> >> Opening the font file in font forge also shows their glyphs
> >
> > You are confusing the display of a lone codepoint with what Font Forge
> > does and what Emacs does in the "decomposition" display.  Those do not
> > show the lone character, they show it with special characters before
> > or after, to show how the character will look when combined with
> > others.
> >
> > IOW, I think your expectations from how this character should be
> > displayed as a lone character are incorrect.
> >
> > Can you tell why you want this to be displayed differently when it is
> > a lone character?  AFAIU, the only meaningful display of this
> > character is when it precedes numerals.
>
> I believe there is a similar problen with composition of devanagari
> stress accents with devanagari numerals: The devanagari numerals 1
> (#x967) & 3 (#x969) can be combined with the svarita (#x951, incorrectly
> named in unicode as DEVANAGARI STRESS SIGN UDATTA) and the anudatta
> (#x952) to form the 4 different accents.
>
> numeral one + anudatta + svarita
> preceding anudatta + numeral three + anudatta + svarita
> numeral one +  anudatta
> numeral three + anudatta + svarita
>
> e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
> should have an anudatta ("line below") and svarita ("vertical line
> above"), but it ends up rendering the numeral three, and two then two
> thin empty vertical boxes. If the boxes render the stress accents I
> can't see them, they just appear as boxes.
>
>   name: DEVANAGARI STRESS SIGN ANUDATTA
>           buffer code: #xE0 #xA5 #x92
>               display: by this font (glyph code):
>     ftcrhb:-IITB-Shobhika-regular-normal-normal-*-17-*-*-*-*-0-iso10646-1
> (#x312)
>
>
>

[-- Attachment #2: Type: text/html, Size: 3563 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-05 16:09                       ` समीर सिंह Sameer Singh
@ 2022-05-05 16:43                         ` Madhu
  2022-05-05 17:06                           ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-05 16:43 UTC (permalink / raw)
  To: emacs-devel

*  Sameer Singh <CAOR1sLw5J_vrJHo9H8N=SS=tSidAvNytfUoXYT6B8HG1HpEHFg @mail.gmail.com> :
Wrote on Thu, 5 May 2022 21:39:29 +0530:
>> I believe there is a similar problen with composition of devanagari
>> stress accents with devanagari numerals
>
> Does it even make sense for the stress accents to appear with the numerals?
> Shouldn't they only appear with the consonants and vowels?
> Please correct me if I am wrong.

Can't speak for the unicode consortium, but I believe it is the practice
in marking certain specific accents in rig veda and atharva veda (which
I enumerated). check the table on the last page of Wikner's document at
http://www.evertype.com/standards/iso10646/pdf/vedic/Vedic_accents_doc.pdf
which shows the examples I wrote about.

(I think I've come across unicode text in the wild which used these, and
so assumed these are rendered "visually correctly" by other software)




^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-05 15:53                     ` Madhu
  2022-05-05 16:09                       ` समीर सिंह Sameer Singh
@ 2022-05-05 16:47                       ` Eli Zaretskii
  2022-05-06  3:59                         ` Madhu
  2022-05-07  7:01                       ` Eli Zaretskii
  2 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-05 16:47 UTC (permalink / raw)
  To: Madhu; +Cc: emacs-devel

> From: Madhu <enometh@meer.net>
> Date: Thu, 05 May 2022 21:23:50 +0530
> 
> > Can you tell why you want this to be displayed differently when it is
> > a lone character?  AFAIU, the only meaningful display of this
> > character is when it precedes numerals.
> 
> I believe there is a similar problen with composition of devanagari
> stress accents with devanagari numerals:

AFAIU, it's definitely NOT the same problem.

> The devanagari numerals 1
> (#x967) & 3 (#x969) can be combined with the svarita (#x951, incorrectly
> named in unicode as DEVANAGARI STRESS SIGN UDATTA) and the anudatta
> (#x952) to form the 4 different accents.
> 
> numeral one + anudatta + svarita
> preceding anudatta + numeral three + anudatta + svarita
> numeral one +  anudatta
> numeral three + anudatta + svarita

I see no sign of such composition rules for Devanagai in
lisp/language/indian.el.  So it isn't a surprise that Emacs doesn't
display those sequences like you expect.

I will take a look at this when I have time, but please file a bug
report about this issue.  If you can include in the report the results
of running hb-view (from the HarfBuzz distribution) on the text
sequences you describe, that would help.

Thanks.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-05 16:43                         ` Madhu
@ 2022-05-05 17:06                           ` समीर सिंह Sameer Singh
  0 siblings, 0 replies; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-05-05 17:06 UTC (permalink / raw)
  To: Madhu; +Cc: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 1259 bytes --]

>check the table on the last page of Wikner's >document at
>http://www.evertype.com/standards/iso10646/pdf/vedic/Vedic_accents_doc.pdf
>which shows the examples I wrote about.

Got it. Thanks 👍

गुरु, 5 मई 2022, 10:26 pm को Madhu <enometh@meer.net> ने लिखा:

> *  Sameer Singh <CAOR1sLw5J_vrJHo9H8N=SS=tSidAvNytfUoXYT6B8HG1HpEHFg @
> mail.gmail.com> :
> Wrote on Thu, 5 May 2022 21:39:29 +0530:
> >> I believe there is a similar problen with composition of devanagari
> >> stress accents with devanagari numerals
> >
> > Does it even make sense for the stress accents to appear with the
> numerals?
> > Shouldn't they only appear with the consonants and vowels?
> > Please correct me if I am wrong.
>
> Can't speak for the unicode consortium, but I believe it is the practice
> in marking certain specific accents in rig veda and atharva veda (which
> I enumerated). check the table on the last page of Wikner's document at
> http://www.evertype.com/standards/iso10646/pdf/vedic/Vedic_accents_doc.pdf
> which shows the examples I wrote about.
>
> (I think I've come across unicode text in the wild which used these, and
> so assumed these are rendered "visually correctly" by other software)
>
>
>

[-- Attachment #2: Type: text/html, Size: 1995 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-05 16:47                       ` Eli Zaretskii
@ 2022-05-06  3:59                         ` Madhu
  2022-05-06  5:56                           ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-06  3:59 UTC (permalink / raw)
  To: emacs-devel


* Eli Zaretskii <835ymk2afp.fsf@gnu.org> :
Wrote on Thu, 05 May 2022 19:47:38 +0300:
>> From: Madhu <enometh@meer.net>
>> Date: Thu, 05 May 2022 21:23:50 +0530
>> > Can you tell why you want this to be displayed differently when it is
>> > a lone character?  AFAIU, the only meaningful display of this
>> > character is when it precedes numerals.
>> I believe there is a similar problen with composition of devanagari
>> stress accents with devanagari numerals:
>
> AFAIU, it's definitely NOT the same problem.

I'm sure you're right.

[bear with me: I downloaded NotoSansKaithi-Regular.ttf and tried to
replicate what Sameer had done: First I was not able to get hb-view to
produce the output shown in the first post: the #x110bc number sign
doesn't span the following numbers. Then After mutating the
composition-function-table as indicated, i had to set

(set-fontset-font "fontset-default" '(#x11080 . #x110cf)
		  (font-spec :name "Noto Sans Kaithi"
			     ;; :otf '(kthi nil (rphf))
			     :registry "iso10646-1" ))

is this incorrect?  if i add the :otf key then "find-font doesn't find
the font", but then i still see no span-composition, and only see the
thin white space with #x110bd (even after using this font for devanagri
numerals (#x966 . #x96f)) - so I haven't understood what the resolution
was]

>> The devanagari numerals 1
>> (#x967) & 3 (#x969) can be combined with the svarita (#x951, incorrectly
>> named in unicode as DEVANAGARI STRESS SIGN UDATTA) and the anudatta
>> (#x952) to form the 4 different accents.
>
> I see no sign of such composition rules for Devanagai in
> lisp/language/indian.el.  So it isn't a surprise that Emacs doesn't
> display those sequences like you expect.
>
> I will take a look at this when I have time, but please file a bug
> report about this issue.  If you can include in the report the results
> of running hb-view (from the HarfBuzz distribution) on the text
> sequences you describe, that would help.

I'll try to this by the following week.





^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-06  3:59                         ` Madhu
@ 2022-05-06  5:56                           ` Eli Zaretskii
  0 siblings, 0 replies; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-06  5:56 UTC (permalink / raw)
  To: Madhu; +Cc: emacs-devel

> From: Madhu <enometh@meer.net>
> Date: Fri, 06 May 2022 09:29:45 +0530
> 
> [bear with me: I downloaded NotoSansKaithi-Regular.ttf and tried to
> replicate what Sameer had done: First I was not able to get hb-view to
> produce the output shown in the first post:

I don't think I see any hb-view output in the first post; what did I
miss?

> the #x110bc number sign
> doesn't span the following numbers. Then After mutating the
> composition-function-table as indicated, i had to set
> 
> (set-fontset-font "fontset-default" '(#x11080 . #x110cf)
> 		  (font-spec :name "Noto Sans Kaithi"
> 			     ;; :otf '(kthi nil (rphf))
> 			     :registry "iso10646-1" ))
> 
> is this incorrect?

You should probably use 'prepend as the ADD argument of
set-fontset-font, but otherwise this looks OK to me.  (But does Kaithi
really need the rphf feature?)

> if i add the :otf key then "find-font doesn't find
> the font"

Maybe this font doesn't have some of these OTF features?

> > I will take a look at this when I have time, but please file a bug
> > report about this issue.  If you can include in the report the results
> > of running hb-view (from the HarfBuzz distribution) on the text
> > sequences you describe, that would help.
> 
> I'll try to this by the following week.

TIA



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-05 15:53                     ` Madhu
  2022-05-05 16:09                       ` समीर सिंह Sameer Singh
  2022-05-05 16:47                       ` Eli Zaretskii
@ 2022-05-07  7:01                       ` Eli Zaretskii
  2022-05-07 14:19                         ` Madhu
  2 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-07  7:01 UTC (permalink / raw)
  To: Madhu; +Cc: emacs-devel

> From: Madhu <enometh@meer.net>
> Date: Thu, 05 May 2022 21:23:50 +0530
> 
> The devanagari numerals 1 (#x967) & 3 (#x969) can be combined with
> the svarita (#x951, incorrectly named in unicode as DEVANAGARI
> STRESS SIGN UDATTA) and the anudatta (#x952) to form the 4 different
> accents.
> 
> numeral one + anudatta + svarita
> preceding anudatta + numeral three + anudatta + svarita
> numeral one +  anudatta
> numeral three + anudatta + svarita
> 
> e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
> should have an anudatta ("line below") and svarita ("vertical line
> above")

Where are these rules documented?  I don't see them in the latest
Unicode Standard 14.0.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07  7:01                       ` Eli Zaretskii
@ 2022-05-07 14:19                         ` Madhu
  2022-05-07 14:22                           ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-07 14:19 UTC (permalink / raw)
  To: eliz; +Cc: emacs-devel

*  Eli Zaretskii <eliz@gnu.org> <83h761zv0w.fsf@gnu.org>
Wrote on Sat, 07 May 2022 10:01:03 +0300

>> From: Madhu <enometh@meer.net>
>> Date: Thu, 05 May 2022 21:23:50 +0530
>>
>> The devanagari numerals 1 (#x967) & 3 (#x969) can be combined with
>> the svarita (#x951, incorrectly named in unicode as DEVANAGARI
>> STRESS SIGN UDATTA) and the anudatta (#x952) to form the 4 different
>> accents.
>>
>> numeral one + anudatta + svarita
>> preceding anudatta + numeral three + anudatta + svarita
>> numeral one +  anudatta
>> numeral three + anudatta + svarita
>>
>> e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
>> should have an anudatta ("line below") and svarita ("vertical line
>> above")
>
> Where are these rules documented?  I don't see them in the latest
> Unicode Standard 14.0.

I have no idea.  I assumed the rulesfollow from (some defined) rules
for composition, but I stopped dealing with unicode for sanskrit (
vedic accents) back in 2002 when I saw the direction it was going in.




^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 14:19                         ` Madhu
@ 2022-05-07 14:22                           ` Eli Zaretskii
  2022-05-07 14:29                             ` समीर सिंह Sameer Singh
  2022-05-07 14:57                             ` Madhu
  0 siblings, 2 replies; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-07 14:22 UTC (permalink / raw)
  To: Madhu; +Cc: emacs-devel

> Date: Sat, 07 May 2022 19:49:20 +0530 (IST)
> Cc: emacs-devel@gnu.org
> From: Madhu <enometh@meer.net>
> 
> *  Eli Zaretskii <eliz@gnu.org> <83h761zv0w.fsf@gnu.org>
> Wrote on Sat, 07 May 2022 10:01:03 +0300
> 
> >> From: Madhu <enometh@meer.net>
> >> Date: Thu, 05 May 2022 21:23:50 +0530
> >>
> >> The devanagari numerals 1 (#x967) & 3 (#x969) can be combined with
> >> the svarita (#x951, incorrectly named in unicode as DEVANAGARI
> >> STRESS SIGN UDATTA) and the anudatta (#x952) to form the 4 different
> >> accents.
> >>
> >> numeral one + anudatta + svarita
> >> preceding anudatta + numeral three + anudatta + svarita
> >> numeral one +  anudatta
> >> numeral three + anudatta + svarita
> >>
> >> e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
> >> should have an anudatta ("line below") and svarita ("vertical line
> >> above")
> >
> > Where are these rules documented?  I don't see them in the latest
> > Unicode Standard 14.0.
> 
> I have no idea.  I assumed the rulesfollow from (some defined) rules
> for composition, but I stopped dealing with unicode for sanskrit (
> vedic accents) back in 2002 when I saw the direction it was going in.

It is strange that I seem to be unable to find such rules anywhere on
the Internet, not just in the Unicode Standard text.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 14:22                           ` Eli Zaretskii
@ 2022-05-07 14:29                             ` समीर सिंह Sameer Singh
  2022-05-07 14:56                               ` Eli Zaretskii
  2022-05-07 14:57                             ` Madhu
  1 sibling, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-05-07 14:29 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: Madhu, emacs-devel


[-- Attachment #1.1: Type: text/plain, Size: 1920 bytes --]

>
> It is strange that I seem to be unable to find such rules anywhere on
> the Internet, not just in the Unicode Standard text.

I was not able to find the rules, but they definitely exist.

http://वेद.com/en/rigveda/1/38/3 <http://xn--j2b3a4c.com/en/rigveda/1/38/3>
http://वेद.com/en/rigveda/1/34/9 <http://xn--j2b3a4c.com/en/rigveda/1/34/9>
http://वेद.com/en/rigveda/1/48/6 <http://xn--j2b3a4c.com/en/rigveda/1/48/6>

On Sat, May 7, 2022 at 7:53 PM Eli Zaretskii <eliz@gnu.org> wrote:

> > Date: Sat, 07 May 2022 19:49:20 +0530 (IST)
> > Cc: emacs-devel@gnu.org
> > From: Madhu <enometh@meer.net>
> >
> > *  Eli Zaretskii <eliz@gnu.org> <83h761zv0w.fsf@gnu.org>
> > Wrote on Sat, 07 May 2022 10:01:03 +0300
> >
> > >> From: Madhu <enometh@meer.net>
> > >> Date: Thu, 05 May 2022 21:23:50 +0530
> > >>
> > >> The devanagari numerals 1 (#x967) & 3 (#x969) can be combined with
> > >> the svarita (#x951, incorrectly named in unicode as DEVANAGARI
> > >> STRESS SIGN UDATTA) and the anudatta (#x952) to form the 4 different
> > >> accents.
> > >>
> > >> numeral one + anudatta + svarita
> > >> preceding anudatta + numeral three + anudatta + svarita
> > >> numeral one +  anudatta
> > >> numeral three + anudatta + svarita
> > >>
> > >> e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
> > >> should have an anudatta ("line below") and svarita ("vertical line
> > >> above")
> > >
> > > Where are these rules documented?  I don't see them in the latest
> > > Unicode Standard 14.0.
> >
> > I have no idea.  I assumed the rulesfollow from (some defined) rules
> > for composition, but I stopped dealing with unicode for sanskrit (
> > vedic accents) back in 2002 when I saw the direction it was going in.
>
> It is strange that I seem to be unable to find such rules anywhere on
> the Internet, not just in the Unicode Standard text.
>
>

[-- Attachment #1.2: Type: text/html, Size: 3000 bytes --]

[-- Attachment #2: rig-veda1.png --]
[-- Type: image/png, Size: 32198 bytes --]

[-- Attachment #3: rig-veda3.png --]
[-- Type: image/png, Size: 16821 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 14:29                             ` समीर सिंह Sameer Singh
@ 2022-05-07 14:56                               ` Eli Zaretskii
  2022-05-07 15:47                                 ` Madhu
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-07 14:56 UTC (permalink / raw)
  To: समीर सिंह Sameer Singh
  Cc: enometh, emacs-devel

> From: समीर सिंह Sameer Singh <lumarzeli30@gmail.com>
> Date: Sat, 7 May 2022 19:59:10 +0530
> Cc: Madhu <enometh@meer.net>, emacs-devel@gnu.org
> 
>  It is strange that I seem to be unable to find such rules anywhere on
>  the Internet, not just in the Unicode Standard text.
> 
> I was not able to find the rules, but they definitely exist.
> 
> http://वेद.com/en/rigveda/1/38/3
> http://वेद.com/en/rigveda/1/34/9
> http://वेद.com/en/rigveda/1/48/6 

Very well, I've now added those rules.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 14:22                           ` Eli Zaretskii
  2022-05-07 14:29                             ` समीर सिंह Sameer Singh
@ 2022-05-07 14:57                             ` Madhu
  1 sibling, 0 replies; 39+ messages in thread
From: Madhu @ 2022-05-07 14:57 UTC (permalink / raw)
  To: eliz; +Cc: emacs-devel

*  Eli Zaretskii <eliz@gnu.org> <83pmkpxw15.fsf@gnu.org>
Wrote on Sat, 07 May 2022 17:22:14 +0300
>> Date: Sat, 07 May 2022 19:49:20 +0530 (IST)
>> Cc: emacs-devel@gnu.org
>> From: Madhu <enometh@meer.net>
>> *  Eli Zaretskii <eliz@gnu.org> <83h761zv0w.fsf@gnu.org>
>> Wrote on Sat, 07 May 2022 10:01:03 +0300
>> >> From: Madhu <enometh@meer.net>
>> >> Date: Thu, 05 May 2022 21:23:50 +0530
>> >> e.g.  (#x969 #x951 #x952), when composed, the devanagari numeral 3
>> >> should have an anudatta ("line below") and svarita ("vertical line
>> >> above")
>> >
>> > Where are these rules documented?  I don't see them in the latest
>> > Unicode Standard 14.0.
>>
>> I have no idea.  I assumed the rulesfollow from (some defined) rules
>> for composition, but I stopped dealing with unicode for sanskrit (
>> vedic accents) back in 2002 when I saw the direction it was going in.
>
> It is strange that I seem to be unable to find such rules anywhere on
> the Internet, not just in the Unicode Standard text.

The Wikner I posted upthread mentions the tradition concisely in on
p.3/6 under (a)Rgveda:

http://www.evertype.com/standards/iso10646/pdf/vedic/Vedic_accents_doc.pdf

   (a) R.gveda has udattaa unmarked the svarita (jatya or suddha) is
   marked with a vertical line above the syllable (e.g. XX), but the
   kampa is indicated by a numeral 1 following the vowel if it is
   short, or the numeral 3 if it is long, and in both cases there is a
   vertical line above the numeral and a horizontal line below it in
   the case of the long kampa vowel the syllable usually also has a
   horizontal line below it (e.g. XX ).

The canonical work (this version not searchable unfortunately)
https://archive.org/detauls/vedicgrammar00macduoft/vedicgrammar00macduoft
- pdf p.468, p.450 talks about the accents 1 and 3 numerals used in
this way.

I've seen a number of pdfs introducing vedic accents in the wild that
document this. Maybe I'll try to make a bibliography



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 14:56                               ` Eli Zaretskii
@ 2022-05-07 15:47                                 ` Madhu
  2022-05-07 16:01                                   ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-07 15:47 UTC (permalink / raw)
  To: eliz; +Cc: lumarzeli30, emacs-devel

*  Eli Zaretskii <eliz@gnu.org> <83mtftxugf.fsf@gnu.org>
Wrote on Sat, 07 May 2022 17:56:16 +0300
>> http://वेद.com/en/rigveda/1/48/6
>
> Very well, I've now added those rules.

Thanks, I'll skip on the promised bugreport then (unless I notice
something when I update :)

[BTW, on a subthread I mentioned I couldn't get hb-view to produce the
output shown on the first post: the post is
https://lists.gnu.org/archive/html/emacs-devel/2022-04/msg01297.html
and the displayed image is at
https://lists.gnu.org/archive/html/emacs-devel/2022-04/png4ClBZc1zV4.png
]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 15:47                                 ` Madhu
@ 2022-05-07 16:01                                   ` समीर सिंह Sameer Singh
  2022-05-07 16:44                                     ` Madhu
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-05-07 16:01 UTC (permalink / raw)
  To: Madhu; +Cc: Eli Zaretskii, emacs-devel

[-- Attachment #1: Type: text/plain, Size: 1161 bytes --]

> [BTW, on a subthread I mentioned I couldn't get hb-view to produce the
> output shown on the first post

Can you reproduce these steps?
1. hb-view /usr/share/fonts/noto/NotoSansKaithi-Regular.ttf
--output-file=hb-view.png (Press Enter)
2, Then type: 𑂽०१२३४५६७८९ (Press Enter)
3. Then type: 𑃍०१२३४५६७८९ (Press Enter)
4. Press Ctrl-D

This should produce the output as needed.

On Sat, May 7, 2022 at 9:17 PM Madhu <enometh@meer.net> wrote:

> *  Eli Zaretskii <eliz@gnu.org> <83mtftxugf.fsf@gnu.org>
> Wrote on Sat, 07 May 2022 17:56:16 +0300
> >> http://वेद.com/en/rigveda/1/48/6
> <http://xn--j2b3a4c.com/en/rigveda/1/48/6>
> >
> > Very well, I've now added those rules.
>
> Thanks, I'll skip on the promised bugreport then (unless I notice
> something when I update :)
>
> [BTW, on a subthread I mentioned I couldn't get hb-view to produce the
> output shown on the first post: the post is
> https://lists.gnu.org/archive/html/emacs-devel/2022-04/msg01297.html
> and the displayed image is at
> https://lists.gnu.org/archive/html/emacs-devel/2022-04/png4ClBZc1zV4.png
> ]
>

[-- Attachment #2: Type: text/html, Size: 1964 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 16:01                                   ` समीर सिंह Sameer Singh
@ 2022-05-07 16:44                                     ` Madhu
  2022-05-07 16:54                                       ` समीर सिंह Sameer Singh
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-07 16:44 UTC (permalink / raw)
  To: lumarzeli30; +Cc: eliz, emacs-devel

[-- Attachment #1: Type: Text/Plain, Size: 706 bytes --]

*  Sameer Singh <lumarzeli30@gmail.com> <CAOR1sLyB1BEkgzh=HwE5qinm5h79Z0zNx6rxQUrhh3aimfwmyQ@mail.gmail.com>
Wrote on Sat, 7 May 2022 21:31:51 +0530

>> [BTW, on a subthread I mentioned I couldn't get hb-view to produce the
>> output shown on the first post
>
> Can you reproduce these steps?
> 1. hb-view /usr/share/fonts/noto/NotoSansKaithi-Regular.ttf
> --output-file=hb-view.png (Press Enter)

Unfortunately not. My system has harfbuzz 3.0.0 and the font file I
acquired is probably from the 2017 release:
87260 Mar 24  2020 NotoSansKaithi-Regular.ttf
My output is linear.
hb-view --font-file=$HOME/.fonts/NotoSansKaithi-Regular.ttf --text-file=12.txt --output-file 12.png

I'll try updating hb later.

[-- Attachment #2: 12.txt --]
[-- Type: Text/Plain, Size: 28 bytes --]

𑂽१०८
𑂼१०८

[-- Attachment #3: 12.png --]
[-- Type: Image/Png, Size: 17196 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 16:44                                     ` Madhu
@ 2022-05-07 16:54                                       ` समीर सिंह Sameer Singh
  2022-05-08 17:25                                         ` Madhu
  0 siblings, 1 reply; 39+ messages in thread
From: समीर सिंह Sameer Singh @ 2022-05-07 16:54 UTC (permalink / raw)
  To: Madhu; +Cc: Eli Zaretskii, emacs-devel

[-- Attachment #1: Type: text/plain, Size: 926 bytes --]

I have harfbuzz 4.2.1 so yeah maybe the old version is the problem.

शनि, 7 मई 2022, 10:14 pm को Madhu <enometh@meer.net> ने लिखा:

> *  Sameer Singh <lumarzeli30@gmail.com> <CAOR1sLyB1BEkgzh=
> HwE5qinm5h79Z0zNx6rxQUrhh3aimfwmyQ@mail.gmail.com>
> Wrote on Sat, 7 May 2022 21:31:51 +0530
>
> >> [BTW, on a subthread I mentioned I couldn't get hb-view to produce the
> >> output shown on the first post
> >
> > Can you reproduce these steps?
> > 1. hb-view /usr/share/fonts/noto/NotoSansKaithi-Regular.ttf
> > --output-file=hb-view.png (Press Enter)
>
> Unfortunately not. My system has harfbuzz 3.0.0 and the font file I
> acquired is probably from the 2017 release:
> 87260 Mar 24  2020 NotoSansKaithi-Regular.ttf
> My output is linear.
> hb-view --font-file=$HOME/.fonts/NotoSansKaithi-Regular.ttf
> --text-file=12.txt --output-file 12.png
>
> I'll try updating hb later.
>

[-- Attachment #2: Type: text/html, Size: 1433 bytes --]

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-07 16:54                                       ` समीर सिंह Sameer Singh
@ 2022-05-08 17:25                                         ` Madhu
  2022-05-08 17:35                                           ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-08 17:25 UTC (permalink / raw)
  To: lumarzeli30; +Cc: eliz, emacs-devel

*  Sameer Singh <CAOR1sLwk4opLov7O95XnF56iiF7Kh88EuDsxfB6QdRqisLzTHA@mail.gmail.com>
Wrote on Sat, 7 May 2022 22:24:04 +0530

> I have harfbuzz 4.2.1 so yeah maybe the old version is the problem.

I updated harfbuzz to 4.2.1 (and upgraded freetype after that) and the
output is byte-identical to the earlier version 3.0.0.  I'm not sure
what I'm missing.

a few more notes to self: The Khaiti number signs also compose only
linearly in emacs and only when I use the same font for numerals, say:

(set-fontset-font "fontset-default" '(#x966 . #x96f)  "NotoSans Kaithi"

Otherwise I can't see glyph for #x110bd whic precedes a number, I only
see a thin vertical empty box rendered.

Likewise for the newly installed composition rules, I can only see
them with the Siddhanta font. All the other fonts that display
devanagari seem to show the thin empty vertical box to render the
stress accents.


>> *  Sameer Singh <lumarzeli30@gmail.com> <CAOR1sLyB1BEkgzh=
>> HwE5qinm5h79Z0zNx6rxQUrhh3aimfwmyQ@mail.gmail.com>
>> Wrote on Sat, 7 May 2022 21:31:51 +0530
>>
>> >> [BTW, on a subthread I mentioned I couldn't get hb-view to produce the
>> >> output shown on the first post
>> >
>> > Can you reproduce these steps?
>> > 1. hb-view /usr/share/fonts/noto/NotoSansKaithi-Regular.ttf
>> > --output-file=hb-view.png (Press Enter)
>>
>> Unfortunately not. My system has harfbuzz 3.0.0 and the font file I
>> acquired is probably from the 2017 release:
>> 87260 Mar 24  2020 NotoSansKaithi-Regular.ttf
>> My output is linear.
>> hb-view --font-file=$HOME/.fonts/NotoSansKaithi-Regular.ttf
>> --text-file=12.txt --output-file 12.png
>>
>> I'll try updating hb later.



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-08 17:25                                         ` Madhu
@ 2022-05-08 17:35                                           ` Eli Zaretskii
  2022-05-08 18:21                                             ` Madhu
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-08 17:35 UTC (permalink / raw)
  To: Madhu; +Cc: lumarzeli30, emacs-devel

> Date: Sun, 08 May 2022 22:55:53 +0530 (IST)
> Cc: eliz@gnu.org, emacs-devel@gnu.org
> From: Madhu <enometh@meer.net>
> 
> I updated harfbuzz to 4.2.1 (and upgraded freetype after that) and the
> output is byte-identical to the earlier version 3.0.0.  I'm not sure
> what I'm missing.

And I don't understand what problems you see.  I don't think you
described them in enough detail.

> a few more notes to self: The Khaiti number signs also compose only
> linearly in emacs

What do you mean by "compose linearly"?

> and only when I use the same font for numerals, say:

Emacs can only compose characters if all of the characters of a
composable sequence are supported by the same font.  It is
practically impossible to compose character glyphs from different
fonts, so Emacs doesn't.  If that's what you see, then it's the
intended behavior.

> Likewise for the newly installed composition rules, I can only see
> them with the Siddhanta font. All the other fonts that display
> devanagari seem to show the thin empty vertical box to render the
> stress accents.

If those "other fonts" cause the characters in a sequence not to be
supported by a single font, then again, this is the intended behavior.

^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-08 17:35                                           ` Eli Zaretskii
@ 2022-05-08 18:21                                             ` Madhu
  2022-05-08 18:59                                               ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-08 18:21 UTC (permalink / raw)
  To: eliz; +Cc: lumarzeli30, emacs-devel

*  Eli Zaretskii <eliz@gnu.org> <83h760udua.fsf@gnu.org>
Wrote on Sun, 08 May 2022 20:35:41 +0300
>> Date: Sun, 08 May 2022 22:55:53 +0530 (IST)
>> From: Madhu <enometh@meer.net>
>>
>> I updated harfbuzz to 4.2.1 (and upgraded freetype after that) and the
>> output is byte-identical to the earlier version 3.0.0.  I'm not sure
>> what I'm missing.
>
> And I don't understand what problems you see.  I don't think you
> described them in enough detail.

I've explained it in the messages upthread with the
references. briefly Sameer exhibited this picture
https://lists.gnu.org/archive/html/emacs-devel/2022-04/png4ClBZc1zV4.png
in his first message on this thread
https://lists.gnu.org/archive/html/emacs-devel/2022-04/msg01297.html

The composition shows that the first character spans the numerals.

I tried to reproduce it. I showed my work in this message:
https://lists.gnu.org/archive/html/emacs-devel/2022-05/msg00341.html
where I showed the shell command to invoke hb-view, and attached the
text file 12.txt and the output 12.png. (the png attachment is not
visible on lists.gnu.org - what happened?), message-id was
<20220507.221454.1080380889007217525.enometh@meer.net>
>
>> a few more notes to self: The Khaiti number signs also compose only
>> linearly in emacs
>
> What do you mean by "compose linearly"?

the 12.png file would have show the composition is "linear" in that
#x110bd and #x110bc glyphs do not span the following digits like in
the png url which Sameer posted.

>> and only when I use the same font for numerals, say:
>
> Emacs can only compose characters if all of the characters of a
> composable sequence are supported by the same font.  It is
> practically impossible to compose character glyphs from different
> fonts, so Emacs doesn't.  If that's what you see, then it's the
> intended behavior.

Yes, that is what I believe I'm seeing.

>> Likewise for the newly installed composition rules, I can only see
>> them with the Siddhanta font. All the other fonts that display
>> devanagari seem to show the thin empty vertical box to render the
>> stress accents.
>
> If those "other fonts" cause the characters in a sequence not to be
> supported by a single font, then again, this is the intended behavior.

Thanks



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-08 18:21                                             ` Madhu
@ 2022-05-08 18:59                                               ` Eli Zaretskii
  2022-05-09  2:06                                                 ` Madhu
  0 siblings, 1 reply; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-08 18:59 UTC (permalink / raw)
  To: Madhu; +Cc: lumarzeli30, emacs-devel

> Date: Sun, 08 May 2022 23:51:57 +0530 (IST)
> Cc: lumarzeli30@gmail.com, emacs-devel@gnu.org
> From: Madhu <enometh@meer.net>
> 
> I've explained it in the messages upthread with the
> references. briefly Sameer exhibited this picture
> https://lists.gnu.org/archive/html/emacs-devel/2022-04/png4ClBZc1zV4.png
> in his first message on this thread
> https://lists.gnu.org/archive/html/emacs-devel/2022-04/msg01297.html
> 
> The composition shows that the first character spans the numerals.
> 
> I tried to reproduce it. I showed my work in this message:
> https://lists.gnu.org/archive/html/emacs-devel/2022-05/msg00341.html
> where I showed the shell command to invoke hb-view, and attached the
> text file 12.txt and the output 12.png. (the png attachment is not
> visible on lists.gnu.org - what happened?), message-id was
> <20220507.221454.1080380889007217525.enometh@meer.net>
> >
> >> a few more notes to self: The Khaiti number signs also compose only
> >> linearly in emacs
> >
> > What do you mean by "compose linearly"?
> 
> the 12.png file would have show the composition is "linear" in that
> #x110bd and #x110bc glyphs do not span the following digits like in
> the png url which Sameer posted.
> 
> >> and only when I use the same font for numerals, say:
> >
> > Emacs can only compose characters if all of the characters of a
> > composable sequence are supported by the same font.  It is
> > practically impossible to compose character glyphs from different
> > fonts, so Emacs doesn't.  If that's what you see, then it's the
> > intended behavior.
> 
> Yes, that is what I believe I'm seeing.
> 
> >> Likewise for the newly installed composition rules, I can only see
> >> them with the Siddhanta font. All the other fonts that display
> >> devanagari seem to show the thin empty vertical box to render the
> >> stress accents.
> >
> > If those "other fonts" cause the characters in a sequence not to be
> > supported by a single font, then again, this is the intended behavior.
> 
> Thanks

So are we clear about what should and does happen, or are there still
issues to be looked into in this respect?



^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-08 18:59                                               ` Eli Zaretskii
@ 2022-05-09  2:06                                                 ` Madhu
  2022-05-09  2:39                                                   ` Eli Zaretskii
  0 siblings, 1 reply; 39+ messages in thread
From: Madhu @ 2022-05-09  2:06 UTC (permalink / raw)
  To: eliz; +Cc: lumarzeli30, emacs-devel

*  Eli Zaretskii <eliz@gnu.org> <83ee13voin.fsf@gnu.org>
Wrote on Sun, 08 May 2022 21:59:44 +0300

> So are we clear about what should and does happen, or are there
> still issues to be looked into in this respect?

What's not clear is the behaviour of 'arfbuzz, which on my system does
not match what is indicated in the discussion.  This possibly affects
how emacs renders the text but it wouldn't be an emacs problem.






^ permalink raw reply	[flat|nested] 39+ messages in thread

* Re: Not able to display \u110BD and \u110CD in Emacs
  2022-05-09  2:06                                                 ` Madhu
@ 2022-05-09  2:39                                                   ` Eli Zaretskii
  0 siblings, 0 replies; 39+ messages in thread
From: Eli Zaretskii @ 2022-05-09  2:39 UTC (permalink / raw)
  To: Madhu; +Cc: lumarzeli30, emacs-devel

> Date: Mon, 09 May 2022 07:36:57 +0530 (IST)
> Cc: lumarzeli30@gmail.com, emacs-devel@gnu.org
> From: Madhu <enometh@meer.net>
> 
> *  Eli Zaretskii <eliz@gnu.org> <83ee13voin.fsf@gnu.org>
> Wrote on Sun, 08 May 2022 21:59:44 +0300
> 
> > So are we clear about what should and does happen, or are there
> > still issues to be looked into in this respect?
> 
> What's not clear is the behaviour of 'arfbuzz, which on my system does
> not match what is indicated in the discussion.  This possibly affects
> how emacs renders the text but it wouldn't be an emacs problem.

Then please describe the behavior you expect and what you actually
see.  Given the missing information and images in your previous posts,
I think I need a clear description of the issue to be of any help
here.

Thanks.



^ permalink raw reply	[flat|nested] 39+ messages in thread

end of thread, other threads:[~2022-05-09  2:39 UTC | newest]

Thread overview: 39+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-04-28 23:17 Not able to display \u110BD and \u110CD in Emacs समीर सिंह Sameer Singh
2022-04-29  5:52 ` Eli Zaretskii
2022-04-29  6:53   ` Eli Zaretskii
2022-04-29  9:45     ` समीर सिंह Sameer Singh
2022-04-29 10:50       ` Eli Zaretskii
2022-04-29 11:17         ` समीर सिंह Sameer Singh
2022-04-29 12:18           ` Eli Zaretskii
2022-04-29 15:26             ` समीर सिंह Sameer Singh
2022-04-29 16:13               ` Eli Zaretskii
2022-04-30  4:52                 ` समीर सिंह Sameer Singh
2022-04-30  6:03                   ` Eli Zaretskii
2022-04-30  6:10                     ` समीर सिंह Sameer Singh
2022-04-30  7:00                       ` Eli Zaretskii
2022-04-30  7:03                         ` समीर सिंह Sameer Singh
2022-04-30  7:19                           ` Eli Zaretskii
2022-04-30  8:01                             ` समीर सिंह Sameer Singh
2022-05-05 15:53                     ` Madhu
2022-05-05 16:09                       ` समीर सिंह Sameer Singh
2022-05-05 16:43                         ` Madhu
2022-05-05 17:06                           ` समीर सिंह Sameer Singh
2022-05-05 16:47                       ` Eli Zaretskii
2022-05-06  3:59                         ` Madhu
2022-05-06  5:56                           ` Eli Zaretskii
2022-05-07  7:01                       ` Eli Zaretskii
2022-05-07 14:19                         ` Madhu
2022-05-07 14:22                           ` Eli Zaretskii
2022-05-07 14:29                             ` समीर सिंह Sameer Singh
2022-05-07 14:56                               ` Eli Zaretskii
2022-05-07 15:47                                 ` Madhu
2022-05-07 16:01                                   ` समीर सिंह Sameer Singh
2022-05-07 16:44                                     ` Madhu
2022-05-07 16:54                                       ` समीर सिंह Sameer Singh
2022-05-08 17:25                                         ` Madhu
2022-05-08 17:35                                           ` Eli Zaretskii
2022-05-08 18:21                                             ` Madhu
2022-05-08 18:59                                               ` Eli Zaretskii
2022-05-09  2:06                                                 ` Madhu
2022-05-09  2:39                                                   ` Eli Zaretskii
2022-05-07 14:57                             ` Madhu

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).