all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: "समीर सिंह Sameer Singh" <lumarzeli30@gmail.com>
To: rms@gnu.org
Cc: Eli Zaretskii <eliz@gnu.org>, 58159@debbugs.gnu.org
Subject: bug#58159: [PATCH] Add support for the Wancho script
Date: Sun, 9 Oct 2022 06:38:53 +0530	[thread overview]
Message-ID: <CAOR1sLwiq+ixOQ-70HzsUGTw6z6SGV0x9zr9cFrCugAXJAdRGQ@mail.gmail.com> (raw)
In-Reply-To: <E1ohIPw-0008J7-2h@fencepost.gnu.org>

[-- Attachment #1: Type: text/plain, Size: 4500 bytes --]

>
> Normally a feature like this requires documentation in a manual as
> well as code to implement it.
>

Can you elaborate on what changes are needed in which manual?

The code is already implemented i.e. the foundations to support these
scripts are already there,
someone just needs to take their time and extend this support to a specific
script, and I am doing
exactly that. This is nothing more than some grunt work.

This is what a typical patch for adding a script in Emacs looks like:
1. A one line entry in etc/NEWS announcing the support of the script and
its language environment.
2. A one line greeting in the language/script which is added in etc/HELLO
(optional)
3. A one line entry in script-representative-chars in
lisp/international/fontset.el so that Emacs can select an appropriate font
for it.
4. Adding the script name in setup-default-fontset in
lisp/international/fontset.el
5. Defining a language environment for the script in the lisp/language/*.el
files which includes the following entries:
its charset (usually unicode), its coding-system (usually utf-8), its
coding-priority (usually utf-8), its input-method, its sample text (the
same text which is added in etc/HELLO),
a one line documentation usually in the following template: "foo language
and its script bar are supported in this language environment."
6. Adding composition rules for the script (optional, only needed for
complex scripts)
7. Adding an input-method for the script in lisp/leim/quail/*.el files

Adding one of these patches does not mean introducing any significant or
breaking changes.
All the heavy lifting functions or programs were implemented earlier.
We already parse all of the information from unicode so Emacs knows about
these characters,
composite.el and harfbuzz take care of composition and quail takes care of
input-methods.

The average size of my patches appears to be around 126 lines with the
input method and 36 lines without the input-method,
which is a given since input method is needed to be defined for nearly
every key on the keyboard.
I have added around 27 scripts since May of this year.

My point is that when Unicode incorporates scripts that aren't and
> never were used very much, and were developed for PR motives,
> incorporation into Unicode is not by itself a reason to add support
> into Emacs
>

These scripts were not developed for "PR motives", they were developed to
serve the needs of the community.
For example this what was said by the inventor of the Wancho script[1]

> "I found out that it was not possible to translate the language as it did
> not capture all of its sounds. So I started researching on phonetics of the
> language," Losu said.
>

It is necessary for Unicode to support them because this is not the age of
pen and paper where the only thing limiting you to write any script for
communication is... you.
For computers this is not possible therefore efforts should be made to
rectify this both at the Unicode level and the application level.

I don't speak either Urdu or Hindi, but I've read that Urdu has a lot
> of vocabulary derived from Persian or Arabic.  With such a difference,
> they are not "virtually the same."
>

Urdu and Hindi have virtually the same grammar, having some different
vocabulary does not make it
a different language. Hindi and Urdu are regarded as two different
registers of the same language.
see: https://en.wikipedia.org/wiki/Hindustani_language

[1]
https://www.indiatoday.in/education-today/news/story/this-arunachal-student-worked-for-over-12-years-to-create-a-new-alphabet-for-a-dying-ancient-tribal-language-1597122-2019-09-09

On Sun, Oct 9, 2022 at 4:05 AM Richard Stallman <rms@gnu.org> wrote:

> [[[ To any NSA and FBI agents reading my email: please consider    ]]]
> [[[ whether defending the US Constitution against all enemies,     ]]]
> [[[ foreign or domestic, requires you to follow Snowden's example. ]]]
>
>   > > But if Unicode is inclined to do things like this, how many more
>   > > barely-used scripts will it adopt?  How many more has it already
>   > > adopted?
>
>   > That is not our question to answer.
>
> They are questions about the future, so we cannot look for answers
> today.  But they do affect what our attitude towards Unicode should
> be.
>
>
> --
> Dr Richard Stallman (https://stallman.org)
> Chief GNUisance of the GNU Project (https://gnu.org)
> Founder, Free Software Foundation (https://fsf.org)
> Internet Hall-of-Famer (https://internethalloffame.org)
>
>
>

[-- Attachment #2: Type: text/html, Size: 6540 bytes --]

  reply	other threads:[~2022-10-09  1:08 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-09-29 11:07 bug#58159: [PATCH] Add support for the Wancho script समीर सिंह Sameer Singh
2022-09-29 11:09 ` समीर सिंह Sameer Singh
2022-09-29 13:15   ` Eli Zaretskii
2022-09-29 13:21     ` समीर सिंह Sameer Singh
2022-09-29 14:27   ` Robert Pluim
2022-09-29 15:19     ` समीर सिंह Sameer Singh
2022-09-29 15:41       ` Robert Pluim
2022-10-01  1:58   ` Richard Stallman
2022-10-01  4:53     ` समीर सिंह Sameer Singh
2022-10-03  1:06       ` Richard Stallman
2022-10-03  2:38         ` Eli Zaretskii
2022-10-08 22:35           ` Richard Stallman
2022-10-09  1:08             ` समीर सिंह Sameer Singh [this message]
2022-10-14 21:24               ` Richard Stallman
2022-10-15  6:35                 ` Eli Zaretskii
2022-10-09  4:22             ` Eli Zaretskii
2022-10-01  6:03     ` Eli Zaretskii

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAOR1sLwiq+ixOQ-70HzsUGTw6z6SGV0x9zr9cFrCugAXJAdRGQ@mail.gmail.com \
    --to=lumarzeli30@gmail.com \
    --cc=58159@debbugs.gnu.org \
    --cc=eliz@gnu.org \
    --cc=rms@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.