all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Kenichi Handa <handa@m17n.org>
Cc: emacs-devel@gnu.org
Subject: Re: Problem of auto-fill-mode for wide character
Date: Wed, 28 Dec 2005 16:46:27 +0900	[thread overview]
Message-ID: <E1ErW0x-0005jN-00@etlken> (raw)
In-Reply-To: <BAY109-F3569866FC9384654E0EDBBDA3D0@phx.gbl> (herberteuler@hotmail.com)

In article <BAY109-F3569866FC9384654E0EDBBDA3D0@phx.gbl>, "Herbert Euler" <herberteuler@hotmail.com> writes:

> Hello everyone,
> I'm very happy to see that Emacs supports Unicode internally,
> but there is a problem in auto-fill-mode with this modification.
> I'm going to explain why I think it's because of Unicode support.
[...]
> In the current Unicode 2 branch, Chinese characters are inserted
> with the command 'self-insert-command' as ASCII characters.
> This makes auto-fill in Chinese like other languages such as English,
> since "in Auto Fill mode, lines are broken automatically _at spaces_
> when they get longer than the desired width". This is good to
> languages in which words are separated with spaces, but at
> least it's not appropriate for Chinese, because there are _no_
> spaces between Chinese characters. So one can force Emacs
> auto-fill either by inserting spaces or by pressing M-q to invoke
> 'fill-paragraph', both of them is not "natural" in Chinese editing.

> Is my understanding correct? Could somebody help solve this
> problem? Thanks.

Thank you for reminding of this unsolved problem.  The
reason why Chinese characters invoke auto-fill is that they
are not yet registered in the char-table auto-fill-chars.
And why I have not yet done in Unicode 2 branch is that I
don't know any "authorized" information about that.

I've just registered these apparent characters:
  U+3041..U+30FF, U+3400..U+4DB5, U+4e00..U+9fbb, U+F900..U+FAFF,
  U+FF00..U+FF9F, U+20000..U+2FFFF
So, now auto-fill should work for most Han characters.

But, there are many more questionable characters, for instance:
  U+3000..U+303F, U+3200..U+33FF, ...

Do you have some idea about exactly which set of characters
to register in auto-fill-chars?

---
Kenichi Handa
handa@m17n.org

  reply	other threads:[~2005-12-28  7:46 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-12-17 15:58 Problem of auto-fill-mode for wide character Herbert Euler
2005-12-28  7:46 ` Kenichi Handa [this message]
2005-12-30  2:43   ` Herbert Euler
2006-01-04  4:28     ` Kenichi Handa
2006-01-09  2:56   ` Herbert Euler
2006-01-10  1:20     ` Kenichi Handa
2006-01-10  1:58       ` Herbert Euler
  -- strict thread matches above, loose matches on Subject: below --
2005-12-18  4:56 Herbert Euler

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=E1ErW0x-0005jN-00@etlken \
    --to=handa@m17n.org \
    --cc=emacs-devel@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.