From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: martin rudalics Newsgroups: gmane.emacs.bugs Subject: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab Date: Tue, 17 Jul 2012 11:49:56 +0200 Message-ID: <50053544.8020608@gmx.at> References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit X-Trace: dough.gmane.org 1342519527 29155 80.91.229.3 (17 Jul 2012 10:05:27 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Tue, 17 Jul 2012 10:05:27 +0000 (UTC) Cc: 11948@debbugs.gnu.org, Reuben Thomas To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Jul 17 12:05:27 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Sr4eb-00050j-VY for geb-bug-gnu-emacs@m.gmane.org; Tue, 17 Jul 2012 12:05:18 +0200 Original-Received: from localhost ([::1]:38694 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Sr4ea-0003yy-7M for geb-bug-gnu-emacs@m.gmane.org; Tue, 17 Jul 2012 06:05:16 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:45825) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Sr4eT-0003wh-D2 for bug-gnu-emacs@gnu.org; Tue, 17 Jul 2012 06:05:15 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Sr4eM-0004u2-Tx for bug-gnu-emacs@gnu.org; Tue, 17 Jul 2012 06:05:09 -0400 Original-Received: from debbugs.gnu.org ([140.186.70.43]:35835) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Sr4eM-0004tq-RD for bug-gnu-emacs@gnu.org; Tue, 17 Jul 2012 06:05:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1Sr4kA-0000q3-1D for bug-gnu-emacs@gnu.org; Tue, 17 Jul 2012 06:11:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: martin rudalics Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 17 Jul 2012 10:11:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 11948 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 11948-submit@debbugs.gnu.org id=B11948.13425198123169 (code B ref 11948); Tue, 17 Jul 2012 10:11:01 +0000 Original-Received: (at 11948) by debbugs.gnu.org; 17 Jul 2012 10:10:12 +0000 Original-Received: from localhost ([127.0.0.1]:45381 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Sr4jL-0000p4-Uj for submit@debbugs.gnu.org; Tue, 17 Jul 2012 06:10:12 -0400 Original-Received: from mailout-de.gmx.net ([213.165.64.22]:55377) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1Sr4jJ-0000ov-7h for 11948@debbugs.gnu.org; Tue, 17 Jul 2012 06:10:10 -0400 Original-Received: (qmail invoked by alias); 17 Jul 2012 09:48:58 -0000 Original-Received: from 62-47-34-58.adsl.highway.telekom.at (EHLO [62.47.34.58]) [62.47.34.58] by mail.gmx.net (mp034) with SMTP; 17 Jul 2012 11:48:58 +0200 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX1+H68AB8f+H85uucFJexq8C95qTA6iX+YAnbrPpBL FiC/y2JWySVwEl In-Reply-To: <83wr23a3pe.fsf@gnu.org> X-Y-GMX-Trusted: 0 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:62039 Archived-At: >> The Unicode line breaking algorithm is probably the place to go: >> >> http://unicode.org/reports/tr14/ > > Thanks, but that's not what I meant. Implementing UAX#14 in full is > an effort similar (although smaller) to what was required for > implementing UAX#9, the Unicode Bidirectional Algorithm. The main > problem is that, like with UAX#9, the algorithms in UAX#14 are > specified assuming that text is processed for display in batches. By > contrast, the Emacs display engine, which implements word-wrap, > examines and processes characters one by one. So one needs to > "serialize", so to speak, the UAX#14 algorithms so that its decisions > could be made on a character-by-character basis. > > I think just supporting more characters from LineBreak.txt on which to > wrap should be a good start, and much easier than implementing UAX#14. > Even for that, we will need an efficient char-table for the related > properties, probably via the uniprop_table machinery, like what bidi.c > uses. Otherwise, referencing the ordinary char-tables of character > properties for each character we display could slow down redisplay too > much. While you're all there: If anybody has any idea how to support a practical and simplified version of collation, see http://www.unicode.org/reports/tr10/ in emacs, I'd be all ears. martin