all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
* bug#1726: 23.0.60; end-of-sentence and non-breaking space
@ 2008-12-29 10:23 Richard M Stallman
  2011-09-11 18:43 ` Lars Magne Ingebrigtsen
  0 siblings, 1 reply; 22+ messages in thread
From: Richard M Stallman @ 2008-12-29 10:23 UTC (permalink / raw)
  To: emacs-pretest-bug

forward-sentence does not treat non-breaking space as a space
for purposes of sentence ends.

I will fix this as soon as I know a way to put non-breaking space
into a string constant.



In GNU Emacs 23.0.60.15 (mipsel-unknown-linux-gnu, GTK+ Version 2.12.11)
 of 2008-12-22 on lemote-yeeloong
configured using `configure  'CFLAGS=-O0 -g -Wno-pointer-sign' 'mipsel-unknown-linux-gnu' 'build_alias=mipsel-unknown-linux-gnu' 'host_alias=mipsel-unknown-linux-gnu' 'target_alias=mipsel-unknown-linux-gnu''

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: en_US.UTF-8
  value of $XMODIFIERS: nil
  locale-coding-system: utf-8-unix
  default-enable-multibyte-characters: t

Major mode: Mail

Minor modes in effect:
  gpm-mouse-mode: t
  tooltip-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  global-auto-composition-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t
  abbrev-mode: t

Recent input:
r e SPC s h o u l d SPC b e SPC a SPC C-x C-f ESC p 
ESC DEL ESC DEL l r e a d . c RET C-s C-j r e a d _ 
s c DEL DEL e s c C-v ESC v C-v C-v C-v C-v C-v C-v 
C-u C-x m C-a C-k P e r h a p s SPC \ N SPC s h o u 
d l SPC g i v e ESC b C-b C-b C-t C-e SPC n o n - b 
r e a k i n g SPC s o p a c e . ESC b C-f C-d C-c C-c 
C-x b o u t g TAB RET g C-p C-p C-n C-n R o u t - 2 
5 RET C-p C-p e C-u C-u C-n C-u C-u C-n C-p C-p C-o 
C-o I SPC n e e d SPC t h i s SPC ESC DEL t o SPC s 
o l v e SPC t h i s SPC p r o b l e m SPC i n SPC o 
r d e r SPC t o SPC i DEL f i x SPC p a r a g r a p 
h s . e l RET t o SPC h a b d l e C-b C-b C-b DEL n 
C-p C-e ESC DEL ESC DEL f o r w a r d - s e n t e n 
c e C-n SPC n o n ESC / ESC SPC ESC / . C-x C-s C-a 
C-p C-p C-@ C-n C-n C-n C-w C-x C-s ESC x r e p o r 
t SPC e m a v s SPC DEL DEL c s SPC b u g RET

Recent messages:
Auto save file for draft message exists; consider M-x mail-recover
Sending...
Wrote /home/rms/outgoing/out-24
Sending...done
Move: 1 of 1
Move: 1 file
Wrote /home/rms/outgoing/out-24
Mark set
Auto-saving...done
Wrote /home/rms/outgoing/out-24






^ permalink raw reply	[flat|nested] 22+ messages in thread
* bug#1726: 23.0.60; end-of-sentence and non-breaking space
@ 2009-01-01  3:47 Chong Yidong
  0 siblings, 0 replies; 22+ messages in thread
From: Chong Yidong @ 2009-01-01  3:47 UTC (permalink / raw)
  To: emacs-devel; +Cc: 1726, rms, 1727

From bug#1726 and bug#1727:

> forward-sentence does not treat non-breaking space as a space for
> purposes of sentence ends.
...
> When I type C-x = at a non-breaking space, it tells me that it
> has code 160, hex a0.  But when I execute (insert "\xa0"),
> it inserts something that displays as `\240' and for which C-x =
> displays this:
>
>    Char:   (4194208, #o17777640, #x3fffa0, raw-byte) point=198 of 211
>    (93%) column=5
>
> Is that a bug?  It seems quite confusing to me.

ISTR that there was an extended discussion about classifying
non-breaking spaces on this list a while back.  But I can't find it now.
Does anyone remember the details?






^ permalink raw reply	[flat|nested] 22+ messages in thread
* Re: 23.0.60; end-of-sentence and non-breaking space
@ 2009-01-01  3:47 Chong Yidong
  2009-01-02  1:25 ` bug#1726: " Richard M Stallman
  2009-01-02  4:11 ` Stefan Monnier
  0 siblings, 2 replies; 22+ messages in thread
From: Chong Yidong @ 2009-01-01  3:47 UTC (permalink / raw)
  To: emacs-devel; +Cc: 1726, rms, 1727

From bug#1726 and bug#1727:

> forward-sentence does not treat non-breaking space as a space for
> purposes of sentence ends.
...
> When I type C-x = at a non-breaking space, it tells me that it
> has code 160, hex a0.  But when I execute (insert "\xa0"),
> it inserts something that displays as `\240' and for which C-x =
> displays this:
>
>    Char:   (4194208, #o17777640, #x3fffa0, raw-byte) point=198 of 211
>    (93%) column=5
>
> Is that a bug?  It seems quite confusing to me.

ISTR that there was an extended discussion about classifying
non-breaking spaces on this list a while back.  But I can't find it now.
Does anyone remember the details?




^ permalink raw reply	[flat|nested] 22+ messages in thread
* Re: 23.0.60; end-of-sentence and non-breaking space
  2009-01-01  3:47 Chong Yidong
@ 2009-01-02  4:11 Stefan Monnier
  2009-01-02 17:13 ` Richard M Stallman
  2009-01-02 17:13 ` Richard M Stallman
  1 sibling, 2 replies; 22+ messages in thread
From: Stefan Monnier @ 2009-01-02  4:11 UTC (permalink / raw)
  To: Chong Yidong; +Cc: 1726, 1727, rms, emacs-devel

>> has code 160, hex a0.  But when I execute (insert "\xa0"),
>> it inserts something that displays as `\240' and for which C-x =
>> displays this:
>> 
>> Char:   (4194208, #o17777640, #x3fffa0, raw-byte) point=198 of 211
>> (93%) column=5
>> 
>> Is that a bug?  It seems quite confusing to me.

This raw-byte char is what used to be called an eight-bit-control (or
eight-bit-graphic depending on the actual value) char.

I.e. "\xa0" is treated as a string that contains the \xa0 byte (i.e. an
eight-bit-* (aka raw-byte) char) rather than the \xa0 char (a latin-1
non-breaking space).


        Stefan




^ permalink raw reply	[flat|nested] 22+ messages in thread
* Re: 23.0.60; end-of-sentence and non-breaking space
  2009-01-04  2:16         ` Richard M Stallman
@ 2009-01-04  4:18 Eli Zaretskii
  2009-01-04 21:42 ` bug#1726: " Richard M Stallman
  3 siblings, 1 reply; 22+ messages in thread
From: Eli Zaretskii @ 2009-01-04  4:18 UTC (permalink / raw)
  To: rms; +Cc: cyd, 1726, monnier, emacs-devel

> From: Richard M Stallman <rms@gnu.org>
> CC: cyd@stupidchicken.com, emacs-devel@gnu.org,
> 	monnier@iro.umontreal.ca, 1726@emacsbugs.donarmstrong.com
> Date: Sat, 03 Jan 2009 21:16:21 -0500
> 
>     We need some way of inserting raw 8-bit bytes, because otherwise code
>     that encodes and decodes text in Lisp will not work.  For inserting
>     characters, we have the \u alternative; but I don't think there's
>     alternative for raw bytes except insert \xNN.
> 
> Naybe that is a valid reason for the current behavior, but that
> doesn't alter the need for the manual to document the behavior.

That was an attempt at explaining the reasons, not telling they don't
need to be documented.

> Meanwhile, the Chinese and Chinese-derived character codes
> do not follow Unicode.  So you can't enter them with \u.
> What is the way to enter them?

The problem at hand exists only for codes that are less than FF hex.




^ permalink raw reply	[flat|nested] 22+ messages in thread
* Re: 23.0.60; end-of-sentence and non-breaking space
  2009-01-04  2:16         ` Richard M Stallman
@ 2009-01-05  7:11 Kenichi Handa
  2009-01-06  0:01 ` bug#1726: " Richard M Stallman
  3 siblings, 1 reply; 22+ messages in thread
From: Kenichi Handa @ 2009-01-05  7:11 UTC (permalink / raw)
  To: rms; +Cc: eliz, emacs-devel, cyd, monnier, 1726

In article <E1LJIXN-0008Vg-Pe@fencepost.gnu.org>, Richard M Stallman <rms@gnu.org> writes:

>     We need some way of inserting raw 8-bit bytes, because otherwise code
>     that encodes and decodes text in Lisp will not work.  For inserting
>     characters, we have the \u alternative; but I don't think there's
>     alternative for raw bytes except insert \xNN.

> Naybe that is a valid reason for the current behavior, but that
> doesn't alter the need for the manual to document the behavior.

> Meanwhile, the Chinese and Chinese-derived character codes
> do not follow Unicode.  So you can't enter them with \u.
> What is the way to enter them?

Most of Chinese and Chinese-derived character codes are
unified into Unicode area.  Only a few codes can't be
unified with Unicode, and thus decoded into the character
space over #x110000.  But, in that sense, Chinese and
Chinese-derived character codes are not special.  There
exist several non-Chinese character sets (e.g. tibetan)
containing characters that doesn't exist in Unicode, and
they are decoded into the character space over #x110000 too.

But, all of them can be accessed by "\U00XXXXXX".

---
Kenichi Handa
handa@m17n.org




^ permalink raw reply	[flat|nested] 22+ messages in thread

end of thread, other threads:[~2012-03-27 23:27 UTC | newest]

Thread overview: 22+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-12-29 10:23 bug#1726: 23.0.60; end-of-sentence and non-breaking space Richard M Stallman
2011-09-11 18:43 ` Lars Magne Ingebrigtsen
2012-03-27 23:27   ` Glenn Morris
  -- strict thread matches above, loose matches on Subject: below --
2009-01-01  3:47 Chong Yidong
2009-01-01  3:47 Chong Yidong
2009-01-02  1:25 ` bug#1726: " Richard M Stallman
2009-01-02  4:11 ` Stefan Monnier
2009-01-02  4:11 Stefan Monnier
2009-01-02 17:13 ` Richard M Stallman
2009-01-03  3:06   ` bug#1726: " Stefan Monnier
2009-01-03  3:06   ` Stefan Monnier
2009-01-03  9:54     ` bug#1726: " Eli Zaretskii
2009-01-03 15:21     ` Richard M Stallman
2009-01-03 15:21     ` Richard M Stallman
2009-01-03 16:44       ` Eli Zaretskii
2009-01-04  2:16         ` Richard M Stallman
2009-01-04  4:18           ` bug#1726: " Eli Zaretskii
2009-01-04  4:29           ` Jason Rumney
2009-01-04 16:45             ` Eli Zaretskii
2009-01-04 16:45             ` Eli Zaretskii
2009-01-04  4:29           ` Jason Rumney
2009-01-05  7:11           ` Kenichi Handa
2009-01-04  2:16         ` Richard M Stallman
2009-01-05  6:37         ` Kenichi Handa
2009-01-03 16:44       ` Eli Zaretskii
2009-01-02 17:13 ` Richard M Stallman
2009-01-04  4:18 Eli Zaretskii
2009-01-04 21:42 ` bug#1726: " Richard M Stallman
2009-01-05  7:11 Kenichi Handa
2009-01-06  0:01 ` bug#1726: " Richard M Stallman
2009-01-06  4:08   ` Eli Zaretskii

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.