unofficial mirror of guile-devel@gnu.org 
 help / color / mirror / Atom feed
* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
       [not found] <E1Rsnjh-0000kv-9A@vcs.savannah.gnu.org>
@ 2012-02-02 18:04 ` Mark H Weaver
  2012-02-02 18:52   ` Mike Gran
  2012-02-02 19:18   ` Mike Gran
  0 siblings, 2 replies; 5+ messages in thread
From: Mark H Weaver @ 2012-02-02 18:04 UTC (permalink / raw)
  To: Mike Gran; +Cc: guile-devel

Hi Mike,

Thanks for the Unicode 6.1 update!  Now, however:

  FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol

Would you be willing to investigate?

  Many thanks!
     Mark


"Mike Gran" <spk121-/E1597aS9LQAvxtiuMwx3w@public.gmane.org> writes:
> commit bf8d845468fa71debf45e91a4e40f4b219dab4b0
> Author: Mike Gran <spk121-/E1597aS9LQAvxtiuMwx3w@public.gmane.org>
> Date:   Wed Feb 1 19:51:35 2012 -0800
>
>     Update srfi-14 character sets to Unicode 6.1
>     
>     * libguile/srfi-14.i.c (cs_lower_case_ranges, cs_lower_case)
>       (cs_upper_case_ranges, cs_upper_case, cs_letter_ranges, cs_letter)
>       (cs_digit_ranges, cs_digit, cs_letter_plus_digit_ranges, cs_letter_plus_digit)
>       (cs_graphic_ranges, cs_graphic, cs_printing_ranges, cs_printing)
>       (cs_punctuation_ranges, cs_punctuation, cs_symbol_ranges, cs_symbol)
>       (cs_designated_ranges, cs_designated): modified
>     * doc/ref/api-data.texi: modified



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
  2012-02-02 18:04 ` GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845 Mark H Weaver
@ 2012-02-02 18:52   ` Mike Gran
  2012-02-02 19:18   ` Mike Gran
  1 sibling, 0 replies; 5+ messages in thread
From: Mike Gran @ 2012-02-02 18:52 UTC (permalink / raw)
  To: Mark H Weaver, Mike Gran; +Cc: guile-devel@gnu.org

Hi Mark-
 
> Thanks for the Unicode 6.1 update!  Now, however:
> 
>   FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
> 
> Would you be willing to investigate?

Strange.  I'll check it out today.

-Mike



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
  2012-02-02 18:04 ` GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845 Mark H Weaver
  2012-02-02 18:52   ` Mike Gran
@ 2012-02-02 19:18   ` Mike Gran
  2012-02-02 19:48     ` Mark H Weaver
  2012-02-02 20:07     ` David Kastrup
  1 sibling, 2 replies; 5+ messages in thread
From: Mike Gran @ 2012-02-02 19:18 UTC (permalink / raw)
  To: Mark H Weaver, Mike Gran; +Cc: guile-devel@gnu.org

> Hi Mike,
> 
> Thanks for the Unicode 6.1 update!  Now, however:
> 
>   FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
> 
> Would you be willing to investigate?

Looks like Unicode 6.1 has recategorized some of the symbols, including
a few in Latin-1.
 
"§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other
"¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other
 
It seems that the correct response would be just to change
the Latin-1 test cases.
 
A wrinkle, though, is that in SRFI-14, they call out "§" and "¶"
as symbols.  But my interpretation of the text in SRFI-14 is that
they intended to follow Unicode's categorization.
 
http://srfi.schemers.org/srfi-14/srfi-14.html
 
WDYT?
 
Thanks,
 
Mike



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
  2012-02-02 19:18   ` Mike Gran
@ 2012-02-02 19:48     ` Mark H Weaver
  2012-02-02 20:07     ` David Kastrup
  1 sibling, 0 replies; 5+ messages in thread
From: Mark H Weaver @ 2012-02-02 19:48 UTC (permalink / raw)
  To: Mike Gran; +Cc: guile-devel

Mike Gran <spk121@yahoo.com> writes:

>> Thanks for the Unicode 6.1 update!  Now, however:
>> 
>>   FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>> 
>> Would you be willing to investigate?
>
> Looks like Unicode 6.1 has recategorized some of the symbols, including
> a few in Latin-1.
>  
> "§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other
> "¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other
>  
> It seems that the correct response would be just to change
> the Latin-1 test cases.
>  
> A wrinkle, though, is that in SRFI-14, they call out "§" and "¶"
> as symbols.  But my interpretation of the text in SRFI-14 is that
> they intended to follow Unicode's categorization.

Agreed.

> http://srfi.schemers.org/srfi-14/srfi-14.html

SRFI-14 states:

  char-set:symbol

  In Unicode, a symbol is any character that has one of the symbol
  categories in the Unicode character database (Sm, Sc, Sk, or So).

and I think that this is intended to be the normative definition.
SRFI-14 then proceeds to list the symbols of ASCII and Latin-1, but I
interpret that as non-normative, to save the reader the trouble of
consulting Unicode.  IMHO, anyway.

   Thanks!
     Mark



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
  2012-02-02 19:18   ` Mike Gran
  2012-02-02 19:48     ` Mark H Weaver
@ 2012-02-02 20:07     ` David Kastrup
  1 sibling, 0 replies; 5+ messages in thread
From: David Kastrup @ 2012-02-02 20:07 UTC (permalink / raw)
  To: guile-devel

Mike Gran <spk121@yahoo.com> writes:

>> Thanks for the Unicode 6.1 update!  Now, however:
>> 
>>   FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>> 
>> Would you be willing to investigate?
>
> Looks like Unicode 6.1 has recategorized some of the symbols, including
> a few in Latin-1.
>  
> "§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other

Utter nonsense.

> "¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other

Makes sense.

> It seems that the correct response would be just to change
> the Latin-1 test cases.

Differing from the standard would be asking for even more trouble.  I
have no idea what sense declaring § as punctuation makes (it is not used
in that manner), but at least for ¶, there is some motivation.

-- 
David Kastrup




^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2012-02-02 20:07 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <E1Rsnjh-0000kv-9A@vcs.savannah.gnu.org>
2012-02-02 18:04 ` GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845 Mark H Weaver
2012-02-02 18:52   ` Mike Gran
2012-02-02 19:18   ` Mike Gran
2012-02-02 19:48     ` Mark H Weaver
2012-02-02 20:07     ` David Kastrup

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).