* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
[not found] <E1Rsnjh-0000kv-9A@vcs.savannah.gnu.org>
@ 2012-02-02 18:04 ` Mark H Weaver
2012-02-02 18:52 ` Mike Gran
2012-02-02 19:18 ` Mike Gran
0 siblings, 2 replies; 5+ messages in thread
From: Mark H Weaver @ 2012-02-02 18:04 UTC (permalink / raw)
To: Mike Gran; +Cc: guile-devel
Hi Mike,
Thanks for the Unicode 6.1 update! Now, however:
FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
Would you be willing to investigate?
Many thanks!
Mark
"Mike Gran" <spk121-/E1597aS9LQAvxtiuMwx3w@public.gmane.org> writes:
> commit bf8d845468fa71debf45e91a4e40f4b219dab4b0
> Author: Mike Gran <spk121-/E1597aS9LQAvxtiuMwx3w@public.gmane.org>
> Date: Wed Feb 1 19:51:35 2012 -0800
>
> Update srfi-14 character sets to Unicode 6.1
>
> * libguile/srfi-14.i.c (cs_lower_case_ranges, cs_lower_case)
> (cs_upper_case_ranges, cs_upper_case, cs_letter_ranges, cs_letter)
> (cs_digit_ranges, cs_digit, cs_letter_plus_digit_ranges, cs_letter_plus_digit)
> (cs_graphic_ranges, cs_graphic, cs_printing_ranges, cs_printing)
> (cs_punctuation_ranges, cs_punctuation, cs_symbol_ranges, cs_symbol)
> (cs_designated_ranges, cs_designated): modified
> * doc/ref/api-data.texi: modified
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
2012-02-02 18:04 ` GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845 Mark H Weaver
@ 2012-02-02 18:52 ` Mike Gran
2012-02-02 19:18 ` Mike Gran
1 sibling, 0 replies; 5+ messages in thread
From: Mike Gran @ 2012-02-02 18:52 UTC (permalink / raw)
To: Mark H Weaver, Mike Gran; +Cc: guile-devel@gnu.org
Hi Mark-
> Thanks for the Unicode 6.1 update! Now, however:
>
> FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>
> Would you be willing to investigate?
Strange. I'll check it out today.
-Mike
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
2012-02-02 18:04 ` GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845 Mark H Weaver
2012-02-02 18:52 ` Mike Gran
@ 2012-02-02 19:18 ` Mike Gran
2012-02-02 19:48 ` Mark H Weaver
2012-02-02 20:07 ` David Kastrup
1 sibling, 2 replies; 5+ messages in thread
From: Mike Gran @ 2012-02-02 19:18 UTC (permalink / raw)
To: Mark H Weaver, Mike Gran; +Cc: guile-devel@gnu.org
> Hi Mike,
>
> Thanks for the Unicode 6.1 update! Now, however:
>
> FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>
> Would you be willing to investigate?
Looks like Unicode 6.1 has recategorized some of the symbols, including
a few in Latin-1.
"§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other
"¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other
It seems that the correct response would be just to change
the Latin-1 test cases.
A wrinkle, though, is that in SRFI-14, they call out "§" and "¶"
as symbols. But my interpretation of the text in SRFI-14 is that
they intended to follow Unicode's categorization.
http://srfi.schemers.org/srfi-14/srfi-14.html
WDYT?
Thanks,
Mike
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
2012-02-02 19:18 ` Mike Gran
@ 2012-02-02 19:48 ` Mark H Weaver
2012-02-02 20:07 ` David Kastrup
1 sibling, 0 replies; 5+ messages in thread
From: Mark H Weaver @ 2012-02-02 19:48 UTC (permalink / raw)
To: Mike Gran; +Cc: guile-devel
Mike Gran <spk121@yahoo.com> writes:
>> Thanks for the Unicode 6.1 update! Now, however:
>>
>> FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>>
>> Would you be willing to investigate?
>
> Looks like Unicode 6.1 has recategorized some of the symbols, including
> a few in Latin-1.
>
> "§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other
> "¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other
>
> It seems that the correct response would be just to change
> the Latin-1 test cases.
>
> A wrinkle, though, is that in SRFI-14, they call out "§" and "¶"
> as symbols. But my interpretation of the text in SRFI-14 is that
> they intended to follow Unicode's categorization.
Agreed.
> http://srfi.schemers.org/srfi-14/srfi-14.html
SRFI-14 states:
char-set:symbol
In Unicode, a symbol is any character that has one of the symbol
categories in the Unicode character database (Sm, Sc, Sk, or So).
and I think that this is intended to be the normative definition.
SRFI-14 then proceeds to list the symbols of ASCII and Latin-1, but I
interpret that as non-normative, to save the reader the trouble of
consulting Unicode. IMHO, anyway.
Thanks!
Mark
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845
2012-02-02 19:18 ` Mike Gran
2012-02-02 19:48 ` Mark H Weaver
@ 2012-02-02 20:07 ` David Kastrup
1 sibling, 0 replies; 5+ messages in thread
From: David Kastrup @ 2012-02-02 20:07 UTC (permalink / raw)
To: guile-devel
Mike Gran <spk121@yahoo.com> writes:
>> Thanks for the Unicode 6.1 update! Now, however:
>>
>> FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>>
>> Would you be willing to investigate?
>
> Looks like Unicode 6.1 has recategorized some of the symbols, including
> a few in Latin-1.
>
> "§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other
Utter nonsense.
> "¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other
Makes sense.
> It seems that the correct response would be just to change
> the Latin-1 test cases.
Differing from the standard would be asking for even more trouble. I
have no idea what sense declaring § as punctuation makes (it is not used
in that manner), but at least for ¶, there is some motivation.
--
David Kastrup
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2012-02-02 20:07 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <E1Rsnjh-0000kv-9A@vcs.savannah.gnu.org>
2012-02-02 18:04 ` GNU Guile branch, stable-2.0, updated. v2.0.5-5-gbf8d845 Mark H Weaver
2012-02-02 18:52 ` Mike Gran
2012-02-02 19:18 ` Mike Gran
2012-02-02 19:48 ` Mark H Weaver
2012-02-02 20:07 ` David Kastrup
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).