unofficial mirror of bug-guile@gnu.org 
 help / color / mirror / Atom feed
* bug#19478: [PATCH] Improve SXPath documentation
@ 2014-12-31 16:45 rekado
  2015-03-04  9:23 ` Ludovic Courtès
  0 siblings, 1 reply; 6+ messages in thread
From: rekado @ 2014-12-31 16:45 UTC (permalink / raw)
  To: 19478

[-- Attachment #1: Type: text/plain, Size: 1918 bytes --]

Hi,

the SXPath documentation in the Guile manual is rather sparse.  To
figure out how to use SXPath functions I had to look up the source code
at module/sxml/upstream/SXPath-old.scm.  When I did, I noticed that
there are lots of useful comments that really should be part of the
documentation.

Attached is a patch that takes the comments from the sources and adds
them to the Texinfo sources.  I chose to rewrite a few comments to make
them a little clearer and added some Texinfo markup, but most of the
documentation is unchanged from Oleg's comments in the source.

While these changes don't result in great SXML documentation on par
with the rest of the Guile documentation, I do think they make the
SXPath section a lot more useful.

This is the first time I wrote Texinfo documentation (it's not even
close to being the obstacle to contributing that some people on another
mailing list make it out to be), so I'm not sure I chose the most
appropriate markup in all cases.

There is also one instance where I think I did the right thing but the
results are wrong: I added a link to an example further down the page
but in my compiled version of the manual I end up far from the anchor
when I follow the link.

This is the section containing the reference:

    Similarly to XPath, SXPath defines full and abbreviated notations for
    location paths.  In both cases, the abbreviated notation can be
    mechanically expanded into the full form by simple rewriting rules.  In
    case of SXPath the corresponding rules are given in the documentation of
    the @code{sxpath} procedure.  @xref{sxpath-procedure-docs,,SXPath
    procedure documentation}.
    ...

And here's the anchor:

    @anchor{sxpath-procedure-docs}
    @deffn {Scheme Procedure} sxpath path
    Evaluate an abbreviated SXPath.
    ...

I would appreciate it if you could take a look at my changes and suggest
improvements.

Cheers,
rekado


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: 0001-doc-Add-SXPath-documentation-from-sources.patch --]
[-- Type: text/x-patch, Size: 13980 bytes --]

From 1fc011f3c8d25c3a14db2d7f5e9ecb6627bc9102 Mon Sep 17 00:00:00 2001
From: rekado <rekado@elephly.net>
Date: Wed, 31 Dec 2014 16:48:32 +0100
Subject: [PATCH] doc: Add SXPath documentation from sources

* doc/ref/sxml.texi (SXPath): Add procedure documentation from sources.
---
 doc/ref/sxml.texi | 295 +++++++++++++++++++++++++++++++++++++++++++++++-------
 1 file changed, 261 insertions(+), 34 deletions(-)

diff --git a/doc/ref/sxml.texi b/doc/ref/sxml.texi
index 75867f3..2f9cd8d 100644
--- a/doc/ref/sxml.texi
+++ b/doc/ref/sxml.texi
@@ -250,8 +250,8 @@ internal and external parsed entities, user-controlled handling of
 whitespace, and validation.  This module therefore is intended to be a
 framework, a set of ``Lego blocks'' you can use to build a parser
 following any discipline and performing validation to any degree.  As an
-example of the parser construction, this file includes a semi-validating
-SXML parser.
+example of the parser construction, the source file includes a
+semi-validating SXML parser.
 
 SSAX has a ``sequential'' feel of SAX yet a ``functional style'' of DOM.
 Like a SAX parser, the framework scans the document only once and
@@ -725,95 +725,322 @@ location path is a relative path applied to the root node.
 Similarly to XPath, SXPath defines full and abbreviated notations for
 location paths.  In both cases, the abbreviated notation can be
 mechanically expanded into the full form by simple rewriting rules.  In
-case of SXPath the corresponding rules are given as comments to a sxpath
-function, below.  The regression test suite at the end of this file shows
-a representative sample of SXPaths in both notations, juxtaposed with
-the corresponding XPath expressions.  Most of the samples are borrowed
+case of SXPath the corresponding rules are given in the documentation of
+the @code{sxpath} procedure.  @xref{sxpath-procedure-docs,,SXPath
+procedure documentation}.
+
+The regression test suite at the end of the file SXPATH-old.scm shows a
+representative sample of SXPaths in both notations, juxtaposed with the
+corresponding XPath expressions.  Most of the samples are borrowed
 literally from the XPath specification, while the others are adjusted
-for our running example, tree1.
+for our running example, @code{tree1}.
+
+
+@subsubsection Basic converters and applicators
+
+A converter is a function mapping a nodeset (or a single node) to another
+nodeset.  Its type can be represented like this:
+
+@smallexample
+  type Converter = Node|Nodeset -> Nodeset
+@end smallexample
+
+A converter can also play the role of a predicate: in that case, if a
+converter, applied to a node or a nodeset, yields a non-empty nodeset,
+the converter-predicate is deemed satisfied.  Likewise, an empty nodeset
+is equivalent to @code{#f} in denoting failure.
 
-@subsubsection Usage
 @deffn {Scheme Procedure} nodeset? x
+Return @code{#t} if @var{x} is a nodeset.
 @end deffn
 
 @deffn {Scheme Procedure} node-typeof? crit
+This function implements a 'Node test' as defined in Sec. 2.3 of XPath
+document.  A node test is one of the components of a location step.  It
+is also a converter-predicate in SXPath.
+
+The function @code{node-typeof?} takes a type criterion and returns a
+function, which, when applied to a node, will tell if the node satisfies
+the test.
+
+The criterion @var{crit} is a symbol, one of the following:
+
+@table @code
+@item id
+tests if the node has the right name (id)
+
+@item @@
+tests if the node is an <attributes-coll>
+
+@item *
+tests if the node is an <Element>
+
+@item *text*
+tests if the node is a text node
+
+@item *PI*
+tests if the node is a PI (processing instruction) node
+
+@item *any*
+@code{#t} for any type of node
+@end table
 @end deffn
 
 @deffn {Scheme Procedure} node-eq? other
+A curried equivalence converter predicate that takes a node @var{other}
+and returns a function that takes another node.  The two nodes are
+compared using @code{eq?}.
 @end deffn
 
 @deffn {Scheme Procedure} node-equal? other
+A curried equivalence converter predicate that takes a node @var{other}
+and returns a function that takes another node.  The two nodes are
+compared using @code{equal?}.
 @end deffn
 
 @deffn {Scheme Procedure} node-pos n
+Select the @var{n}'th element of a nodeset and return as a singular
+nodeset.  If the @var{n}'th element does not exist, return an empty
+nodeset.  If @var{n} is a negative number the node is picked from the
+tail of the list.
+
+@example
+((node-pos 1) nodeset)  ; return the the head of the nodeset (if exists)
+((node-pos 2) nodeset)  ; return the node after that (if exists)
+((node-pos -1) nodeset) ; selects the last node of a non-empty nodeset
+((node-pos -2) nodeset) ; selects the last but one node, if exists.
+@end example
 @end deffn
 
 @deffn {Scheme Procedure} filter pred?
-@verbatim 
- -- Scheme Procedure: filter pred list
-     Return all the elements of 2nd arg LIST that satisfy predicate
-     PRED.  The list is not disordered - elements that appear in the
-     result list occur in the same order as they occur in the argument
-     list.  The returned list may share a common tail with the argument
-     list.  The dynamic order in which the various applications of pred
-     are made is not specified.
-
-          (filter even? '(0 7 8 8 43 -4)) => (0 8 8 -4)
-
- 
-@end verbatim
+A filter applicator, which introduces a filtering context.  The argument
+converter @var{pred?} is considered a predicate, with either @code{#f}
+or @code{nil} meaning failure.
 @end deffn
 
 @deffn {Scheme Procedure} take-until pred?
+@smallexample
+  take-until:: Converter -> Converter, or
+  take-until:: Pred -> Node|Nodeset -> Nodeset
+@end smallexample
+
+Given a converter-predicate @var{pred?} and a nodeset, apply the
+predicate to each element of the nodeset, until the predicate yields
+anything but @code{#f} or @code{nil}.  Return the elements of the input
+nodeset that have been processed until that moment (that is, which fail
+the predicate).
+
+@code{take-until} is a variation of the @code{filter} above:
+@code{take-until} passes elements of an ordered input set up to (but not
+including) the first element that satisfies the predicate.  The nodeset
+returned by @code{((take-until (not pred)) nset)} is a subset -- to be
+more precise, a prefix -- of the nodeset returned by @code{((filter
+pred) nset)}.
 @end deffn
 
 @deffn {Scheme Procedure} take-after pred?
+@smallexample
+  take-after:: Converter -> Converter, or
+  take-after:: Pred -> Node|Nodeset -> Nodeset
+@end smallexample
+
+Given a converter-predicate @var{pred?} and a nodeset, apply the
+predicate to each element of the nodeset, until the predicate yields
+anything but @code{#f} or @code{nil}.  Return the elements of the input
+nodeset that have not been processed: that is, return the elements of
+the input nodeset that follow the first element that satisfied the
+predicate.
+
+@code{take-after} along with @code{take-until} partition an input
+nodeset into three parts: the first element that satisfies a predicate,
+all preceding elements and all following elements.
 @end deffn
 
 @deffn {Scheme Procedure} map-union proc lst
+Apply @var{proc} to each element of @var{lst} and return the list of results.
+If @var{proc} returns a nodeset, splice it into the result
+
+From another point of view, @code{map-union} is a function
+@code{Converter->Converter}, which places an argument-converter in a joining
+context.
 @end deffn
 
 @deffn {Scheme Procedure} node-reverse node-or-nodeset
+@smallexample
+  node-reverse :: Converter, or
+  node-reverse:: Node|Nodeset -> Nodeset
+@end smallexample
+
+Reverses the order of nodes in the nodeset.  This basic converter is
+needed to implement a reverse document order (see the XPath
+Recommendation).
 @end deffn
 
 @deffn {Scheme Procedure} node-trace title
+@smallexample
+  node-trace:: String -> Converter
+@end smallexample
+
+@code{(node-trace title)} is an identity converter.  In addition it
+prints out the node or nodeset it is applied to, prefixed with the
+@var{title}.  This converter is very useful for debugging.
 @end deffn
 
+@subsubsection Converter combinators
+
+Combinators are higher-order functions that transmogrify a converter or
+glue a sequence of converters into a single, non-trivial converter.  The
+goal is to arrive at converters that correspond to XPath location paths.
+
+From a different point of view, a combinator is a fixed, named
+@emph{pattern} of applying converters.  Given below is a complete set of
+such patterns that together implement XPath location path specification.
+As it turns out, all these combinators can be built from a small number
+of basic blocks: regular functional composition, @code{map-union} and
+@code{filter} applicators, and the nodeset union.
+
 @deffn {Scheme Procedure} select-kids test-pred?
+@code{select-kids} takes a converter (or a predicate) as an argument and
+returns another converter.  The resulting converter applied to a nodeset
+returns an ordered subset of its children that satisfy the predicate
+@var{test-pred?}.
 @end deffn
 
 @deffn {Scheme Procedure} node-self pred?
-@verbatim 
- -- Scheme Procedure: filter pred list
-     Return all the elements of 2nd arg LIST that satisfy predicate
-     PRED.  The list is not disordered - elements that appear in the
-     result list occur in the same order as they occur in the argument
-     list.  The returned list may share a common tail with the argument
-     list.  The dynamic order in which the various applications of pred
-     are made is not specified.
-
-          (filter even? '(0 7 8 8 43 -4)) => (0 8 8 -4)
-
- 
-@end verbatim
+Similar to @code{select-kids} except that the predicate @var{pred?} is
+applied to the node itself rather than to its children.  The resulting
+nodeset will contain either one component, or will be empty if the node
+failed the predicate.
 @end deffn
 
 @deffn {Scheme Procedure} node-join . selectors
+@smallexample
+  node-join:: [LocPath] -> Node|Nodeset -> Nodeset, or
+  node-join:: [Converter] -> Converter
+@end smallexample
+
+Join the sequence of location steps or paths as described above.
 @end deffn
 
 @deffn {Scheme Procedure} node-reduce . converters
+@smallexample
+  node-reduce:: [LocPath] -> Node|Nodeset -> Nodeset, or
+  node-reduce:: [Converter] -> Converter
+@end smallexample
+
+A regular functional composition of converters.  From a different point
+of view, @code{((apply node-reduce converters) nodeset)} is equivalent
+to @code{(foldl apply nodeset converters)}, i.e., folding, or reducing,
+a list of converters with the nodeset as a seed.
 @end deffn
 
 @deffn {Scheme Procedure} node-or . converters
+@smallexample
+  node-or:: [Converter] -> Converter
+@end smallexample
+
+This combinator applies all converters to a given node and produces the
+union of their results.  This combinator corresponds to a union
+(@code{|} operation) for XPath location paths.
 @end deffn
 
 @deffn {Scheme Procedure} node-closure test-pred?
+@smallexample
+  node-closure:: Converter -> Converter
+@end smallexample
+
+Select all @emph{descendants} of a node that satisfy a
+converter-predicate @var{test-pred?}.  This combinator is similar to
+@code{select-kids} but applies to grand... children as well.  This
+combinator implements the @code{descendant::} XPath axis.  Conceptually,
+this combinator can be expressed as
+
+@smallexample
+(define (node-closure f)
+     (node-or
+       (select-kids f)
+       (node-reduce (select-kids (node-typeof? '*)) (node-closure f))))
+@end smallexample
+
+This definition, as written, looks somewhat like a fixpoint, and it will
+run forever.  It is obvious however that sooner or later
+@code{(select-kids (node-typeof? '*))} will return an empty nodeset.  At
+this point further iterations will no longer affect the result and can
+be stopped.
 @end deffn
 
 @deffn {Scheme Procedure} node-parent rootnode
+@smallexample
+  node-parent:: RootNode -> Converter
+@end smallexample
+
+@code{(node-parent rootnode)} yields a converter that returns a parent
+of a node it is applied to.  If applied to a nodeset, it returns the
+list of parents of nodes in the nodeset.  The @var{rootnode} does not
+have to be the root node of the whole SXML tree -- it may be a root node
+of a branch of interest.
+
+Given the notation of Philip Wadler's paper on semantics of XSLT,
+
+@verbatim
+  parent(x) = { y | y=subnode*(root), x=subnode(y) }
+@end verbatim
+
+Therefore, @code{node-parent} is not the fundamental converter: it can
+be expressed through the existing ones.  Yet @code{node-parent} is a
+rather convenient converter.  It corresponds to a @code{parent::} axis
+of SXPath.  Note that the @code{parent::} axis can be used with an
+attribute node as well.
 @end deffn
 
+@anchor{sxpath-procedure-docs}
 @deffn {Scheme Procedure} sxpath path
+Evaluate an abbreviated SXPath.
+
+@smallexample
+  sxpath:: AbbrPath -> Converter, or
+  sxpath:: AbbrPath -> Node|Nodeset -> Nodeset
+@end smallexample
+
+@var{path} is a list.  It is translated to the full SXPath according to
+the following rewriting rules:
+
+@example
+(sxpath '())
+@result{} (node-join)
+
+(sxpath '(path-component ...))
+@result{} (node-join (sxpath1 path-component) (sxpath '(...)))
+
+(sxpath1 '//)
+@result{} (node-or
+   (node-self (node-typeof? '*any*))
+   (node-closure (node-typeof? '*any*)))
+
+(sxpath1 '(equal? x))
+@result{} (select-kids (node-equal? x))
+
+(sxpath1 '(eq? x))
+@result{} (select-kids (node-eq? x))
+
+(sxpath1 ?symbol)
+@result{} (select-kids (node-typeof? ?symbol)
+
+(sxpath1 procedure)
+@result{} procedure
+
+(sxpath1 '(?symbol ...))
+@result{} (sxpath1 '((?symbol) ...))
+
+(sxpath1 '(path reducer ...))
+@result{} (node-reduce (sxpath path) (sxpathr reducer) ...)
+
+(sxpathr number)
+@result{} (node-pos number)
+
+(sxpathr path-filter)
+@result{} (filter (sxpath path-filter))
+@end example
 @end deffn
 
 @node sxml ssax input-parse
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 6+ messages in thread

* bug#19478: [PATCH] Improve SXPath documentation
  2014-12-31 16:45 bug#19478: [PATCH] Improve SXPath documentation rekado
@ 2015-03-04  9:23 ` Ludovic Courtès
  2015-08-30  9:27   ` Ricardo Wurmus
  0 siblings, 1 reply; 6+ messages in thread
From: Ludovic Courtès @ 2015-03-04  9:23 UTC (permalink / raw)
  To: rekado; +Cc: 19478

rekado <rekado@elephly.net> skribis:

> Attached is a patch that takes the comments from the sources and adds
> them to the Texinfo sources.  I chose to rewrite a few comments to make
> them a little clearer and added some Texinfo markup, but most of the
> documentation is unchanged from Oleg's comments in the source.

Nice!

> This is the first time I wrote Texinfo documentation (it's not even
> close to being the obstacle to contributing that some people on another
> mailing list make it out to be)

Good to hear.  ;-)

> There is also one instance where I think I did the right thing but the
> results are wrong: I added a link to an example further down the page
> but in my compiled version of the manual I end up far from the anchor
> when I follow the link.
>
> This is the section containing the reference:
>
>     Similarly to XPath, SXPath defines full and abbreviated notations for
>     location paths.  In both cases, the abbreviated notation can be
>     mechanically expanded into the full form by simple rewriting rules.  In
>     case of SXPath the corresponding rules are given in the documentation of
>     the @code{sxpath} procedure.  @xref{sxpath-procedure-docs,,SXPath
>     procedure documentation}.
>     ...
>
> And here's the anchor:
>
>     @anchor{sxpath-procedure-docs}
>     @deffn {Scheme Procedure} sxpath path
>     Evaluate an abbreviated SXPath.
>     ...

I just tried and this seems to be a bug in the Emacs Info reader.  The
standalone Info reader brings you to the right place, and the HTML and
PDF outputs are OK.

Could you report it?

Some superficial comments follow.  I realize some of these may be
present in the original SXPath source, but it seems best to fix them
anyway.

BTW, I think we should add a sentence at the beginning of the “SXPath”
section saying that the material is taken from the SXPath source by Oleg
et al.

>  mechanically expanded into the full form by simple rewriting rules.  In
> -case of SXPath the corresponding rules are given as comments to a sxpath
> -function, below.  The regression test suite at the end of this file shows
> -a representative sample of SXPaths in both notations, juxtaposed with
> -the corresponding XPath expressions.  Most of the samples are borrowed
> +case of SXPath the corresponding rules are given in the documentation of

Missing “the” (“In the case of SXPath”.)

> +The regression test suite at the end of the file SXPATH-old.scm shows a

@file{SXPATH-old.scm}

> +representative sample of SXPaths in both notations, juxtaposed with the
> +corresponding XPath expressions.  Most of the samples are borrowed
>  literally from the XPath specification, while the others are adjusted
> -for our running example, tree1.
> +for our running example, @code{tree1}.

The issue was already there, but apparently ‘tree1’ is not used
elsewhere in the documentation.  Perhaps it’s best to remove it?

> +@subsubsection Basic converters and applicators

Capital C and capital A.

> +A converter is a function mapping a nodeset (or a single node) to another
> +nodeset.  Its type can be represented like this:
> +
> +@smallexample
> +  type Converter = Node|Nodeset -> Nodeset
> +@end smallexample

Rather @example (@smallexample means small typeface) and no space before
‘type’ (same for the other examples below.)

>  @deffn {Scheme Procedure} node-typeof? crit
> +This function implements a 'Node test' as defined in Sec. 2.3 of XPath
> +document.  A node test is one of the components of a location step.  It

Missing “the” (“of the XPath document”.)

> +@subsubsection Converter combinators

Capital C.

> +Combinators are higher-order functions that transmogrify a converter or
> +glue a sequence of converters into a single, non-trivial converter.  The
> +goal is to arrive at converters that correspond to XPath location paths.
> +
> +From a different point of view, a combinator is a fixed, named
> +@emph{pattern} of applying converters.  Given below is a complete set of

When introducing a term, use @dfn; so @dfn{pattern}.

> +@smallexample
> +(define (node-closure f)
> +     (node-or
> +       (select-kids f)
> +       (node-reduce (select-kids (node-typeof? '*)) (node-closure f))))

Align below the ‘e’ of ‘define’.

Could you send an updated patch?

Thanks!

Ludo’.





^ permalink raw reply	[flat|nested] 6+ messages in thread

* bug#19478: [PATCH] Improve SXPath documentation
  2015-03-04  9:23 ` Ludovic Courtès
@ 2015-08-30  9:27   ` Ricardo Wurmus
  2015-09-04 20:59     ` Ludovic Courtès
  0 siblings, 1 reply; 6+ messages in thread
From: Ricardo Wurmus @ 2015-08-30  9:27 UTC (permalink / raw)
  To: Ludovic Courtès; +Cc: 19478

[-- Attachment #1: Type: text/plain, Size: 3555 bytes --]

Hi Ludo,

my apologies for not responding to your comments for so long.

> Some superficial comments follow.  I realize some of these may be
> present in the original SXPath source, but it seems best to fix them
> anyway.
>
> BTW, I think we should add a sentence at the beginning of the “SXPath”
> section saying that the material is taken from the SXPath source by Oleg
> et al.
>
>>  mechanically expanded into the full form by simple rewriting rules.  In
>> -case of SXPath the corresponding rules are given as comments to a sxpath
>> -function, below.  The regression test suite at the end of this file shows
>> -a representative sample of SXPaths in both notations, juxtaposed with
>> -the corresponding XPath expressions.  Most of the samples are borrowed
>> +case of SXPath the corresponding rules are given in the documentation of
>
> Missing “the” (“In the case of SXPath”.)

Fixed.

>> +The regression test suite at the end of the file SXPATH-old.scm shows a
>
> @file{SXPATH-old.scm}

Ok.

>> +representative sample of SXPaths in both notations, juxtaposed with the
>> +corresponding XPath expressions.  Most of the samples are borrowed
>>  literally from the XPath specification, while the others are adjusted
>> -for our running example, tree1.
>> +for our running example, @code{tree1}.
>
> The issue was already there, but apparently ‘tree1’ is not used
> elsewhere in the documentation.  Perhaps it’s best to remove it?

I removed the last sentence as it doesn’t make any sense unless the
example is added to the documentation.  Maybe I’ll add additional
examples in a later patch.

>> +@subsubsection Basic converters and applicators
>
> Capital C and capital A.

Oh, I forgot that the default is American English.  I also changed the
other headings I added.

>> +A converter is a function mapping a nodeset (or a single node) to another
>> +nodeset.  Its type can be represented like this:
>> +
>> +@smallexample
>> +  type Converter = Node|Nodeset -> Nodeset
>> +@end smallexample
>
> Rather @example (@smallexample means small typeface) and no space before
> ‘type’ (same for the other examples below.)

Good to know.  I fixed all of these type summaries to use “@example” and
without indentation.

>>  @deffn {Scheme Procedure} node-typeof? crit
>> +This function implements a 'Node test' as defined in Sec. 2.3 of XPath
>> +document.  A node test is one of the components of a location step.  It
>
> Missing “the” (“of the XPath document”.)

Added.

>> +Combinators are higher-order functions that transmogrify a converter or
>> +glue a sequence of converters into a single, non-trivial converter.  The
>> +goal is to arrive at converters that correspond to XPath location paths.
>> +
>> +From a different point of view, a combinator is a fixed, named
>> +@emph{pattern} of applying converters.  Given below is a complete set of
>
> When introducing a term, use @dfn; so @dfn{pattern}.

Okay.

>> +@smallexample
>> +(define (node-closure f)
>> +     (node-or
>> +       (select-kids f)
>> +       (node-reduce (select-kids (node-typeof? '*)) (node-closure f))))
>
> Align below the ‘e’ of ‘define’.

Done.

> Could you send an updated patch?

Attached is an updated patch.

After all this time I noticed that even after this patch, the SXPath
documentation is still not clear enough.  I again found myself looking
up the sources to see actual examples.  In a later patch I’d like to add
general examples as well as some examples for specific functions, if
that’s okay.

~~ Ricardo


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: 0001-doc-Add-SXPath-documentation-from-sources.patch --]
[-- Type: text/x-patch, Size: 13929 bytes --]

From 302fd7dd9d293ca1e88a39c255704ccd173caed4 Mon Sep 17 00:00:00 2001
From: Ricardo Wurmus <rekado@elephly.net>
Date: Sun, 30 Aug 2015 10:58:42 +0200
Subject: [PATCH] doc: Add SXPath documentation from sources

* doc/ref/sxml.texi (SXPath): Add procedure documentation from sources.
---
 doc/ref/sxml.texi | 298 +++++++++++++++++++++++++++++++++++++++++++++++-------
 1 file changed, 263 insertions(+), 35 deletions(-)

diff --git a/doc/ref/sxml.texi b/doc/ref/sxml.texi
index 75867f3..75ebedc 100644
--- a/doc/ref/sxml.texi
+++ b/doc/ref/sxml.texi
@@ -250,8 +250,8 @@ internal and external parsed entities, user-controlled handling of
 whitespace, and validation.  This module therefore is intended to be a
 framework, a set of ``Lego blocks'' you can use to build a parser
 following any discipline and performing validation to any degree.  As an
-example of the parser construction, this file includes a semi-validating
-SXML parser.
+example of the parser construction, the source file includes a
+semi-validating SXML parser.
 
 SSAX has a ``sequential'' feel of SAX yet a ``functional style'' of DOM.
 Like a SAX parser, the framework scans the document only once and
@@ -725,95 +725,323 @@ location path is a relative path applied to the root node.
 Similarly to XPath, SXPath defines full and abbreviated notations for
 location paths.  In both cases, the abbreviated notation can be
 mechanically expanded into the full form by simple rewriting rules.  In
-case of SXPath the corresponding rules are given as comments to a sxpath
-function, below.  The regression test suite at the end of this file shows
-a representative sample of SXPaths in both notations, juxtaposed with
-the corresponding XPath expressions.  Most of the samples are borrowed
-literally from the XPath specification, while the others are adjusted
-for our running example, tree1.
+the case of SXPath the corresponding rules are given in the
+documentation of the @code{sxpath} procedure.
+@xref{sxpath-procedure-docs,,SXPath procedure documentation}.
+
+The regression test suite at the end of the file @file{SXPATH-old.scm}
+shows a representative sample of SXPaths in both notations, juxtaposed
+with the corresponding XPath expressions.  Most of the samples are
+borrowed literally from the XPath specification.
+
+Much of the following material is taken from the SXPath sources by Oleg
+Kiselyov et al.
+
+@subsubsection Basic Converters and Applicators
+
+A converter is a function mapping a nodeset (or a single node) to another
+nodeset.  Its type can be represented like this:
+
+@example
+type Converter = Node|Nodeset -> Nodeset
+@end example
+
+A converter can also play the role of a predicate: in that case, if a
+converter, applied to a node or a nodeset, yields a non-empty nodeset,
+the converter-predicate is deemed satisfied.  Likewise, an empty nodeset
+is equivalent to @code{#f} in denoting failure.
 
-@subsubsection Usage
 @deffn {Scheme Procedure} nodeset? x
+Return @code{#t} if @var{x} is a nodeset.
 @end deffn
 
 @deffn {Scheme Procedure} node-typeof? crit
+This function implements a 'Node test' as defined in Sec. 2.3 of the
+XPath document.  A node test is one of the components of a location
+step.  It is also a converter-predicate in SXPath.
+
+The function @code{node-typeof?} takes a type criterion and returns a
+function, which, when applied to a node, will tell if the node satisfies
+the test.
+
+The criterion @var{crit} is a symbol, one of the following:
+
+@table @code
+@item id
+tests if the node has the right name (id)
+
+@item @@
+tests if the node is an <attributes-coll>
+
+@item *
+tests if the node is an <Element>
+
+@item *text*
+tests if the node is a text node
+
+@item *PI*
+tests if the node is a PI (processing instruction) node
+
+@item *any*
+@code{#t} for any type of node
+@end table
 @end deffn
 
 @deffn {Scheme Procedure} node-eq? other
+A curried equivalence converter predicate that takes a node @var{other}
+and returns a function that takes another node.  The two nodes are
+compared using @code{eq?}.
 @end deffn
 
 @deffn {Scheme Procedure} node-equal? other
+A curried equivalence converter predicate that takes a node @var{other}
+and returns a function that takes another node.  The two nodes are
+compared using @code{equal?}.
 @end deffn
 
 @deffn {Scheme Procedure} node-pos n
+Select the @var{n}'th element of a nodeset and return as a singular
+nodeset.  If the @var{n}'th element does not exist, return an empty
+nodeset.  If @var{n} is a negative number the node is picked from the
+tail of the list.
+
+@example
+((node-pos 1) nodeset)  ; return the the head of the nodeset (if exists)
+((node-pos 2) nodeset)  ; return the node after that (if exists)
+((node-pos -1) nodeset) ; selects the last node of a non-empty nodeset
+((node-pos -2) nodeset) ; selects the last but one node, if exists.
+@end example
 @end deffn
 
 @deffn {Scheme Procedure} filter pred?
-@verbatim 
- -- Scheme Procedure: filter pred list
-     Return all the elements of 2nd arg LIST that satisfy predicate
-     PRED.  The list is not disordered - elements that appear in the
-     result list occur in the same order as they occur in the argument
-     list.  The returned list may share a common tail with the argument
-     list.  The dynamic order in which the various applications of pred
-     are made is not specified.
-
-          (filter even? '(0 7 8 8 43 -4)) => (0 8 8 -4)
-
- 
-@end verbatim
+A filter applicator, which introduces a filtering context.  The argument
+converter @var{pred?} is considered a predicate, with either @code{#f}
+or @code{nil} meaning failure.
 @end deffn
 
 @deffn {Scheme Procedure} take-until pred?
+@example
+take-until:: Converter -> Converter, or
+take-until:: Pred -> Node|Nodeset -> Nodeset
+@end example
+
+Given a converter-predicate @var{pred?} and a nodeset, apply the
+predicate to each element of the nodeset, until the predicate yields
+anything but @code{#f} or @code{nil}.  Return the elements of the input
+nodeset that have been processed until that moment (that is, which fail
+the predicate).
+
+@code{take-until} is a variation of the @code{filter} above:
+@code{take-until} passes elements of an ordered input set up to (but not
+including) the first element that satisfies the predicate.  The nodeset
+returned by @code{((take-until (not pred)) nset)} is a subset -- to be
+more precise, a prefix -- of the nodeset returned by @code{((filter
+pred) nset)}.
 @end deffn
 
 @deffn {Scheme Procedure} take-after pred?
+@example
+take-after:: Converter -> Converter, or
+take-after:: Pred -> Node|Nodeset -> Nodeset
+@end example
+
+Given a converter-predicate @var{pred?} and a nodeset, apply the
+predicate to each element of the nodeset, until the predicate yields
+anything but @code{#f} or @code{nil}.  Return the elements of the input
+nodeset that have not been processed: that is, return the elements of
+the input nodeset that follow the first element that satisfied the
+predicate.
+
+@code{take-after} along with @code{take-until} partition an input
+nodeset into three parts: the first element that satisfies a predicate,
+all preceding elements and all following elements.
 @end deffn
 
 @deffn {Scheme Procedure} map-union proc lst
+Apply @var{proc} to each element of @var{lst} and return the list of results.
+If @var{proc} returns a nodeset, splice it into the result
+
+From another point of view, @code{map-union} is a function
+@code{Converter->Converter}, which places an argument-converter in a joining
+context.
 @end deffn
 
 @deffn {Scheme Procedure} node-reverse node-or-nodeset
+@example
+node-reverse :: Converter, or
+node-reverse:: Node|Nodeset -> Nodeset
+@end example
+
+Reverses the order of nodes in the nodeset.  This basic converter is
+needed to implement a reverse document order (see the XPath
+Recommendation).
 @end deffn
 
 @deffn {Scheme Procedure} node-trace title
+@example
+node-trace:: String -> Converter
+@end example
+
+@code{(node-trace title)} is an identity converter.  In addition it
+prints out the node or nodeset it is applied to, prefixed with the
+@var{title}.  This converter is very useful for debugging.
 @end deffn
 
+@subsubsection Converter Combinators
+
+Combinators are higher-order functions that transmogrify a converter or
+glue a sequence of converters into a single, non-trivial converter.  The
+goal is to arrive at converters that correspond to XPath location paths.
+
+From a different point of view, a combinator is a fixed, named
+@dfn{pattern} of applying converters.  Given below is a complete set of
+such patterns that together implement XPath location path specification.
+As it turns out, all these combinators can be built from a small number
+of basic blocks: regular functional composition, @code{map-union} and
+@code{filter} applicators, and the nodeset union.
+
 @deffn {Scheme Procedure} select-kids test-pred?
+@code{select-kids} takes a converter (or a predicate) as an argument and
+returns another converter.  The resulting converter applied to a nodeset
+returns an ordered subset of its children that satisfy the predicate
+@var{test-pred?}.
 @end deffn
 
 @deffn {Scheme Procedure} node-self pred?
-@verbatim 
- -- Scheme Procedure: filter pred list
-     Return all the elements of 2nd arg LIST that satisfy predicate
-     PRED.  The list is not disordered - elements that appear in the
-     result list occur in the same order as they occur in the argument
-     list.  The returned list may share a common tail with the argument
-     list.  The dynamic order in which the various applications of pred
-     are made is not specified.
-
-          (filter even? '(0 7 8 8 43 -4)) => (0 8 8 -4)
-
- 
-@end verbatim
+Similar to @code{select-kids} except that the predicate @var{pred?} is
+applied to the node itself rather than to its children.  The resulting
+nodeset will contain either one component, or will be empty if the node
+failed the predicate.
 @end deffn
 
 @deffn {Scheme Procedure} node-join . selectors
+@example
+node-join:: [LocPath] -> Node|Nodeset -> Nodeset, or
+node-join:: [Converter] -> Converter
+@end example
+
+Join the sequence of location steps or paths as described above.
 @end deffn
 
 @deffn {Scheme Procedure} node-reduce . converters
+@example
+node-reduce:: [LocPath] -> Node|Nodeset -> Nodeset, or
+node-reduce:: [Converter] -> Converter
+@end example
+
+A regular functional composition of converters.  From a different point
+of view, @code{((apply node-reduce converters) nodeset)} is equivalent
+to @code{(foldl apply nodeset converters)}, i.e., folding, or reducing,
+a list of converters with the nodeset as a seed.
 @end deffn
 
 @deffn {Scheme Procedure} node-or . converters
+@example
+node-or:: [Converter] -> Converter
+@end example
+
+This combinator applies all converters to a given node and produces the
+union of their results.  This combinator corresponds to a union
+(@code{|} operation) for XPath location paths.
 @end deffn
 
 @deffn {Scheme Procedure} node-closure test-pred?
+@example
+node-closure:: Converter -> Converter
+@end example
+
+Select all @emph{descendants} of a node that satisfy a
+converter-predicate @var{test-pred?}.  This combinator is similar to
+@code{select-kids} but applies to grand... children as well.  This
+combinator implements the @code{descendant::} XPath axis.  Conceptually,
+this combinator can be expressed as
+
+@example
+(define (node-closure f)
+  (node-or
+    (select-kids f)
+    (node-reduce (select-kids (node-typeof? '*)) (node-closure f))))
+@end example
+
+This definition, as written, looks somewhat like a fixpoint, and it will
+run forever.  It is obvious however that sooner or later
+@code{(select-kids (node-typeof? '*))} will return an empty nodeset.  At
+this point further iterations will no longer affect the result and can
+be stopped.
 @end deffn
 
 @deffn {Scheme Procedure} node-parent rootnode
+@example
+node-parent:: RootNode -> Converter
+@end example
+
+@code{(node-parent rootnode)} yields a converter that returns a parent
+of a node it is applied to.  If applied to a nodeset, it returns the
+list of parents of nodes in the nodeset.  The @var{rootnode} does not
+have to be the root node of the whole SXML tree -- it may be a root node
+of a branch of interest.
+
+Given the notation of Philip Wadler's paper on semantics of XSLT,
+
+@verbatim
+  parent(x) = { y | y=subnode*(root), x=subnode(y) }
+@end verbatim
+
+Therefore, @code{node-parent} is not the fundamental converter: it can
+be expressed through the existing ones.  Yet @code{node-parent} is a
+rather convenient converter.  It corresponds to a @code{parent::} axis
+of SXPath.  Note that the @code{parent::} axis can be used with an
+attribute node as well.
 @end deffn
 
+@anchor{sxpath-procedure-docs}
 @deffn {Scheme Procedure} sxpath path
+Evaluate an abbreviated SXPath.
+
+@example
+sxpath:: AbbrPath -> Converter, or
+sxpath:: AbbrPath -> Node|Nodeset -> Nodeset
+@end example
+
+@var{path} is a list.  It is translated to the full SXPath according to
+the following rewriting rules:
+
+@example
+(sxpath '())
+@result{} (node-join)
+
+(sxpath '(path-component ...))
+@result{} (node-join (sxpath1 path-component) (sxpath '(...)))
+
+(sxpath1 '//)
+@result{} (node-or
+   (node-self (node-typeof? '*any*))
+   (node-closure (node-typeof? '*any*)))
+
+(sxpath1 '(equal? x))
+@result{} (select-kids (node-equal? x))
+
+(sxpath1 '(eq? x))
+@result{} (select-kids (node-eq? x))
+
+(sxpath1 ?symbol)
+@result{} (select-kids (node-typeof? ?symbol)
+
+(sxpath1 procedure)
+@result{} procedure
+
+(sxpath1 '(?symbol ...))
+@result{} (sxpath1 '((?symbol) ...))
+
+(sxpath1 '(path reducer ...))
+@result{} (node-reduce (sxpath path) (sxpathr reducer) ...)
+
+(sxpathr number)
+@result{} (node-pos number)
+
+(sxpathr path-filter)
+@result{} (filter (sxpath path-filter))
+@end example
 @end deffn
 
 @node sxml ssax input-parse
-- 
2.4.3


^ permalink raw reply related	[flat|nested] 6+ messages in thread

* bug#19478: [PATCH] Improve SXPath documentation
  2015-08-30  9:27   ` Ricardo Wurmus
@ 2015-09-04 20:59     ` Ludovic Courtès
  2015-09-05  9:52       ` Ricardo Wurmus
  0 siblings, 1 reply; 6+ messages in thread
From: Ludovic Courtès @ 2015-09-04 20:59 UTC (permalink / raw)
  To: Ricardo Wurmus; +Cc: 19478

Howdy,

Ricardo Wurmus <rekado@elephly.net> skribis:

> After all this time I noticed that even after this patch, the SXPath
> documentation is still not clear enough.  I again found myself looking
> up the sources to see actual examples.  In a later patch I’d like to add
> general examples as well as some examples for specific functions, if
> that’s okay.

Sure, these would be definitely welcome.

> From 302fd7dd9d293ca1e88a39c255704ccd173caed4 Mon Sep 17 00:00:00 2001
> From: Ricardo Wurmus <rekado@elephly.net>
> Date: Sun, 30 Aug 2015 10:58:42 +0200
> Subject: [PATCH] doc: Add SXPath documentation from sources
>
> * doc/ref/sxml.texi (SXPath): Add procedure documentation from sources.

Pushed.  I added a note at the top of the file that this is from
SXPath.scm, which is public-domain.

Should this bug be closed now?  Your call.

Thanks!

Ludo’.





^ permalink raw reply	[flat|nested] 6+ messages in thread

* bug#19478: [PATCH] Improve SXPath documentation
  2015-09-04 20:59     ` Ludovic Courtès
@ 2015-09-05  9:52       ` Ricardo Wurmus
  2015-09-05 16:53         ` Ludovic Courtès
  0 siblings, 1 reply; 6+ messages in thread
From: Ricardo Wurmus @ 2015-09-05  9:52 UTC (permalink / raw)
  To: Ludovic Courtès; +Cc: 19478

>> From 302fd7dd9d293ca1e88a39c255704ccd173caed4 Mon Sep 17 00:00:00 2001
>> From: Ricardo Wurmus <rekado@elephly.net>
>> Date: Sun, 30 Aug 2015 10:58:42 +0200
>> Subject: [PATCH] doc: Add SXPath documentation from sources
>>
>> * doc/ref/sxml.texi (SXPath): Add procedure documentation from sources.
>
> Pushed.  I added a note at the top of the file that this is from
> SXPath.scm, which is public-domain.

Actually, my patch included these lines near the end of the first
section:

> +Much of the following material is taken from the SXPath sources by Oleg
> +Kiselyov et al.

Putting a note at the top also seems appropriate.

> Should this bug be closed now?  Your call.

Yes, it can be closed now.  For future SXML documentation updates should
I just send patches to guile-devel or open a bug report?

~~ Ricardo






^ permalink raw reply	[flat|nested] 6+ messages in thread

* bug#19478: [PATCH] Improve SXPath documentation
  2015-09-05  9:52       ` Ricardo Wurmus
@ 2015-09-05 16:53         ` Ludovic Courtès
  0 siblings, 0 replies; 6+ messages in thread
From: Ludovic Courtès @ 2015-09-05 16:53 UTC (permalink / raw)
  To: Ricardo Wurmus; +Cc: 19478-done

Ricardo Wurmus <rekado@elephly.net> skribis:

>>> From 302fd7dd9d293ca1e88a39c255704ccd173caed4 Mon Sep 17 00:00:00 2001
>>> From: Ricardo Wurmus <rekado@elephly.net>
>>> Date: Sun, 30 Aug 2015 10:58:42 +0200
>>> Subject: [PATCH] doc: Add SXPath documentation from sources
>>>
>>> * doc/ref/sxml.texi (SXPath): Add procedure documentation from sources.
>>
>> Pushed.  I added a note at the top of the file that this is from
>> SXPath.scm, which is public-domain.
>
> Actually, my patch included these lines near the end of the first
> section:
>
>> +Much of the following material is taken from the SXPath sources by Oleg
>> +Kiselyov et al.
>
> Putting a note at the top also seems appropriate.

Yes, that’s what I thought; it makes it easier to see at a glance where
the material comes from and what its copyright status is.

>> Should this bug be closed now?  Your call.
>
> Yes, it can be closed now.  For future SXML documentation updates should
> I just send patches to guile-devel or open a bug report?

Either way is fine; it’s easier not to lose track of bugs, though.

Thanks!

Ludo’.





^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2015-09-05 16:53 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-12-31 16:45 bug#19478: [PATCH] Improve SXPath documentation rekado
2015-03-04  9:23 ` Ludovic Courtès
2015-08-30  9:27   ` Ricardo Wurmus
2015-09-04 20:59     ` Ludovic Courtès
2015-09-05  9:52       ` Ricardo Wurmus
2015-09-05 16:53         ` Ludovic Courtès

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).