> Date: Fri, 07 Jul 2017 12:47:35 +0300
> I timed the current implementation at 0.2 msec, in an optimized build;Trying to repeat this for several large buffers, I get much faster
performance: 0.04 micro-seconds per call. I guess something was wrong
with my original timing.
In any case, it's _fast_.