A new approach to Stepper and Stream materialization #70

szeiger · 2016-04-18T17:50:00Z

Getting the most efficient Stepper implementation requires knowledge
of the static type of a collection. This problem could be solved in the
most elegant way by integrating Steppers into the collections framework
as a replacement for Iterators (i.e. every collection gets a stepper
method and iterator delegates to stepper by default).

But this is at odds with the handling of specialized primitive Steppers
in the current implementation. If we want stepper to be an instance
method instead of an extension method, there needs to be a single such
method for all specialized Steppers. The fundamental change in this
new implementation is to encode the translation from element type to
Stepper type (including widening conversions) as a functional dependency
via the new StepperShape trait.
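
To illustrate the functional-dependency idea, here is a minimal sketch. It is not the code from this PR: the `Stepper` classes are simplified stand-ins, and `fromIterator`, `stepper` and the enclosing `StepperShapeSketch` object are hypothetical names. It only shows how a single method can return a different Stepper type depending on the element type, with widening from `Char` to `Int` and from `Float` to `Double`.

```scala
object StepperShapeSketch {
  // Simplified stand-ins for the real Stepper hierarchy (assumption).
  sealed trait Stepper[A]
  final class AnyStepper[A](val underlying: Iterator[A]) extends Stepper[A]
  final class IntStepper(val underlying: Iterator[Int]) extends Stepper[Int]
  final class DoubleStepper(val underlying: Iterator[Double]) extends Stepper[Double]

  // T is the element type, S the Stepper type. The implicit instance that is
  // found for T determines S, which is what makes this a functional dependency.
  trait StepperShape[T, S] {
    def fromIterator(it: Iterator[T]): S
  }

  // Fallback for all element types without a primitive Stepper.
  trait StepperShapeLowPriority {
    implicit def anyShape[T]: StepperShape[T, AnyStepper[T]] =
      new StepperShape[T, AnyStepper[T]] {
        def fromIterator(it: Iterator[T]): AnyStepper[T] = new AnyStepper(it)
      }
  }

  // Instances defined here take precedence over the inherited fallback.
  object StepperShape extends StepperShapeLowPriority {
    implicit val intShape: StepperShape[Int, IntStepper] =
      new StepperShape[Int, IntStepper] {
        def fromIterator(it: Iterator[Int]): IntStepper = new IntStepper(it)
      }
    // Widening conversion: Char elements are stepped through as Int.
    implicit val charShape: StepperShape[Char, IntStepper] =
      new StepperShape[Char, IntStepper] {
        def fromIterator(it: Iterator[Char]): IntStepper = new IntStepper(it.map(_.toInt))
      }
    implicit val doubleShape: StepperShape[Double, DoubleStepper] =
      new StepperShape[Double, DoubleStepper] {
        def fromIterator(it: Iterator[Double]): DoubleStepper = new DoubleStepper(it)
      }
    // Widening conversion: Float elements are stepped through as Double.
    implicit val floatShape: StepperShape[Float, DoubleStepper] =
      new StepperShape[Float, DoubleStepper] {
        def fromIterator(it: Iterator[Float]): DoubleStepper =
          new DoubleStepper(it.map(_.toDouble))
      }
  }

  // One method for every element type; the result type follows from T.
  def stepper[T, S](xs: Iterable[T])(implicit shape: StepperShape[T, S]): S =
    shape.fromIterator(xs.iterator)
}
```

With these definitions, `stepper(List(1, 2, 3))` is statically typed as `IntStepper`, `stepper(List('a', 'b'))` also yields an `IntStepper`, and `stepper(List("x"))` falls back to `AnyStepper[String]`.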

This greatly reduces the number of implicit methods and classes and
keeps all specialized versions of MakesStepper together. The default
base classes support all unboxing and widening conversions so that a
simple MakesStepper for a collection of boxed elements only needs to
handle the AnyStepper case.

keyStepper and valueStepper are handled in the same way.

StreamConverters use a separate StreamShape for the translation from
element type to BaseStream subtype which is compatible with
StepConverters and supports the same primitive types and widening
conversions.
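
The same pattern can be sketched for streams. Again this is an illustration under assumptions rather than the PR's implementation: `fromSeq`, `seqStream` as written here and the `StreamShapeSketch` object are hypothetical, and the real `StreamShape` may constrain its type parameters differently. The names `IntValue`, `CharValue`, `FloatValue`, `intStreamShape` and `doubleStreamShape` follow the diff excerpt quoted in the review below, but their bodies are guesses.

```scala
import java.util.Arrays
import java.util.stream.{DoubleStream, IntStream, Stream => JStream}

object StreamShapeSketch {
  // T is the element type, S the java.util.stream type chosen for it.
  trait StreamShape[T, S] {
    def fromSeq(xs: Seq[T]): S
  }

  // Fallback: any other element type becomes a boxed java.util.stream.Stream.
  trait StreamShapeLowPriority {
    implicit def anyStreamShape[T]: StreamShape[T, JStream[T]] =
      new StreamShape[T, JStream[T]] {
        def fromSeq(xs: Seq[T]): JStream[T] = {
          val b = JStream.builder[T]()
          xs.foreach(x => b.accept(x))
          b.build()
        }
      }
  }

  object StreamShape extends StreamShapeLowPriority {
    // Builds an IntStream shape from a (possibly widening) conversion to Int.
    private def intStreamShape[T](f: T => Int): StreamShape[T, IntStream] =
      new StreamShape[T, IntStream] {
        def fromSeq(xs: Seq[T]): IntStream = Arrays.stream(xs.map(f).toArray)
      }
    // Builds a DoubleStream shape from a (possibly widening) conversion to Double.
    private def doubleStreamShape[T](f: T => Double): StreamShape[T, DoubleStream] =
      new StreamShape[T, DoubleStream] {
        def fromSeq(xs: Seq[T]): DoubleStream = Arrays.stream(xs.map(f).toArray)
      }

    implicit val IntValue: StreamShape[Int, IntStream] = intStreamShape[Int](identity)
    implicit val CharValue: StreamShape[Char, IntStream] = intStreamShape[Char](_.toInt)
    implicit val DoubleValue: StreamShape[Double, DoubleStream] =
      doubleStreamShape[Double](identity)
    implicit val FloatValue: StreamShape[Float, DoubleStream] =
      doubleStreamShape[Float](_.toDouble)
  }

  // The stream subtype of the result is selected by the element type.
  def seqStream[T, S](xs: Seq[T])(implicit shape: StreamShape[T, S]): S =
    shape.fromSeq(xs)
}
```

Here `seqStream(Seq(1, 2, 3))` is typed as `IntStream`, so primitive operations such as `sum` are available without an explicit unboxing step, while `seqStream(Seq("x", "y"))` is a `Stream[String]`.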

- `WrappedArray` now produces the same primitive, non-boxing `Stepper` that the underlying `Array` would.
- Plus some improvements to the unit tests and benchmarks.

An interesting observation from the benchmarks: `IntStream.sum` is horribly slow when used on a `seqStream` built from an `IntStepper`. It is still slow when used on a `seqStream` built directly from an `Array` with Java’s own stream support. Using a `while` loop on an `IntIterator` produced by the stream beats both hands down and has the same performance independent of the stream source.
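
For readers who want to see the two summation styles side by side, here is a minimal sketch, assuming a plain `Array[Int]` source and Java's own `Arrays.stream`. The object and method names are made up for illustration and this is not the benchmark code. It shows only the shape of the comparison and makes no claim about timings.

```scala
import java.util.Arrays

object SumSketch {
  // Summation through the stream's own terminal operation.
  def sumViaStream(a: Array[Int]): Int =
    Arrays.stream(a).sum()

  // Summation with a while loop over the primitive iterator
  // (java.util.PrimitiveIterator.OfInt) obtained from the same kind of stream.
  def sumViaIterator(a: Array[Int]): Int = {
    val it = Arrays.stream(a).iterator()
    var s = 0
    while (it.hasNext()) s += it.nextInt() // nextInt avoids boxing
    s
  }
}
```

Both methods return the same result for the same input; only the way the elements are consumed differs.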



szeiger · 2016-04-19T13:14:45Z

@Ichoran, do you have time to review this?

Ichoran · 2016-04-19T19:35:17Z

I can on Wednesday morning.

Ichoran · 2016-04-20T03:35:13Z

src/main/scala/scala/compat/java8/StreamConverters.scala

+  implicit val CharValue    = intStreamShape[Char]
+  implicit val FloatValue   = doubleStreamShape[Float]
+}
+trait StreamShapeLowPrio {


Can we spell out "Priority" here?

Ichoran · 2016-04-20T03:58:20Z

Partway through the review, but I'm out of time for right now. I'll try to finish later tonight or, failing that, (my) Wednesday morning.


szeiger · 2016-04-20T17:25:26Z

Updated based on review comments.

Ichoran · 2016-04-20T18:18:42Z

src/main/scala/scala/compat/java8/StreamConverters.scala

+}
+
+trait PrimitiveStreamUnboxer[A, S] {
+  def apply(boxed: Stream[A]): S


I'm not sure apply is the best choice for a method name, since it makes it impossible (with that signature) to have something that is both a function and a PrimitiveStreamUnboxer. If it's for internal use only, just seal it and it's fine. Otherwise I would pick something like streamUnbox to mirror streamAccumulate above.

szeiger · 2016-04-21

This is from your original PR; it just got moved around. I haven't really looked into this area of the codebase yet after the initial review.
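
The naming concern can be sketched as follows. This is an illustration only: `CountingUnboxer` is a hypothetical object, and the trait is a simplified copy of the one in the diff with the method renamed to `streamUnbox` as suggested. With the method called `apply`, the object below could not also be a `Function1` returning `Long`, because both `apply` methods would take the same argument and differ only in result type.

```scala
import java.util.function.ToIntFunction
import java.util.stream.{IntStream, Stream => JStream}

object UnboxerNamingSketch {
  // Same shape as the trait in the diff, but with a dedicated method name.
  trait PrimitiveStreamUnboxer[A, S] {
    def streamUnbox(boxed: JStream[A]): S
  }

  // Hypothetical example: an unboxer that is also a function counting elements.
  object CountingUnboxer
      extends PrimitiveStreamUnboxer[Integer, IntStream]
      with Function1[JStream[Integer], Long] {

    // Unboxes a stream of java.lang.Integer into a primitive IntStream.
    def streamUnbox(boxed: JStream[Integer]): IntStream =
      boxed.mapToInt(new ToIntFunction[Integer] {
        def applyAsInt(i: Integer): Int = i.intValue
      })

    // Function1's apply is free to mean something else.
    def apply(boxed: JStream[Integer]): Long = boxed.count()
  }
}
```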

Ichoran · 2016-04-20T18:35:09Z

Aside from the few extra comments I left, this looks great! Definitely a more compact design.

Have you run the benchmarks again to see if performance stays comparable (especially for small collections)? There's always a concern that even though the extra abstraction is, logically, entirely removable, in practice it will cause a performance hit.

szeiger · 2016-04-21T14:02:48Z

I ran a subset of the benchmark suite (fast summation on ArrayBuffer, Array and Vector). My original PR lost a few percent of performance in some cases but I couldn't pinpoint the reason. I saw bigger losses for the large test cases where the time spent constructing the Stepper should have less of an impact.

Measurements on the updated PR aren't fully conclusive, either. The Vector test with 10 elements is still a bit slower, but the one with ArrayBuffer is faster. I'd have to run more comprehensive tests on an idle machine, but so far it looks like this doesn't have a negative impact on performance compared to the old version (before this PR).

As the next step I want to try removing the specialized Steppers for boxed collections. I saw a few percent of performance degradation when I tried this with the old version. We could remove quite a bit of code by always going through AnyStepper in these cases but if the manually specialized versions turn out to still be measurably faster, they may be worth keeping.

szeiger added 3 commits April 13, 2016 20:15

Special-case primitive WrappedArray types when creating Streams

c022190

This was already done for `Array` in the same way. Streams produced by `Arrays.stream` can be faster than going through a `Stepper`.

Ichoran reviewed Apr 20, 2016

Improved StepperShape

1590004

This does away with the default implementations of `MakesStepper` et al. and moves the unboxing logic into `StepperShape` for simpler implementations with faster dispatch.

Ichoran reviewed Apr 20, 2016

szeiger merged commit fd8a54a into scala:master Jun 30, 2016
