Skip to content

Add blog article about preventing version conflicts with versionScheme #1210

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Feb 17, 2021
Merged
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
127 changes: 127 additions & 0 deletions blog/_posts/2021-02-16-improving-scala-library-ecosystem.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,127 @@
---
layout: blog-detail
post-type: blog
by: Eugene Yokota, Julien Richard-Foy
title: Improving the Scala Library Ecosystem
---

One of the things that makes Scala powerful and fun to use is its library ecosystem — spanning across the Maven ecosystem for JVM, and npm for Scala.JS. We can use libraries to do operations like download files from the web, or we can use it to adopt different programming paradigms. The library ecosystem allows us to write more useful programs with less effort.

## What is a version conflict?

However, the library ecosystem is not without problems. A library that you pulled could depend on other libraries, and the transitive dependencies could cause version conflicts. Here's a quick example of a Scala project that uses Akka HTTP, a Postgres database, and JSON. Its build declares two library dependencies, `akka-http-circe` and `doobie-postgres-circe`:
Copy link
Member

@SethTisue SethTisue Feb 8, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Perhaps this should explicitly be framed as a general JVM problem, not a Scala-specific problem. As JVM vets that's ingrained in us, but not all readers are #JVM4Life.

Copy link
Member

@SethTisue SethTisue Feb 8, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, readers might like to know if the solution only works for Scala libraries, or whether Java libraries could use it too.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While you're right that this stems from the loose behavior of the dependency resolvers used in JVM, given we have Scala.js and Native, I don't think we should make it too JVM centric either.


~~~ scala
libraryDependencies ++= Seq(
"de.heikoseeberger" %% "akka-http-circe" % "1.26.0",
"org.tpolecat" %% "doobie-postgres-circe" % "0.10.0"
)
~~~

Note that both libraries use Circe for working with JSON.

sbt loads the project build, the project compiles successfully, but when you try to run it, you may get a `NoSuchMethodError` exception at run-time!

What happens here is that one of the transitive dependencies of `akka-http-circe` depends on `circe-core` version 0.11.1, and another transitive dependency of `doobie-postgres-circe` depends on `circe-core` version 0.13.0. Here's an excerpt of the output of sbt’s `dependencyTree` task:

~~~ text
[info] +-de.heikoseeberger:akka-http-circe_2.12:1.26.0 [S]
[info] | +-io.circe:circe-core_2.12:0.11.1 (evicted by: 0.13.0)
[info] |
[info] +-org.tpolecat:doobie-postgres-circe_2.12:0.10.0 [S]
[info] +-io.circe:circe-core_2.12:0.13.0 [S]
~~~

Unfortunately, these two versions of Circe are not binary compatible. Such version conflicts are inevitable in a dependency graph of any practical size. The problem is that we carefully program our code using a statically checked type system, but when it comes to production code we accept swapping out the JAR file with something that our dependency resolver like Coursier and Apache Ivy selected on a whim.

Scala Center, Alexandre Archambault (author of Coursier), and I (Eugene Yokota) have been working on a solution to improve this situation so we can be confident about creating libraries, and using them.

In the next section, I will explain the mechanism that was in place so far in sbt to address this issue, and I will discuss its limits. Then, I will introduce a new solution, which concerns both library authors, and library users.

## Eviction warnings

Actually this is not the first time I've thought of this issue. In 2014, I added [eviction warning][1] feature to sbt 0.13.6. In the lingo of dependency resolver, when you have two candidate versions 0.11.1 and 0.13.0, and when it picks 0.13.0, 0.11.1 is said to be evicted.

~~~ text
sbt:killer-app> evicted
[warn] Found version conflict(s) in library dependencies; some are suspected to be binary incompatible:
[warn] * io.circe:circe-core_2.12:0.13.0 is selected over 0.11.1
[warn] +- org.tpolecat:doobie-postgres-circe_2.12:0.10.0 (depends on 0.13.0)
[warn] +- de.heikoseeberger:akka-http-circe_2.12:1.26.0 (depends on 0.11.1)
[success] Total time: 1 s, completed Feb 1, 2021, 2:59:56 PM
~~~

Here, we see that sbt successfully detected the conflict for circe-core.

This was a step in the right direction, but it did not work well in reality because I had no way to tell whether two versions of a library would be binary compatible or not. In 2014 what I did was guess that a Java library would adopt [Semantic Versioning][2], and a Scala library would adopt [PVP][3]. (Apparently, using the first two numbers of the version, e.g. 1.2.x, to mean major version has a name, and it's Haskell Package Versioning Policy, or PVP for short).
Copy link
Member

@SethTisue SethTisue Feb 8, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've never heard of PVP in my life. Apparently it is not common parlance. I'm not sure whether introducing it here is educational, or simply a distraction. I lean towards distraction. Maybe substitute "epoch.major.minor"? Not sure.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

At some point we should introduce the term PVP since it's one of the supported version schemes - https://www.scala-sbt.org/1.x/docs/Publishing.html#Version+scheme


Unfortunately, the reality was different and while some Scala libraries use PVP (e.g., Play framework, the standard library, …), some others do use Semantic Versioning (e.g., `scala-collection-compat`, `cats`, …). This makes eviction warnings less reliable since many of these warnings are false positive, and, in practice, [they are ignored by developers][7].

## Early SemVer and sbt-version-policy

If you're maintaining a library, the first thing you could do to help is picking a version scheme. Our recommendation for libraries going forward is adopting **Early Semver** as described in the documentation page [Binary Compatibility for Library Authors][4]. Please read the linked post by Jacob Wang for more details on binary compatibility and source compatibility.

Given a version number `major.minor.patch`, you MUST increment the:

1. `major` version if backward **binary compatibility** is broken,
2. `minor` version if backward **source compatibility** is broken, and
3. `patch` version to signal neither binary nor source incompatibility.

- When the `major` version is `0`, a minor version increment MAY contain **both source and binary breakages**, but a patch version increment MUST remain **binary compatible**.

We call this Early SemVer, because according to the [Semantic Versioning Spec][2] there are no guarantees between any versions when the major version is `0`. In the Scala library ecosystem, though, we often start guaranteeing binary compatibility for `0.y.z` like sbt 0.13 and Scala.JS 0.6.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"We call this" is ambiguous who "we" is. Is "early semver" standard terminology, or is it sbt-specific?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

early-semver is a version scheme Alex came up as a name in coursier/versions#7, and I've been pushing it as implementation in Coursier, sbt, and elsewhere.


Unfortunately, it is not an easy task to know whether a change on public API broke source or binary compatibility. Scala Center contracted Alexandre Archambault to create [sbt-version-policy][5]. This plugin helps library authors to self-check a version scheme. To use this, add the following to your `project/plugins.sbt`:

~~~ scala
addSbtPlugin("ch.epfl.scala" % "sbt-version-policy" % "1.0.0-RC5")
~~~

And declare your compatibility intention regarding the next release, in your `build.sbt`:

~~~
// The next release must be binary and source compatible with the previous release (it will be a patch release)
ThisBuild / versionPolicyIntention := Compatibility.BinaryAndSourceCompatible
~~~

This plugin provides a task called `versionPolicyCheck`, which you can call in the CI (continuous integration) server. The task performs an automatic binary compatibility check.

~~~ text
> versionPolicyCheck
~~~

I think Early Semver gives better flexibility to both library authors and library users since it gives more information about what would be in minor upgrades (I often call this feature release) vs patch. For example, sbt 1 ships bug fixes as 1.3.x patch releases without going through RC cycle, so bug fixes are released quickly.

During the minor upgrade (feature release), we aggressively add new features, while maintaining binary compatibility for the plugin ecosystem. These only come out once a year, and it goes through the RC cycle.

## versionScheme, libraryDependencySchemes, and sbt 1.5.0

As mentioned earlier, sbt contains a built-in eviction warning feature, but there are too many false positives right now because it needs to guess.

Since sbt 1.4.0, there is a new setting `versionScheme`, which can be used by library authors as follows in `build.sbt`:

~~~ scala
ThisBuild / versionScheme := Some("early-semver")
~~~

sbt 1.4.0 includes this information into `pom.xml` and `ivy.xml` as a property. In addition, sbt uses the information to take the guessing out of eviction warning when this information is available.

In sbt 1.5.0, eviction warnings will be replaced with [eviction errors][8]. Since we can now reliably detect whether two dependencies with different versions are compatible, or if they conflict, the build will fail if an incompatibility is detected in your dependencies.

It might take a few years for the `versionScheme` information to become prevalent in the ecosystem, but once it happens the eviction warning could become more accurate. In the meantime, as a user of libraries you can manually configure the versioning scheme used by your libraries by using a new setting, `libraryDependencySchemes`. For instance, here is how we can tell sbt that Circe’s artifacts follow the Early SemVer scheme:

~~~ scala
ThisBuild / libraryDependencySchemes += "io.circe" %% "circe-*" % "early-semver"
~~~

## Summary

If you are a library author, check out [sbt-version-policy][5] to enforce the recommended versioning scheme. Or, at least declare the versioning scheme you use, with the `versionScheme` key. If you are a library user, keep in mind that starting from sbt 1.5.0 you should configure your `libraryDependencySchemes` to get accurate eviction errors.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would suggest putting the recommendation to use versionScheme first. It is a universal recommendation to all library authors that we realllllllly want eeeeeeverybody to take.

The other recommendations are more on a "if you want" basis.

Copy link
Member

@SethTisue SethTisue Feb 8, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In fact, iiuc, that recommendation is so important, so central, that maybe it ought to be right at the top of the blog post, as well as here.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It might be worth turning this into a list

  • Set ThisBuild / versionScheme key if you're publishing a library
  • Check out [sbt-version-policy][5] to enforce the recommended versioning scheme
  • From sbt 1.5.0, use libraryDependencySchemes to supplement the version scheme information

and also copying it to the head of the post as tl;dr in the spirit of "we're not writing fiction".


[1]: https://github.com/sbt/sbt/pull/147
[2]: https://semver.org/
[3]: https://pvp.haskell.org/
[4]: https://docs.scala-lang.org/overviews/core/binary-compatibility-for-library-authors.html#versioning-scheme---communicating-compatibility-breakages
[5]: https://github.com/scalacenter/sbt-version-policy
[7]: https://github.com/sbt/sbt/issues/5976
[8]: https://github.com/sbt/sbt/pull/6221