I recently gave a talk at the Advanced Scala meetup in London on free monads. Despite the name of the group, I think that free monads are eminently simple as well as being extremely useful. Let me explain.
The free monad brings together two concepts, monads and interpreters, allowing the creation of composable monadic interpreters. That's a bunch of big words, but why should we care? Because it allows simple solutions to difficult problems.
Take the example of Facebook's Haxl and Twitter's Stitch. Both systems solve a problem faced by companies that have aggressively adopted a service oriented architecture:1 service orchestration.
Consider rendering a user's Twitter stream. Hypothetically, the process might first retrieve the list of recent tweets from one service. Then for each tweet it might fetch the tweeter's name and picture to go alongside the tweet, which could require a request to two more services. No doubt there are logging and analytics services that would also be involved. All told a great number of services and requests can be involved in answering what is a single request from the UI perspective. With this explosion of requests there are a number of problems: increased network traffic, increased latency (which goes hand-in-hand with traffic), and consistency. The last point deserves some explanation. Imagine two tweets by the same person are in the stream. That person could change their details inbetween fetching the name and photo for the first and second tweet. If we allow this inconsistency to occur it makes for a very poor user experience, as the user can't tell at a glance that the two tweets are by the same person. It's fairly clear that we could avoid this inconsistency and solve our network traffic and latency issues if we just cached data. We could implement this by writing special-purpose request aggregation and caching for each request type, which is quickly going to be a losing battle as APIs and interfaces evolve. Or we could write a general purpose tool that allows us to describe the data we need and takes care of the optimisation for us. The free monad allows us to easily do this. Sold? Ok, let's get back to describing the free monad.
Remember I said the free monad brings together monads and interpreters. Let's start with the monad part. I'm going to assume you understand monads already. If not, don't worry. They're just like cats or burritos or something.
Now recall that a monad is defined by two operations2, point
and flatMap
, with signatures
point[M[_], A](a: A): M[A]
; andflatMap[M[_], A, B](fa: F[A])(f: A => F[B]): F[B]
.Point
is not very interesting --- it just wraps a monad around a value. FlatMap
is, however, the distinguishing feature of a monad and it tells us something very important: monads are fundamentally about control flow. The signature of flatMap
says you combine a F[A]
and a function A => F[B]
to create a F[B]
. The only way to do this is to get the A
out of the F[A]
and apply it to the A => F[B]
function. There is a clear ordering of operations here, and repeated applications of flatMap
creates a sequence of operations that must execute from left to right. So we see that monads explicitly encode control flow.
Related to this, the continuation monad can be used to encode any other monad. What is a continuation? It's a universal control flow primitive. Any control flow can be expressed using continuations.
We usually use monads to glue together pure functions with special purpose control-flow, such as fail fast error handling (using \/
or Either
) or asynchronous computation (using Future
). The free monad allows us to abstractly specify control flow between pure functions, and separately define an implementation.
Ok, so that's monads: control flow. What about interpreters. Interpreters are about separating the representation of a computation from the way it is run. Any interpreter has two parts3:
A simple example is in order. Consider the expression 1 + 2 + 3
. We can execute this directly, evaluating to 6
, or we could represent it as an abstract syntax tree such as Add(1, Add(2, 3))
. Given the AST we could choose from many different ways to interpret it:
Ints
, Doubles
, or arbitrary precision numbers.Hopefully this gives you a feel for the structure and power of the interpreter pattern.
We have talked about monads and interpreters. I said the free monad is just the combination of the two. Concretely this means the free monad provides:
What does the AST look like? It simply represents the monad operations without giving meaning to them. The usual representation of the free monad represents the monadic operations in terms of point
along with join
, instead of the more familiar flatMap
, but the point is still the same. An example encoding is
sealed trait Free[F[_], A]
final case class Return[F[_], A](a: A) extends Free[F, A]
final case class Suspend[F[_], A](s: F[Free[F, A]]) extends Free[F, A]
Now what does a free monad interpreter look like? It's just a function from F[_]
, the representation inside the free monad, to G[_]
some monad in which we really run the computation (the Id
monad is a popular choice). This type of function has a special name, a [natural computation][natural-computation].
Here's a simple example.
First we define an algebraic data type to represent the actions we're going to store in our monad.
import scalaz.{Free, ~>, Id, Functor}
sealed trait Log[A]
final case class Debug[A](msg: String, value: A) extends Log[A]
final case class Warn[A](msg: String, value: A) extends Log[A]
final case class Error[A](msg: String, value: A) extends Log[A]
For technical reasons we need to have a Functor
instance.
object Log {
implicit val logFunctor: Functor[Log] = new Functor[Log] {
def map[A, B](fa: Log[A])(f: A => B): Log[B] =
fa match {
case Debug(msg, value) => Debug(msg, f(value))
case Warn(msg, value) => Warn(msg, f(value))
case Error(msg, value) => Error(msg, f(value))
}
}
// Smart constructors
def debug[A](msg: String, value: A): Log[A] = Debug(msg, value)
def warn[A](msg: String, value: A): Log[A] = Warn(msg, value)
def error[A](msg: String, value: A): Log[A] = Error(msg, value)
}
Now we define an interpreter for Log
. This interpreter just prints to the console. You can imagine more elaborate interpreters that, say, output logs to Kafka or other infrastructure. The interpreter is just simple structural recursion on Log
.
object Println extends (Log ~> Id.Id) {
import Id._
import scalaz.syntax.monad._
def apply[A](in: Log[A]): Id[A] =
in match {
case Debug(msg, value) =>
println(s"DEBUG: $msg")
value.point[Id]
case Warn(msg, value) =>
println(s"WARN: $msg")
value.point[Id]
case Error(msg, value) =>
println(s"ERROR: $msg")
value.point[Id]
}
}
Finally here's an example of definition and use.
object Example {
val free =
for {
x <- Free.liftF(Log.debug("Step 1", 1))
y <- Free.liftF(Log.warn("Step 2", 2))
z <- Free.liftF(Log.error("Step 3", 3))
} yield x + y + z
val result =
free.foldMap(Println)
}
That's the basics of the free monad: it's something we can wrap around an arbitrary type constructor (a F[_]
) to construct a monad. It allows us to separate the structure of the computation from its interpreter, thereby allowing different interpretation depending on context.
There are a lot of conveniences for using the free monad. We can use something called the Coyoneda theorem to automatically convert a type constructor into a functor that the free monad requires. We can compose different types wrapped in the free monad, and different interpreters, using coproducts. This is all useful stuff but not essential for understanding the core idea.
The core idea, separating the structure and interpretation of computer programs, is incredibly powerful (wizardly, even). Haxl and Stitch are just one prominent example of this.
If you are interested in learning more about these ideas, we are writing a book Essential Interpreters that covers the basics of interpreters up to the free monad.
Etsy, for example, faces the same problem but [their solution][https://codeascraft.com/2015/04/06/experimenting-with-hhvm-at-etsy/] is rather less elegant and performant.
And the monad laws.
Some very simple interpreters entwine these two parts, but they are conceptually if not literally separate.