alpmestan/-

## -
---
title: Why is servant a type-level DSL?
author: Alp Mestanogullari
date: 2018-07-43 20:00
toc: true
---

---

This post is an attempt at explaining servant's design as an embedded domain
specific language, and particularly why it _had to_ be a _type-level_ domain
specific language, given our requirements. Along the way, we will discuss
approaches for designing extensible EDSLs in Haskell and see why other simpler
approaches just don't cut it.

# A little bit of context

Back in 2014, Sönke Hahn, Julian Arni and myself were working together in "the
Haskell team" at Zalora on all sorts of projects. Many of them involved serving
web applications, querying external APIs or our own services from Haskell,
PHP, JS and probably a few other languages. At the time, we were using
a few of the well established "web frameworks", among which `scotty`,
whenever we had to offer some service over HTTP.

However, writing all those functions for hitting our own webservices was a lot
of manual, error-prone, tedious work. The bigger web applications got, the more
tedious it became. And it had to be done once per language in which we wanted to
hit the application. This could not continue.

For reference, this is what a simple scotty application looks like:

``` haskell
{-# LANGUAGE OverloadedStrings #-}
import Data.Text (split)
import Web.Scotty

main :: IO ()
main = scotty 8000 $
  get "/repeat/:n" $ do
    n <- param "n"
    json (replicate n n)

  post "/message" $ do
    msg <- jsonData
    json (split "\n" msg)
```

How could we somewhat automate the creation of one client function per
endpoint of the web application? In an ideal world, we would just show this
application to some program or library and it would collect all the data it
needs about the overall structure of the application from the code itself, in
order to produce 2 client functions:

``` haskell
-- Client for the first endpoint.
--
-- The Int is the value you want to set ":n" to (/repeat/23, /repeat/10, ...).
getRepeat :: Int -> ClientMonad [Int]

-- Client for the second endpoint.
--
-- The JSON body (just a naked string) to send is the Text argument.
postMessage :: Text -> ClientMonad [Text]
```

which would do all the hard work of preparing an HTTP request for us, even
taking care of JSON encoding and decoding for us. But... the entire structure
of the application is just hidden in the `do` block and we just cannot
programmatically access it.

So... this is not realistically doable. We clearly _need_ to change (a little bit?
a lot?) the way we write our applications, making sure we get a description
of the web's application structure (the endpoints, the part of the request they
use or depend on, what they return) that we could then hand over to _something_,
which would get us our client functions.

We will now try implementating such a web application description DSL in the
most straightforward way possible.

# A first attempt

We want to produce client functions that look like the ones above, that
prepare and send HTTP requests for us by taking some pieces of data
given as arguments to those functions and encoding then storing
them in the right places of the request (request path for
URL captures, request body, headers, etc). Let's perhaps start
simple with a data type for describing an endpoint that can be
served under some path (which can contain static string fragments
and captures), for a given http method, ignoring everything else for now.

It could look like this:

``` haskell
data Method = Get | Post

data Endpoint = Static String Endpoint
              | Capture Endpoint
			  | Verb Method

-- GET /hello/:name
getHello :: Endpoint
getHello = Static "hello" (Capture (Verb Get))
```

and, if we want it to look a little more "servant-y", we can define:

``` haskell
infixr 5 :>
(:>) :: (Endpoint -> Endpoint) -> Endpoint -> Endpoint
f :> x = f x

getHelloNew :: Endpoint
getHelloNew = Static "hello" :> Capture :> Verb Get
```

Unlike servant though, as you can see with the type of `getHello` and
`getHelloNew`, our descriptions are good old Haskell values, both of the
`Endpoint` type.

Given those few definitions, how could we go about, say, generating links
to endpoints? Well, here is a straightforward attempt.

``` haskell
-- a link here is just a list of path components
-- (we ignore query parameters in this post)
type Link = [String]

linkTo :: Endpoint -> Link
linkTo (Static str rest) = str : linkTo rest
linkTo (Verb _method)    = []
linkTo (Capture rest)    = ??? : linkTo rest
```

But... what should we put in place of those `???`, if anything?

Well, we definitely want to add _some_ path component, to fill the
`Capture` slot. However, by definition, a captured path fragment is
not fixed, it is allowed to vary. In other words, `Capture :> Verb Post`
matches both `POST /x` and `POST /y`. We cannot just pick one value
and hope that it is the one the user wanted. We need to take it as an
argument. But what about `Capture :> Capture :> Verb Post`? We would need
our `linkTo` function to take 2 arguments for that case. And zero additional
argument for `Static "hello" :> Verb Post`. This is quite problematic.

Indeed, we would like the type of `linkTo` to be `Endpoint -> Link`,
`Endpoint -> String -> Link`, `Endpoint -> String -> String -> Link` and
so on depending on what the `Endpoint` argument is. In other words,
we want the return type of `linkTo` (when really seen as a function of
one argument, which it is anyway) to depend on the value of type `Endpoint`
it gets as input. That is, we want a type that depends on a value, i.e
dependent types.

Fortunately, our needs are simple enough ...

> TODO: simple GADT that just tracks captures

> TODO: what about when we need request bodies? we can introduce
> a type param for every single thing we want to track. not really
> extensible, if we want req bodies, auth, headers, query params, etc.

> TODO: perhaps mention a GADT that would just track API types

> TODO: but still not extensible enough! We wanted to try a few
> different things that were hard to reconcile without
> a little bit of time and perspective. How do people build
> modular embedded domain specific languages in Haskell?

# The Expression Problem

To quote [Phil Wadler](http://homepages.inf.ed.ac.uk/wadler/papers/expression/expression.txt):
"_the expression problem is a new name for an old problem. The goal is to define
a datatype by cases, where one can add new cases to the datatype and new
functions over the datatype, without recompiling existing code, and while
retaining static type safety (e.g., no casts)_".

In Haskell, the standard approach to representing some domain is
to define an algebraic data type for it. For a simple type of expressions
with additions and integers, we usually do:

``` haskell
data Expr = I Integer | Add Expr Expr
```

and proceed to write what we call "interpreters", which in this case are just
functions that take expressions as input and do something interesting with them.

``` haskell
eval :: Expr -> Integer
eval (I n)     = n
eval (Add a b) = eval a + eval b

prettyPrint :: Expr -> String
prettyPrint (I n) = show n
prettyPrint (Add a b) = unwords [prettyPrint a, "+", prettyPrint b]
```

So, given an expression type, we can easily "add new functions over the
data type", to reuse Phil's wording. We just write a new function.
However, when the time comes to "add new cases to the data type", this approach
becomes painful. A "new case" here means a new constructor for our `Expr` data
type. Let's say we want to support multiplications too:

``` haskell
data Expr = I Integer | Add Expr Expr | Mul Expr Expr
```

Now, we have to modify _every single function that patterns matches on an
`Expr`_ to handle the `Mul` constructor, including our `eval` and `prettyPrint`
"interpreters". For any non-trivial domain, this becomes _very_ painful,
_very_ quickly. Fine, so what other options are there?

[Ralf Lämmel's slides](https://userpages.uni-koblenz.de/~laemmel/TheEagle/resources/pdf/xproblem1.pdf)
on the topic have been of a great help for me, back when we were looking for
a solution suitable to our needs. With Oleg Kiselyov, they show how we can
reasonably easily (that is, in Haskell 98) achieve full extensibility in both
directions (constructors and interpretations) in Haskell. It boils down to:

- Turn what would be a constructor into its own little data type.
- Turn what would be a simple function that operates on the data type into
  a typeclass with a method.
- Write instances of those typeclasses for the data types representing the DSL's
  constructs.

This effectively means that we won't have a single type to represent all the
vald "endpoint descriptions". Instead, with this approach, we will be able to
process any "reasonable" combination of "endpoint components". The `Expr` typeclass
below is exactly what lets us say what is a valid endpoint description and what
isn't. Using their approach for our expressions would look like this:

``` haskell
-- our expression constructs, one data type per
-- constructor we had previously.

	-- integer constants
data I = I Integer

-- Since we don't have an 'Expr' type anymore, to use as a type for
-- the fields of Add, we just make them type parameters. Sometimes
-- 'l' and 'r' might be I, some other times they might be 'Add I I',
-- or 'Add (Add I I) I', and so on. The type reflects the recursive
-- structure.
data Add l r = Add l r

-- an "open union" to be able to describe all the
-- valid expression types.
class Expr a
instance Expr I
instance (Expr l, Expr r) => Expr (Add l r)

-- our first interpretation, evaluation
class Expr a => Eval a where
  eval :: a -> Integer

-- evaluating a constant amounts to returning it
instance Eval I where
  eval (I n) = n

-- if we know how to evaluate two things, we know how to evaluate
-- their addition
instance (Eval l, Eval r) => Eval (Add l r) where
  eval (Add a b) = eval a + eval b

-- our second interpretation, pretty printing
class Expr a => Pretty a where
  pretty :: a -> String

instance Pretty I where
  pretty (I n) = show n

instance (Pretty l, Pretty r) => Pretty (Add l r) where
  pretty (Add a b) = unwords [pretty a, "+", pretty b]
```

Every constructor that we had in our previous `Expr` data type is now
turned into its own little type, and every interpretation becomes a type
class that all those little types are then free to provide an instance for.
In fact, we do not necessarily have to supply an instance of each interpretation
for all of our constructs. If we try to interpret an expression that uses a
construct not supported by this interpretation, we get a type error! This is
much better than calling `error` in some corner cases that should in theory
not be reached... In theory. Right.

Anyway, if we now want to add support for multiplications, we can simply do:

``` haskell
data Mul l r = Mul l r
instance (Expr l, Expr r) => Expr (Mul l r)

instance (Eval l, Eval r) => Eval (Mul l r) where
  eval (Mul a b) = eval a * eval b

instance (Pretty l, Pretty r) => Pretty (Mul l r) where
  pretty (Mul a b) = unwords [autoParens a, "*", autoParens b]
    where autoParens a@(Add _ _) = "(" ++ pretty a ++ ")"
          autoParens           a = pretty a
```

We didn't have to change any existing function, that's great! Let's apply this
approach to a very simplified web application description "language" that we
could make out of tiny building blocks (static path fragments, captures, etc).

# A first serious attempt

Adapting the approach from the previous section to our domain, we can give a
shot at decomposing the kind of information we want to represent into
a few different "constructs" (i.e data types).

``` haskell
-- static path fragments
data Static = Static String

-- variable path fragments ("captures")
data Capture = Capture

-- HTTP method
data Method = Get | Post
-- Leaf of a chain of :>'s, specifies the HTTP method
data Verb = Verb Method
é
-- chain a few "endpoint components" with this operator,
-- all chains must be terminated with a 'Verb' component.
infixr :> 5
data a :> b = a :> b

-- a class to specify all the valid endpoint descriptions
class Endpoint a
instance Endpoint (Verb a)
instance Endpoint rest => Endpoint (Static :> rest)
instance Endpoint rest => Endpoint (Capture :> rest)

-- GET /hello
endpoint1 = Static "hello" :> Verb Get
```

OK, why not. Let's now try to write an interpretation for generating links
to endpoints like the one above. This is a lot simpler and self-contained than
investigating client generation or server-side routing, while retaining many of
the difficulties. The main one is that depending on what we find in the
description of the endpoint, we need the type of the link-generating function
to change: indeed, if we encounter `Capture`s, then the user has to supply
values for them. We will let the user do that through one additional
argument per Capture we encounter.

Let's start with something really simple.

``` haskell
type Link = [String]

-- @renderLink ["hello", "world"] == "/hello/world"@
renderLink :: Link -> String
renderLink xs = '/' : intercalate '/' xs

class HasLink endpoint where
  -- return the path components
  link :: endpoint -> [String]

instance HasLink api => HasLink (Static :> api) where
  link (Static s :> api) = s : link api

instance HasLink api => HasLink (Capture :> api) where
  link (Capture :> api) = ??? : link api

instance HasLink Verb where
  link _ = []
```

We should be appending something in place of those `???` there.
But since `Capture` represents variable path fragments (like `:userid` in
`/user/:userid`, in many web frameworks), we do not want to pick a fixed string,
we would like for this string to be supplied by the caller of `link`, as stated
above. Let's introduce a slightly fancier `HasLink` class to make it seemingly
"variadic".

``` haskell
class HasLink endpoint where
  type LinkType endpoint :: *
  link :: endpoint -> LinkType

instance HasLink Verb where
  type LinkType Verb = Link
  link _ = []

instance HasLink api => HasLink (Static :> api) where
  type LinkType (Static :> api) = LinkType api
  link (Static s :> api) = s : link api

instance HasLink api => HasLink (Capture :> api) where
  -- HERE! we introduce a String argument
  type LinkType (Capture :> api) = String -> LinkType api

  -- we expand the type of link:
  -- link :: (Capture :> api) -> String -> LinkType api
  -- we see that our little `LinkType` trick there allows
  -- link to receive arguments when appropriate
  link (Capture :> api) captureValue = captureValue : link api

-- examples:

-- "/hello"
simpleEndpointLink = renderLink (link endpoint1)

endpoint2 = Capture :> Verb Post
linkFun2 :: String -> Link
linkFun2 = link endpoint2

link2a = renderLink (linkFun2 "foo") -- "/foo"
link2b = renderLink (linkFun2 "bar") -- "/bar"

endpoint3 = Static "hello" :> Capture :> Capture :> Verb Get
link3 = renderLink (link endpoint3 "x" "y") -- "/hello/x/y"
```

This looks promising. Let's now try to introduce some more types here,
by allowing captures to not be specified just as simple strings, but any
`Show`able type (this is terrible, but simple enough for this post). We need
to modify `Capture` to track that `Show`able type we will use to specify
the value of that path fragment.

``` haskell
data Capture a = Capture

instance (Show a, HasLink api) => HasLink (Capture a :> api) where
  -- HERE! we introduce an argument of type 'a'
  type LinkType (Capture :> api) = a -> LinkType api

  -- we expand the type of link:
  -- link :: (Capture a :> api) -> a -> LinkType api
  -- we see that our little `LinkType` trick there allows
  -- link to receive the argument of type 'a' at the right time, just
  -- when we need to stick it at the top of the list
  link (Capture :> api) captureValue = show captureValue : link api
```

We unfortunately cannot just "track" some type by storing it in a field
(which is different from storing _a value of that type_). Instead we make
`Capture` a clone of `Proxy` (from `Data.Proxy`) and just carry around a phantom
type parameter. This is a little inconvenient as we will _have to_ type annotate
_all_ `Capture`s (or use the `TypeApplications` language extension), but let's
roll with this approach for now.

Let's now see an endpoint description using this variant of `Capture`.

``` haskell
endpoint4  = Static "hello" :> (Capture :: Capture Int) :> Verb Post
-- or, with TypeApplications:
endpoint4' = Static "hello" :> (Capture @ Int) :> Verb Post
```

OK, interesting, why not. It does look a little bit ugly. It would look
even uglier if we included the response type in `Verb`, turning it into
`data Verb a = Verb Method` which would require the same kind of type
annotations. And the same problem would manifest itself if we were to add
all the similar types from servant (`ReqBody`, `QueryParam`, `Header`, etc).
This is quite disappointing.

Unrelatedly, have you noticed that I have not given the type of any of our
endpoint descriptions so far? This is on purpose, because those types are
a little bit fancy. Fortunately, they should look familiar:

``` haskell
endpoint1 :: Static :> Verb
endpoint2 :: Capture String :> Verb
endpoint3 :: Static :> Capture String :> Capture String :> Verb
endpoint4 :: Static :> Capture Int :> Verb
```

That's right, not only do the descriptions (which are good old haskell values)
look like servant's API types, but their types too! We can see that we are just
"hiding" the strings in the static path fragment and the HTTP method in verbs
from the type-level. And it gets worse as you want to support a richer description
language. For instance, when we consider content types, we have no choice
but to keep track of them at the type level too, even with this design. Because
we need to make sure suitable encoding/decoding instances are available for the
types that will be represented with those MIME types, and this cannot be done when
discovering `"application/json"` in a list somewhere, at runtime.

> TODO

# Servant's approach (simplified)

First, let me emphasize that any of the designs we have considered so far
are interesting on their own and are fruitful in different ways. They simply
were not quite good enough to meet our requirements which were, again, dictated
by the projects and needs we had at work. This whole project started because
we were sick of getting things wrong when manually constructing (client) or
deconstructing (server) HTTP requests and so on.

> TODO

# Going further

- [Servant, Type Families, and Type-level Everything - A look at advanced GHC features used in Servant](https://arow.info/blog/posts/2015-07-10-servant-intro.html)

  I suspect this is a rather useful resource for Haskellers who haven't yet
  encountered type-level programming in (GHC) Haskell.

- [Implementing a minimal version of servant](https://www.well-typed.com/blog/2015/11/implementing-a-minimal-version-of-haskell-servant/)

  A more approchable and more narrowly focused alternative to the servant paper,
  which consists in implementing a very simplified version of servant, using
  however the same "API type" based approach for the EDSL as the real servant.

- [the servant paper](https://alpmestan.com/servant/servant-wgp.pdf), published at
  the Workshop on Generic Programming, 2015.

- [Software extensions and Integration with Type Classes](https://www.informatik.uni-marburg.de/~kos/papers/gpce06.pdf)

  by Ralf Lämmel and Klaus Ostermann talks in greater depth than the slides
  about the highly modular approach to embedded domain specific languages in
  Haskell and uses it on several examples.

- [serv](https://github.com/tel/serv) and [solga](https://github.com/chpatrick/solga)
  are smaller, younger and (I think) humbler relatives of servant which make
  slightly different choices for the DSL.

  Somewhat relatedly, there is [servant-0.1](https://github.com/alpmestan/servant/tree/master#servant),
  which wasn't anything like the servant most people know.
	---
	title: Why is servant a type-level DSL?
	author: Alp Mestanogullari
	date: 2018-07-43 20:00
	toc: true
	---

	---

	This post is an attempt at explaining servant's design as an embedded domain
	specific language, and particularly why it _had to_ be a _type-level_ domain
	specific language, given our requirements. Along the way, we will discuss
	approaches for designing extensible EDSLs in Haskell and see why other simpler
	approaches just don't cut it.

	# A little bit of context

	Back in 2014, Sönke Hahn, Julian Arni and myself were working together in "the
	Haskell team" at Zalora on all sorts of projects. Many of them involved serving
	web applications, querying external APIs or our own services from Haskell,
	PHP, JS and probably a few other languages. At the time, we were using
	a few of the well established "web frameworks", among which `scotty`,
	whenever we had to offer some service over HTTP.

	However, writing all those functions for hitting our own webservices was a lot
	of manual, error-prone, tedious work. The bigger web applications got, the more
	tedious it became. And it had to be done once per language in which we wanted to
	hit the application. This could not continue.

	For reference, this is what a simple scotty application looks like:

	``` haskell
	{-# LANGUAGE OverloadedStrings #-}
	import Data.Text (split)
	import Web.Scotty

	main :: IO ()
	main = scotty 8000 $
	get "/repeat/:n" $ do
	n <- param "n"
	json (replicate n n)

	post "/message" $ do
	msg <- jsonData
	json (split "\n" msg)
	```

	How could we somewhat automate the creation of one client function per
	endpoint of the web application? In an ideal world, we would just show this
	application to some program or library and it would collect all the data it
	needs about the overall structure of the application from the code itself, in
	order to produce 2 client functions:

	``` haskell
	-- Client for the first endpoint.
	--
	-- The Int is the value you want to set ":n" to (/repeat/23, /repeat/10, ...).
	getRepeat :: Int -> ClientMonad [Int]

	-- Client for the second endpoint.
	--
	-- The JSON body (just a naked string) to send is the Text argument.
	postMessage :: Text -> ClientMonad [Text]
	```

	which would do all the hard work of preparing an HTTP request for us, even
	taking care of JSON encoding and decoding for us. But... the entire structure
	of the application is just hidden in the `do` block and we just cannot
	programmatically access it.

	So... this is not realistically doable. We clearly _need_ to change (a little bit?
	a lot?) the way we write our applications, making sure we get a description
	of the web's application structure (the endpoints, the part of the request they
	use or depend on, what they return) that we could then hand over to _something_,
	which would get us our client functions.

	We will now try implementating such a web application description DSL in the
	most straightforward way possible.

	# A first attempt

	We want to produce client functions that look like the ones above, that
	prepare and send HTTP requests for us by taking some pieces of data
	given as arguments to those functions and encoding then storing
	them in the right places of the request (request path for
	URL captures, request body, headers, etc). Let's perhaps start
	simple with a data type for describing an endpoint that can be
	served under some path (which can contain static string fragments
	and captures), for a given http method, ignoring everything else for now.

	It could look like this:

	``` haskell
	data Method = Get \| Post

	data Endpoint = Static String Endpoint
	\| Capture Endpoint
	\| Verb Method

	-- GET /hello/:name
	getHello :: Endpoint
	getHello = Static "hello" (Capture (Verb Get))
	```

	and, if we want it to look a little more "servant-y", we can define:

	``` haskell
	infixr 5 :>
	(:>) :: (Endpoint -> Endpoint) -> Endpoint -> Endpoint
	f :> x = f x

	getHelloNew :: Endpoint
	getHelloNew = Static "hello" :> Capture :> Verb Get
	```

	Unlike servant though, as you can see with the type of `getHello` and
	`getHelloNew`, our descriptions are good old Haskell values, both of the
	`Endpoint` type.

	Given those few definitions, how could we go about, say, generating links
	to endpoints? Well, here is a straightforward attempt.

	``` haskell
	-- a link here is just a list of path components
	-- (we ignore query parameters in this post)
	type Link = [String]

	linkTo :: Endpoint -> Link
	linkTo (Static str rest) = str : linkTo rest
	linkTo (Verb _method) = []
	linkTo (Capture rest) = ??? : linkTo rest
	```

	But... what should we put in place of those `???`, if anything?

	Well, we definitely want to add _some_ path component, to fill the
	`Capture` slot. However, by definition, a captured path fragment is
	not fixed, it is allowed to vary. In other words, `Capture :> Verb Post`
	matches both `POST /x` and `POST /y`. We cannot just pick one value
	and hope that it is the one the user wanted. We need to take it as an
	argument. But what about `Capture :> Capture :> Verb Post`? We would need
	our `linkTo` function to take 2 arguments for that case. And zero additional
	argument for `Static "hello" :> Verb Post`. This is quite problematic.

	Indeed, we would like the type of `linkTo` to be `Endpoint -> Link`,
	`Endpoint -> String -> Link`, `Endpoint -> String -> String -> Link` and
	so on depending on what the `Endpoint` argument is. In other words,
	we want the return type of `linkTo` (when really seen as a function of
	one argument, which it is anyway) to depend on the value of type `Endpoint`
	it gets as input. That is, we want a type that depends on a value, i.e
	dependent types.

	Fortunately, our needs are simple enough ...

	> TODO: simple GADT that just tracks captures

	> TODO: what about when we need request bodies? we can introduce
	> a type param for every single thing we want to track. not really
	> extensible, if we want req bodies, auth, headers, query params, etc.

	> TODO: perhaps mention a GADT that would just track API types

	> TODO: but still not extensible enough! We wanted to try a few
	> different things that were hard to reconcile without
	> a little bit of time and perspective. How do people build
	> modular embedded domain specific languages in Haskell?

	# The Expression Problem

	To quote [Phil Wadler](http://homepages.inf.ed.ac.uk/wadler/papers/expression/expression.txt):
	"_the expression problem is a new name for an old problem. The goal is to define
	a datatype by cases, where one can add new cases to the datatype and new
	functions over the datatype, without recompiling existing code, and while
	retaining static type safety (e.g., no casts)_".

	In Haskell, the standard approach to representing some domain is
	to define an algebraic data type for it. For a simple type of expressions
	with additions and integers, we usually do:

	``` haskell
	data Expr = I Integer \| Add Expr Expr
	```

	and proceed to write what we call "interpreters", which in this case are just
	functions that take expressions as input and do something interesting with them.

	``` haskell
	eval :: Expr -> Integer
	eval (I n) = n
	eval (Add a b) = eval a + eval b

	prettyPrint :: Expr -> String
	prettyPrint (I n) = show n
	prettyPrint (Add a b) = unwords [prettyPrint a, "+", prettyPrint b]
	```

	So, given an expression type, we can easily "add new functions over the
	data type", to reuse Phil's wording. We just write a new function.
	However, when the time comes to "add new cases to the data type", this approach
	becomes painful. A "new case" here means a new constructor for our `Expr` data
	type. Let's say we want to support multiplications too:

	``` haskell
	data Expr = I Integer \| Add Expr Expr \| Mul Expr Expr
	```

	Now, we have to modify _every single function that patterns matches on an
	`Expr`_ to handle the `Mul` constructor, including our `eval` and `prettyPrint`
	"interpreters". For any non-trivial domain, this becomes _very_ painful,
	_very_ quickly. Fine, so what other options are there?

	[Ralf Lämmel's slides](https://userpages.uni-koblenz.de/~laemmel/TheEagle/resources/pdf/xproblem1.pdf)
	on the topic have been of a great help for me, back when we were looking for
	a solution suitable to our needs. With Oleg Kiselyov, they show how we can
	reasonably easily (that is, in Haskell 98) achieve full extensibility in both
	directions (constructors and interpretations) in Haskell. It boils down to:

	- Turn what would be a constructor into its own little data type.
	- Turn what would be a simple function that operates on the data type into
	a typeclass with a method.
	- Write instances of those typeclasses for the data types representing the DSL's
	constructs.

	This effectively means that we won't have a single type to represent all the
	vald "endpoint descriptions". Instead, with this approach, we will be able to
	process any "reasonable" combination of "endpoint components". The `Expr` typeclass
	below is exactly what lets us say what is a valid endpoint description and what
	isn't. Using their approach for our expressions would look like this:

	``` haskell
	-- our expression constructs, one data type per
	-- constructor we had previously.

	-- integer constants
	data I = I Integer

	-- Since we don't have an 'Expr' type anymore, to use as a type for
	-- the fields of Add, we just make them type parameters. Sometimes
	-- 'l' and 'r' might be I, some other times they might be 'Add I I',
	-- or 'Add (Add I I) I', and so on. The type reflects the recursive
	-- structure.
	data Add l r = Add l r

	-- an "open union" to be able to describe all the
	-- valid expression types.
	class Expr a
	instance Expr I
	instance (Expr l, Expr r) => Expr (Add l r)

	-- our first interpretation, evaluation
	class Expr a => Eval a where
	eval :: a -> Integer

	-- evaluating a constant amounts to returning it
	instance Eval I where
	eval (I n) = n

	-- if we know how to evaluate two things, we know how to evaluate
	-- their addition
	instance (Eval l, Eval r) => Eval (Add l r) where
	eval (Add a b) = eval a + eval b

	-- our second interpretation, pretty printing
	class Expr a => Pretty a where
	pretty :: a -> String

	instance Pretty I where
	pretty (I n) = show n

	instance (Pretty l, Pretty r) => Pretty (Add l r) where
	pretty (Add a b) = unwords [pretty a, "+", pretty b]
	```

	Every constructor that we had in our previous `Expr` data type is now
	turned into its own little type, and every interpretation becomes a type
	class that all those little types are then free to provide an instance for.
	In fact, we do not necessarily have to supply an instance of each interpretation
	for all of our constructs. If we try to interpret an expression that uses a
	construct not supported by this interpretation, we get a type error! This is
	much better than calling `error` in some corner cases that should in theory
	not be reached... In theory. Right.

	Anyway, if we now want to add support for multiplications, we can simply do:

	``` haskell
	data Mul l r = Mul l r
	instance (Expr l, Expr r) => Expr (Mul l r)

	instance (Eval l, Eval r) => Eval (Mul l r) where
	eval (Mul a b) = eval a * eval b

	instance (Pretty l, Pretty r) => Pretty (Mul l r) where
	pretty (Mul a b) = unwords [autoParens a, "*", autoParens b]
	where autoParens a@(Add _ _) = "(" ++ pretty a ++ ")"
	autoParens a = pretty a
	```

	We didn't have to change any existing function, that's great! Let's apply this
	approach to a very simplified web application description "language" that we
	could make out of tiny building blocks (static path fragments, captures, etc).

	# A first serious attempt

	Adapting the approach from the previous section to our domain, we can give a
	shot at decomposing the kind of information we want to represent into
	a few different "constructs" (i.e data types).

	``` haskell
	-- static path fragments
	data Static = Static String

	-- variable path fragments ("captures")
	data Capture = Capture

	-- HTTP method
	data Method = Get \| Post
	-- Leaf of a chain of :>'s, specifies the HTTP method
	data Verb = Verb Method
	é
	-- chain a few "endpoint components" with this operator,
	-- all chains must be terminated with a 'Verb' component.
	infixr :> 5
	data a :> b = a :> b

	-- a class to specify all the valid endpoint descriptions
	class Endpoint a
	instance Endpoint (Verb a)
	instance Endpoint rest => Endpoint (Static :> rest)
	instance Endpoint rest => Endpoint (Capture :> rest)

	-- GET /hello
	endpoint1 = Static "hello" :> Verb Get
	```

	OK, why not. Let's now try to write an interpretation for generating links
	to endpoints like the one above. This is a lot simpler and self-contained than
	investigating client generation or server-side routing, while retaining many of
	the difficulties. The main one is that depending on what we find in the
	description of the endpoint, we need the type of the link-generating function
	to change: indeed, if we encounter `Capture`s, then the user has to supply
	values for them. We will let the user do that through one additional
	argument per Capture we encounter.

	Let's start with something really simple.

	``` haskell
	type Link = [String]

	-- @renderLink ["hello", "world"] == "/hello/world"@
	renderLink :: Link -> String
	renderLink xs = '/' : intercalate '/' xs

	class HasLink endpoint where
	-- return the path components
	link :: endpoint -> [String]

	instance HasLink api => HasLink (Static :> api) where
	link (Static s :> api) = s : link api

	instance HasLink api => HasLink (Capture :> api) where
	link (Capture :> api) = ??? : link api

	instance HasLink Verb where
	link _ = []
	```

	We should be appending something in place of those `???` there.
	But since `Capture` represents variable path fragments (like `:userid` in
	`/user/:userid`, in many web frameworks), we do not want to pick a fixed string,
	we would like for this string to be supplied by the caller of `link`, as stated
	above. Let's introduce a slightly fancier `HasLink` class to make it seemingly
	"variadic".

	``` haskell
	class HasLink endpoint where
	type LinkType endpoint :: *
	link :: endpoint -> LinkType

	instance HasLink Verb where
	type LinkType Verb = Link
	link _ = []

	instance HasLink api => HasLink (Static :> api) where
	type LinkType (Static :> api) = LinkType api
	link (Static s :> api) = s : link api

	instance HasLink api => HasLink (Capture :> api) where
	-- HERE! we introduce a String argument
	type LinkType (Capture :> api) = String -> LinkType api

	-- we expand the type of link:
	-- link :: (Capture :> api) -> String -> LinkType api
	-- we see that our little `LinkType` trick there allows
	-- link to receive arguments when appropriate
	link (Capture :> api) captureValue = captureValue : link api

	-- examples:

	-- "/hello"
	simpleEndpointLink = renderLink (link endpoint1)

	endpoint2 = Capture :> Verb Post
	linkFun2 :: String -> Link
	linkFun2 = link endpoint2

	link2a = renderLink (linkFun2 "foo") -- "/foo"
	link2b = renderLink (linkFun2 "bar") -- "/bar"

	endpoint3 = Static "hello" :> Capture :> Capture :> Verb Get
	link3 = renderLink (link endpoint3 "x" "y") -- "/hello/x/y"
	```

	This looks promising. Let's now try to introduce some more types here,
	by allowing captures to not be specified just as simple strings, but any
	`Show`able type (this is terrible, but simple enough for this post). We need
	to modify `Capture` to track that `Show`able type we will use to specify
	the value of that path fragment.

	``` haskell
	data Capture a = Capture

	instance (Show a, HasLink api) => HasLink (Capture a :> api) where
	-- HERE! we introduce an argument of type 'a'
	type LinkType (Capture :> api) = a -> LinkType api

	-- we expand the type of link:
	-- link :: (Capture a :> api) -> a -> LinkType api
	-- we see that our little `LinkType` trick there allows
	-- link to receive the argument of type 'a' at the right time, just
	-- when we need to stick it at the top of the list
	link (Capture :> api) captureValue = show captureValue : link api
	```

	We unfortunately cannot just "track" some type by storing it in a field
	(which is different from storing _a value of that type_). Instead we make
	`Capture` a clone of `Proxy` (from `Data.Proxy`) and just carry around a phantom
	type parameter. This is a little inconvenient as we will _have to_ type annotate
	_all_ `Capture`s (or use the `TypeApplications` language extension), but let's
	roll with this approach for now.

	Let's now see an endpoint description using this variant of `Capture`.

	``` haskell
	endpoint4 = Static "hello" :> (Capture :: Capture Int) :> Verb Post
	-- or, with TypeApplications:
	endpoint4' = Static "hello" :> (Capture @ Int) :> Verb Post
	```

	OK, interesting, why not. It does look a little bit ugly. It would look
	even uglier if we included the response type in `Verb`, turning it into
	`data Verb a = Verb Method` which would require the same kind of type
	annotations. And the same problem would manifest itself if we were to add
	all the similar types from servant (`ReqBody`, `QueryParam`, `Header`, etc).
	This is quite disappointing.

	Unrelatedly, have you noticed that I have not given the type of any of our
	endpoint descriptions so far? This is on purpose, because those types are
	a little bit fancy. Fortunately, they should look familiar:

	``` haskell
	endpoint1 :: Static :> Verb
	endpoint2 :: Capture String :> Verb
	endpoint3 :: Static :> Capture String :> Capture String :> Verb
	endpoint4 :: Static :> Capture Int :> Verb
	```

	That's right, not only do the descriptions (which are good old haskell values)
	look like servant's API types, but their types too! We can see that we are just
	"hiding" the strings in the static path fragment and the HTTP method in verbs
	from the type-level. And it gets worse as you want to support a richer description
	language. For instance, when we consider content types, we have no choice
	but to keep track of them at the type level too, even with this design. Because
	we need to make sure suitable encoding/decoding instances are available for the
	types that will be represented with those MIME types, and this cannot be done when
	discovering `"application/json"` in a list somewhere, at runtime.

	> TODO

	# Servant's approach (simplified)

	First, let me emphasize that any of the designs we have considered so far
	are interesting on their own and are fruitful in different ways. They simply
	were not quite good enough to meet our requirements which were, again, dictated
	by the projects and needs we had at work. This whole project started because
	we were sick of getting things wrong when manually constructing (client) or
	deconstructing (server) HTTP requests and so on.

	> TODO

	# Going further

	- [Servant, Type Families, and Type-level Everything - A look at advanced GHC features used in Servant](https://arow.info/blog/posts/2015-07-10-servant-intro.html)

	I suspect this is a rather useful resource for Haskellers who haven't yet
	encountered type-level programming in (GHC) Haskell.

	- [Implementing a minimal version of servant](https://www.well-typed.com/blog/2015/11/implementing-a-minimal-version-of-haskell-servant/)

	A more approchable and more narrowly focused alternative to the servant paper,
	which consists in implementing a very simplified version of servant, using
	however the same "API type" based approach for the EDSL as the real servant.

	- [the servant paper](https://alpmestan.com/servant/servant-wgp.pdf), published at
	the Workshop on Generic Programming, 2015.

	- [Software extensions and Integration with Type Classes](https://www.informatik.uni-marburg.de/~kos/papers/gpce06.pdf)

	by Ralf Lämmel and Klaus Ostermann talks in greater depth than the slides
	about the highly modular approach to embedded domain specific languages in
	Haskell and uses it on several examples.

	- [serv](https://github.com/tel/serv) and [solga](https://github.com/chpatrick/solga)
	are smaller, younger and (I think) humbler relatives of servant which make
	slightly different choices for the DSL.

	Somewhat relatedly, there is [servant-0.1](https://github.com/alpmestan/servant/tree/master#servant),
	which wasn't anything like the servant most people know.