dataPulverizer/ctseq.md

## ctseq.md

      
    Raw
  

              ctseq.md
            
          
    Introduction

Compile time sequences in D are insane, not insane "crazy" but insanely interesting. One of the selling points of D is that it takes compile time programming in C++ to another level, where templates in C++ were a 'happy?' accident, in D they were purposely built into the language along with other generic and meta-programming tools. To highlight this aspect of the D language, we will implement a selection of templates to manipulate compile time sequences of types, only the text function from the standard library in the std.conv module will be imported which is for concatenating text, we will build everything else including AliasSeq and Nothing from the std.meta module which they are easy one-liners.
This article builds on the concepts of a previous article on compile time reads of IDX files in D but here the focus is on compile time sequences, and introducing some new tools and concepts. Some desireable features of compile time sequences are:

Indexing - selecting items using a (numerical) index. This is built-in, we don't need to do anything to get this functionality.
Append, Prepend, and Concatenate - we will see that these are implicit in the definition of compile time sequences.
Removing item(s).
Inserting item(s).

Good sources for further information of the concepts discussed here are given in:

D's language reference.
"Programming in D" book by Ali Çehreli
Documentation of the std.meta package.
The  std.meta package on GitHub. I don't routinely recommend reading actual code libraries and maybe I should do that more, but you can learn a lot about how to write compile time code in D by reading std.meta.

Template abbreviations and preliminaries

There are lots of templatable data types and commands in D and it's easy to over-look some. These include:

Functions.
Structs, Classes, and Interfaces.
Enumerations.
Alias - used for type aliasing.
Template expressions. Templates in D are an important part of compile time programming, as are static if,  static foreach loops (covered in the previous article), and recursion. You could say C++ templates are also powerful, but remember that templates in D were not incidental, they were designed into the language from the beginning, they are there on purpose, very powerful, and easy to use. They are also prettier than C++ templates.

Function template abbreviations

We won't be creating any template functions in these compile time sequences, however for completeness the example below shows a template function for the inverse square law for gravitational attraction force between two bodies:
// Longhand
template GravitationalAttraction(T)
{
  auto GravitationalAttraction(T G, T m1, T m2, T r)
  {
    return (G*m1*m2)/(r^^2); //the two carets are not a mistake
  }
}

// Shorthand
auto GravitationalAttraction(T)(T G, T m1, T m2, T r)
{
  return (G*m1*m2)/(r^^2);
}
These two forms are treated exactly the same by the compiler therefore you can only include one of them in your script.
An example call for the template would be:
auto f = GravitationalAttraction!(float)(G, m1, m2, r);
Note that D has type inference for templates but this topic will not be covered here.
Structs classes and interfaces

Structs classes and interface template abbreviations work in the same way, so I'll only show an example for a struct:
// Longhand
template Planet(T)
{
  struct Planet
  {
    T mass;
    T volume;
    T surfaceArea;
    T meanSurfaceTemp;
  }
}
// Shorthand
struct Planet(T)
{
  T mass;
  T volume;
  T surfaceArea;
  T meanSurfaceTemp;
}
The template struct can be called using:
auto planet = Planet!(double)(mass, vol, surfA, meanTemp);.
Enumerations

Traits in D are templates that "extract information about types and symbols at compile time", often these are compile time predicates and enumeration templates are an opportunity to show how very simple versions of these are contructed. Suppose we wanted to restrict the template types in the functions and struts shown previously to floating point T, float, double, and real? A compile time predicate is exactly what you need:
// Longhand
template isFloat(T)
{
  enum bool isFloat = is(T == float) || is(T == double) || is(T == real);
}
// Shorthand
enum bool isFloat(T) = is(T == float) || is(T == double) || is(T == real);
Notice that you can not simply write T == float for types, you need to use the is() operator to obtain a bool from that statement. This enum can be applied to all the previous templates. For functions:
// Longhand
template GravitationalAttraction(T)
if(isFloat!T)
{
  auto GravitationalAttraction(T G, T m1, T m2, T r)
  {
    return (G*m1*m2)/(r^^2);
  }
}
// Shorthand
auto GravitationalAttraction(T)(T G, T m1, T m2, T r)
if(isFloat!T)
{
  return (G*m1*m2)/(r^^2);
}
structs:
// Longhand
template Planet(T)
if(isFloat!T)
{
  struct Planet
  {
    T mass;
    T volume;
    T surfaceArea;
    T meanSurfaceTemp;
  }
}
// Shorthand
struct Planet(T)
if(isFloat!T)
{
  T mass;
  T volume;
  T surfaceArea;
  T meanSurfaceTemp;
}
Restricting templates in this way is how we create template constraints in D.
The alias keyword

The usual use for the alias keyword is for aliases of a type. For example and alias for a pointer type:
// Longhand
template P(T)
{
  alias P = T*;
}
// Shorthand
alias P(T) = T*;

to use it do P!(real) x;
AliasSeq!(T) compile time sequences

AliasSeq is implemented in the std.meta library of Phobos (D's standard library). The implementation is very simple:
alias AliasSeq(T...) = T;
we will also borrow the implementation of Nothing from the same module though this is not available for being imported (package only visibility):
alias Nothing = AliasSeq!();
That's it!
Indexing compile time sequences

To create a sequence of types we simply do:
alias tList = AliasSeq!(bool, string, ubyte, short, ushort);
pragmas interact directly with the compiler, and we can print a message at compile time using pragma(msg, "The message, ", "then another message ...") for instance:
pragma(msg, "Input types: ", tList);
outputs:
Input types: (bool, string, ubyte, short, ushort)
We can access individual elements using for instance tList[1] and carry out slice operations using tList[1..3] so the following:
pragma(msg, "Single index selection: ", tList[1]);
pragma(msg, "Slice selection: ", tList[1..3]);
outputs:
Single index selection: string
Slice selection: (string, ubyte)
Append, prepend, and concatenating type lists

One important thing about AliasSeq(T...) typelists is that they are not a traditional "container"; when they are input into templates they kind of just "spread out" and become like separate arguments of a function as if they were not contained but inputted in separately. We'll see what that means later but one consequence is there is no need to define operations for Append, Prepend, and Concatenate because it's already implied in the definition. Just put typelists together and that's it:
Append

Here the type ulong is appended to the previously created typelist:
alias appended = AliasSeq!(tList, ulong);
pragma(msg, "Appended list: ", appended);
Output:
Appended list: (bool, string, ubyte, short, ushort, ulong)
Prepend

Here the type ulong is preprended to the previously created typelist
alias preprended = AliasSeq!(ulong, tList);
pragma(msg, "Prepended list: ", preprended, "\n");
Output:
Prepended list: (ulong, bool, string, ubyte, short, ushort)
Concatenate

Here two typelists are concatenated ...
alias concat = AliasSeq!(tList, AliasSeq!(int, long, ulong));
pragma(msg, "Cocatenated list: ", concat);
Output:
Concatenated list: (bool, string, ubyte, short, ushort, int, long, ulong)
Notice how they are now one typelist.
Replacing items

Replacing a single item by index

The code below shows the implementation for the internals of the Replace template that replaces a single item in an AliasSeq compile time sequence. It is marked private which limits its visibility to the module being defined, its interface is defined later.
private template Replace(long i, long r, S, Args...)
{
  static if(Args.length == 1)
  {
    static if(i == r)
    {
      alias Replace = S;
    }else{
      alias Replace = Args[0];
    }
  }else static if(Args.length > 1)
  {
    static if(i == r)
    {
      alias Replace = AliasSeq!(Replace!(i - 1, r, S, 
                                          Args[0..($ - 1)]), S);
    }else
    {
      alias Replace = AliasSeq!(Replace!(i - 1, r, S, 
                                          Args[0..($ - 1)]), Args[$ - 1]);
    }
  }else static if(Args.length == 0)
  {
    alias Replace = Args;
  }else{
    static assert(false, "Error from Replace template something went wrong");
  }
}
Let's break down what's going on here. Firstly the declaration:
private template Replace(long i, long r, S, Args...){/*... Code ...*/}
Templates can have typed arguments in this case long i and long r. Here r is the index location to the item we want replaced, i is a candidate index, we check if i == r and if it is, we do a replacement.
We have an individual type S and then a variable number of types Args.... S is what we replace whatever is in r with and Args is the type sequence where the replacement will be done. Note that you can only have a variable number of template arguments at the end, and in such a situation S will always be assumed to be single element by the compiler, what is happening is essentially pattern matching. For example, if we compile this script:
alias AliasSeq(T...) = T;

alias lhs = AliasSeq!(byte, ubyte);
alias rhs = AliasSeq!(short, ushort);

template Join(L, R...)
{
  pragma(msg, "L: ", L);
  pragma(msg, "R: ", R);
  alias Join = AliasSeq!(L, R);
}

alias joined = Join!(lhs, rhs);
pragma(msg, "Join!(lhs, rhs): ", joined);

void main(){}
we get the output:
L: byte
R: (ubyte, short, ushort)
Join!(lhs, rhs): (byte, ubyte, short, ushort)
Compiler-interpreter:
If you are "weirded out" by the void main(){} at the end, it's because all our code is compile time and the main function is created for runtime, though we could put our code in main, we don't have to. To compile the script in a "standard way" we need a main function, but we can get rid of this by using the -o- flag to suppress the creation of an object (executable) file. If we do this we can delete the main function and compile using dmd script.d -o- (for dmd). There's nothing to "run" because we don't create an object file and everything we do happens at compile time a bit like an interpreted script.
If we try to use this template instead:
template Join(L, R)
{
  alias Join = AliasSeq!(L, R);
}
we will get a compiler error:
Error: template instance Join!(byte, ubyte, short, ushort) 
            does not match template declaration Join(L, R)
and if we try
template Join(L..., R...)
{
  alias Join = AliasSeq!(L, R);
}
we will get... Error: variadic template parameter must be last.
The Replace template is recursive, and there are two conditions we care about. Firstly the termination, when Args.length == 1 and then the continue condition when Args.length > 1. When Args.length == 1 we terminate with:
static if(i == r)
{
  alias Replace = S;
}else{
  alias Replace = Args[0];
}
When Args.length > 1 we do the following recursive call:
static if(i == r)
{
  alias Replace = AliasSeq!(Replace!(i - 1, 
                       r, S, Args[0..($ - 1)]), S);
}else
{
  alias Replace = AliasSeq!(Replace!(i - 1, 
                       r, S, Args[0..($ - 1)]), Args[$ - 1]);
}
so we start at the end of the sequence and work our way backwards to the first element.
There are two other conditions, the first of these is if we ever get a case when Args.length == 0 i.e. Args = AliasSeq!() argument, and the other is if we get something else we don't expect (which may well be overkill) but is an example of using a static assert to trigger an error in a specific case. Before going further into the first of these cases let's look at the interface - the template that is meant to be called using a template overload:
alias Replace(long r, S, Args...) = Replace!(Args.length - 1, r, S, Args);
... meaning that we should never hit the error case. If we wanted, we could move the Args.length == 0 case to another template entirely using template constraints:
private template Replace(long i, long r, S, Args...)
if(Args.length == 0)
{
  alias Replace = Args;
}
and the declaration for the first template becomes:
private template Replace(long i, long r, S, Args...)
if(Args.length > 0)
{/*... Code ... */}
Let's see if Replace template works:
// Type list
alias tList = AliasSeq!(bool, string, ubyte, short, ushort);
// Replace examples ...
alias replace0 = Replace!(0, int, tList);
pragma(msg, "Modify (bool @ 0) => int: ", replace0);
alias replace1 = Replace!(1, byte, tList);
pragma(msg, "Modify (string @ 1) => byte: ", replace1);
alias replace2 = Replace!(tList.length - 1, uint, tList);
pragma(msg, "Modify (ushort @ end) => uint: ", replace2);
// Replace in Nothing for Args.length == 0
alias replace3 = Replace!(0, int, Nothing);
pragma(msg, "Replace in Nothing: ", replace3);
Output:
Modify (bool @ 0) => int: (int, string, ubyte, short, ushort)
Modify (string @ 1) => byte: (bool, byte, ubyte, short, ushort)
Modify (ushort @ end) => uint: (bool, string, ubyte, short, uint)
Replace in Nothing: ()
Replacing multiple items with an individual type

In this section we introduce the concept of string mixins. String mixins are a little like macros in C. They allow the user to use strings to construct code and include them in scripts. Unlike macros in C, "[string mixins] in text must form complete declarations, statements, or expressions", and they have other added protections that make them inherently safer than those in C. However in this case we are not creating runtime code from mixins but code for templates.
In D the usual method for concatenating strings is using '~' operator, however we have to be careful with this, '~' will interpret a number such as 0 as a null byte, so we use the text function to do concatenation [reference]. The template for doing a Replace of multiple items with a single one is given below:
import std.conv: text;
template Replace(alias indices, S, Args...)
if(Args.length > 0)
{
  enum N = indices.length;
  static foreach(i; 0..N)
  {
    static if(i == 0)
    {
      debug pragma(msg, "Compiled string 1: ", 
          text(`alias x`, i, ` = Replace!(indices[i], S, Args);`));
      mixin(text(`alias x`, i, ` = Replace!(indices[i], S, Args);`));
    }else{
      debug pragma(msg, "Compiled string 2: ",
          text(`alias x`, i, ` = Replace!(indices[i], r, S, x`, (i - 1), `);`));
      mixin(text(`alias x`, i, ` = Replace!(indices[i], S, x`, (i - 1), `);`));
    }
  }
  debug pragma(msg, "Compiled string 3: ", 
        text(`alias Replace = x`, N - 1, `;`));
  mixin(text(`alias Replace = x`, N - 1, `;`));
}
Let's break this down. As we discussed in a previous article all compile time variables are constants so when they are defined they can not be changed. In this case we iterate over all the numbers in indices and call the Replace template for single items on each of the indices. Since we can not store each iteration into the same (immutable) variable we generate a new compile time sequence each time, x0, x1, ...xN and we do this by referring to the previous sequence Args, x0, ..., x(N-1). static foreach loops in D enclosed in single curly braces { are not scoped, it is the equivalent of pasting the code over and over each time. So we are in fact creating a series of sequences with different names and at the end effectively returning the final sequence from the template:
mixin(text(`alias Replace = x`, N - 1, `;`));
The code also shows debug conditional compilation directives before each pragma statement which allows us only run these with the -debug flag in the dmd compiler. You may or may not have noticed that we introduced an alias keyword in our template declaration template Replace(alias indices, S, Args...){/*... Code ...*/}, this allows us to specify anything that is not a template and we don't have to give the type.
Using this Replace overload for the multiple replacement by a single item:
alias replace4 = Replace!([0, 2, 4], ulong, tList);
pragma(msg, "Replace ([0, 2, 4]) => ulong: ", replace4);
output:
Replace ([0, 2, 4]) => ulong: (ulong, string, ulong, short, ulong)
Replacing multiple items by a tuple of items

Things will now get a little more interesting. We've already shown that there is no way of passing more than one AliasSeq!(T) sequence as template parameters while retaining containment, you always end up with all but one (or however extra) of the template parameters in the variable length. To remedy this we can use a tuple typelist, there are tuple types and functions in std.typecons but we will be building our own:
struct Tuple(T...)
{
  enum long length = T.length;
  alias get(long i) = T[i];
  alias getTypes = T;
}
This struct is special, it has one member, an enum long constant and no methods only templates aliases, perfect for our compile time needs. We will be able to use it as a "container" for types that can be passed along with AliasSeq without merging typelists. The length constant gives us the length of the typelist, the get(long i) template allows up to get the ith template parameter usage for a tuple x is x.get!(i), and the getTypes method returns the typelist T.
Next we create a predicate trait that tells us whether something is a Tuple or not:
enum bool isTuple(Tup...) = false;
enum bool isTuple(Tup: Tuple!T, T...) = true;
You might notice that this is slightly different from our previous predicate. Here we do a kind of "catchall" for types that are not Tuple to false and then Tuple types to true. Now let's see if this works (D has unit testing capabilities but we will not be covering them here):
alias tupleConcat = AliasSeq!(real, Tuple!(tList));
pragma(msg, "Concatenation with Tuple: ", tupleConcat);
output:
Concatenation with Tuple: (real, Tuple!(bool, string, ubyte, short, ushort))
Now the functionality of Tuple:
pragma(msg, "\nTuple length: ", Tuple!(tList).length);
pragma(msg, "Tuple types: ", Tuple!(tList).getTypes);
pragma(msg, "Tuple!(tList).get!(0): ", Tuple!(tList).get!(0));
pragma(msg, "Tuple!(tList).get!(1): ", Tuple!(tList).get!(1));
pragma(msg, "Tuple!(tList).get!(2): ", Tuple!(tList).get!(2), "\n");
Tuple length: 5L
Tuple types: (bool, string, ubyte, short, ushort)
Tuple!(tList).get!(0): bool
Tuple!(tList).get!(1): string
Tuple!(tList).get!(2): ubyte
and isTuple:
pragma(msg, "\nTesting isTuple (false): ", isTuple!(real));
pragma(msg, "Testing isTuple (false): ", 
                          isTuple!(AliasSeq!(long, ulong, real)));
pragma(msg, "Testing isTuple (true): ", 
                          isTuple!(Tuple!(long, ulong, real)), "\n");
Testing isTuple (false): false
Testing isTuple (false): false
Testing isTuple (true): true
Now we are ready for the template that replaces multiple items in AliasSeq by the types in a tuple:
template Replace(alias indices, S, Args...)
if((Args.length > 0) && isTuple!(S) && (indices.length == S.length))
{
  enum N = indices.length;
  static foreach(i; 0..N)
  {
    static if(i == 0)
    {
      mixin(text(`alias x`, i, ` = Replace!(indices[i], S.get!(i), Args);`));
    }else{
      mixin(text(`alias x`, i, 
             ` = Replace!(indices[i], S.get!(i), x`, (i - 1), `);`));
    }
  }
  mixin(text(`alias Replace = x`, N - 1, `;`));
}
Note that the template constraint includes isTuple, we should go back to our previous implementation of Replace and amend it's constraints to ensure that they are distinguishable:
// Replacing multiple items with an individual type
template Replace(alias indices, S, Args...)
if(Args.length > 0 && !isTuple!(S)){/*... Code ...*/}
Notice that in addition to the isTuple!(S) constraint in the new Replace template there is an aditional constraint indices.length == S.length. We could have used a static assert(indices.length == S.length, "... message ..."); in the template body but it's a design choice where cases with indices.length != S.length don't enter the template body. Instead we could create yet another overload for that case:
template Replace(alias indices, S, Args...)
if((Args.length > 0) && isTuple!(S) && (indices.length != S.length))
{
  static assert(false, "tuple length is not equal to length of replacement indices");
}

Testable with:
//alias replace6 = Replace!([0, 1, 2, 4], Tuple!(long, ulong, real), tList);
Removing items from a compile time sequence

Removing an item from a compile time sequence

In this case we would like to remove the rth item from Args, the internal function in this very similar to the recursive internal function in the Replace case. Of course by now you know that this can be written in different ways, for example break up the cases for different lengths of Args to different templates using template constraints, note the use of Nothing here.
private template Remove(long i, long r, Args...)
{
  static if(Args.length == 1)
  {
    static if(i == r)
    {
      alias Remove = Nothing;
    }else{
      alias Remove = Args[0];
    }
  }else static if(Args.length > 1)
  {
    static if(i == r)
    {
      alias Remove = Remove!(i - 1, r, Args[0..($ - 1)]);
    }else
    {
      alias Remove = AliasSeq!(Remove!(i - 1, r, Args[0..($ - 1)]), Args[$ - 1]);
    }
  }else static if(Args.length == 0)
  {
    alias Remove = Args;
  }else{
    static assert(0, "Error from Remove template something went wrong");
  }
}
the programmer interface is:
alias Remove(long r, Args...) = Remove!(Args.length - 1, r, Args);
we can check that it all works with:
pragma(msg, "\nSequence: ", tList);
alias remove0 = Remove!(0, tList);
pragma(msg, "Removed item @ 0 (bool) => Nothing: ", remove0);
alias remove1 = Remove!(2, tList);
pragma(msg, "Removed item @ 2 (ubyte) => Nothing: ", remove1);
alias remove2 = Remove!(tList.length - 1, tList);
pragma(msg, "Removed item @ end (ushort) => Nothing: ", remove2);
output:
Sequence: (bool, string, ubyte, short, ushort)
Removed item @ 0 (bool) => Nothing: (string, ubyte, short, ushort)
Removed item @ 2 (ubyte) => Nothing: (bool, string, short, ushort)
Removed item @ end (ushort) => Nothing: (bool, string, ubyte, short)
Removing multiple items from a compile time sequence

That's gravy, we built it directly from what we've learnt, now for multiple selection, using alias for specifying multiple items could be a bit too "general", lets see if we can use a Tuple to specify the indices this time. First add a template method for getting enums rather than types to our previous Tuple definition:
struct Tuple(T...)
{
  enum long length = T.length;
  alias get(long i) = T[i];
  enum getEnum(long i) = T[i]; //added
  alias getTypes = T;
}
Now specify the multiple Remove template:
template Remove(Indices, Args...)
if(isTuple!(Indices) && (Indices.length <= Args.length))
{
  enum N = Indices.length;
  static foreach(i; 0..N)
  {
    static if(i == 0)
    {
      mixin(text(`alias x`, i, ` = Remove!(Indices.getEnum!(i), Args);`));
    }else{
      mixin(text(`alias x`, i, ` = Remove!(Indices.getEnum!(i) - i, x`, 
                                                           (i - 1), `);`));
    }
  }
  mixin(text(`alias Remove = x`, N - 1, `;`));
}
Observe that for indices greater than zero we amend the index using Indices.getEnum!(i) - i because the sequence has decreased in length by one. Let's see if it works:
alias indices = Tuple!(0L, 2L, 4L);
alias remove3 = Remove!(indices, tList);
pragma(msg, "\nRemove multiple items ", "@", indices, ": ", remove3);
output:
Remove multiple items @Tuple!(0L, 2L, 4L): (string, short)
Insertion of types into compile time sequences

Insertions of single and multiple items is simple. The case for inserting a single type E at position i shifting the item at the previous i and everything to the right of it one space to the right is as follows:
template Insert(long i, E, Args...)
if((i < Args.length) && (i > 0)  && !isTuple!E)
{
  alias lhs = Args[0..(i)];
  alias rhs = Args[(i)..$];
  alias Insert = AliasSeq!(lhs, E, rhs);
}
The code for inserting multiple items E defined as a Tuple the first of which is at position i is given as:
template Insert(long i, E, Args...)
if((i < Args.length) && (i > 0) && isTuple!E)
{
  alias lhs = Args[0..(i)];
  alias rhs = Args[(i)..$];
  alias Insert = AliasSeq!(lhs, E.getTypes, rhs);
}
let's see if it works:
pragma(msg, "\nInitial sequence: ", tList);
alias insertSeq = AliasSeq!(int, long, ulong);
pragma(msg, "Sequence to insert: ", insertSeq);
alias insert0 = Insert!(1, Tuple!(insertSeq), tList);
pragma(msg, "Inserted sequence @ 1: ", insert0);
alias insert1 = Insert!(tList.length - 1, Tuple!(insertSeq), tList);
pragma(msg, "Inserted sequence @ ($ - 1): ", insert1);
output:
Initial sequence: (bool, string, ubyte, short, ushort)
Sequence to insert: (int, long, ulong)
Inserted sequence @ 1: (bool, int, long, ulong, string, ubyte, short, ushort)
Inserted sequence @ ($ - 1): (bool, string, ubyte, short, int, long, ulong, ushort)
That's it! The final case I leave as an exercise, here I is a Tuple!(long[]...) and E is a Tuple!(T...).
template Insert(I, E, Args...)
if(isTuple!I && isTuple!E && (I.length == E.length))
{/* Code ...*/}
Coda

You've reached the end of the article, well done! It might seem as if this is a rabbit hole of compile time programming madness but I assure you it is not, it does actually have uses one of which I plan to write about at a later date.
Thank you!