ajs/perl6-python-lessons.md

## perl6-python-lessons.md

      
    Raw
  

              perl6-python-lessons.md
            
          
    Some lessons I think Python should learn from Perl 6

I'm not going to wax poetic. I'm a professional Python programmer who has worked with the language for about
10 years and a former Perl 5 developer who also worked with the developers of Perl 6 for a few years.
I have no illusions about Perl 5's ... age, let's say. It was a solid tool for the day, but Python aged well
and Perl 5 did not. Perl 6, on the other hand... well, it's a bag of tricks, and not all of those tricks are
ones you want played on you. But, I think there are lessons here. Let's get to them.
First the non-lessons

Perl 6 is famed for its grammars (Lots of people say "rules" but that's not quite right. Perl 6 rules are
just one part of the larger system of grammars which are an extension of the object space into parsing much
like database ORMs are for SQL) They're great, but they also rely heavily on the way Perl 6 thinks. A native
parser system in Python would be amazing, but it wouldn't be Perl 6 rules.
Also, I don't think that Python would benefit from the way Perl 6 does its exception handling or other things
that Python already does, if not better, at least differently and not worse (exception handling is actually
something I think Python does better).
There are also things in Perl 6 that Python doesn't do and Perl 6 doesn't do well, but I see as a herald of
things to come in Python, once a better solution is found. One great exampole is the command-line processing.
Perl 6 command-line processing is shockingly easy to use (it just introspects the MAIN function for arguments,
types and specially formed comments then turns them into parameters). But it's horribly limited and clunky
if you want to do anything more than basic command-line flags. Good idea, incomplete execution.
Now to the lessons

gather/take: a better yield

The idea to gather and take is that take works just like yield, but it works on a block with a gather outside
of it, so you might
have something like this:
return [~] gather for 65..90 -> $ord_value {
    take chr($ord_value);
}

Here, we concatenate ([~]) the results of a for loop that generates the ASCII capital letters, returning the string
ABCD...WXYZ.
This is a trivial example that could be done other ways, but it gives you the basic idea. take just yields its value
to the nearest enclosing gather which returns an iterator of those values.
This resolves the problem of what you see all too often in Python:
def letters():
    def _letters():
        for ord_value in range(65,91):
            yield chr(ord_value)
    return ''.join(_letters())

The issue with turning this into the Perl 6 version is that it requires something Python doesn't much like: a statement
that can contain an expression that contains a statement. This is something Python works hard to avoid, mostly because
of the issues that it causes with indentation. The easy solution, though, is to require gather to only ever come
immediately after an assignment operator (not counting parameter assignment):
def letters():
    _letters = gather for ord_value in range(65,91):
        take chr(ord_value)
    return ''.join(_letters())

This avoids defining a nested function which makes code harder to read and can be inserted into any scope
and multiple times at that!
Again, these examples are toys and could be condensed into simple comprehensions. But that's the problem with examples:
either they're too tiny to be meaningful or they're too large to focus on the thing you are trying to demonstrate.
Not-including signifiers

Perl 6 uses extra bits of punctuation syntax to do a great many things that Python doesn't need or want to do
that way, but the underlying ideas without the syntax are often helpful.
For example, Perl's equivalent of Python's range is the ..
operator, but .. is really just one of its four forms. Here they all are:
0 ..   3  # 0, 1, 2, 3
0 ..^  3  # 0, 1, 2
0 ^..  3  # 1, 2, 3
0 ^..^ 3  # 1, 2

And because this ends up being used so often, there is a special case:
^3  # 0, 1, 2

Now, in Python there's no need for extra punctuation. Python could simply add two new flags to range:
def range(start, stop, step=1, include_start=True, include_end=False):
    ...

Now those examples from above become:
range(0, 3, include_end=True)
range(0, 3)
range(0, 3, include_start=False, include_end=True)
range(0, 3, include_start=False)

I don't think that single-point range really makes much sense for Python, but if we wanted it, we
could define it as a new function:
upto(3)

So the real question is why is this useful? Sure, it's cute, but does it gain us anything? And
indded it does. It reduces the number of times a programmer has to decide whether to increment or
decrement a value to compensate for the language's default. This means a bevy of sometimes hard
to spot off-by-one errors
simply go away!
A switch construct

"Smart matching"
in Perl 6 was essentially created to enable the HLL-equicalent of a C switch that
many people had been asking for in Perl 5, but which never seemed practical because HLLs deal with
data so differently from their lower-level cousins. A Perl 6 switch looks like this:
given $foo {
    when 1 { ... }
    when 2 { ... }
    default { ... }
}

But when can take any object or expression and evaluate it in a smart-match against the topic of
the given, which means that if your topic is a list, you can test all of its elements or a when
might take a regular expression. These when blocks can also contain a proceed to continue testing
rather than breaking out of the given after a successful match.
given assigns its topic value with the default variable, $_ which is not used as much in Perl 6
as it was in Perl 5, but does create a fascinating new bit of syntax, the bare method call:
given $foo {
    when .working { say "Working" }
    when .starting { say "Starting" }
    default { say "Broken" }
}

These methods are called on $_ by default, which is aliased to our topic, $foo. Indeed, there are
many benefits to thinking of methods this way, one of which is that the dot begins to look like
any other operator, and operators get assignment equivalents: foo .= bar meaning foo = foo.bar,
which has all of the advantages of any other operator assignment, mostly having to do with
not duplicating the left hand side value, potentially invoking side-effects.
Reduction operators

This is probably going to be the most controversial of my points because it introduces a great
many new operators into the language (well, just one... but it has many heads). Perl 6 introduced
the idea of meta-operators. That is operators formed by applying some transformation to a regular
operator. Reductions are the simplest version of these, and easiest to understand. Here's Python's
sum operator:
sum([1,2,3,4]) # 10

But why is addition special, here? Why not have a function for every chainable operator? For example:
powertower([3,3,3]) # 7625597484987

For those unfamiliar with mathematical power-towers, they're just exponetiation, but grouped
right-associatively so, the above becomes 3 ** (3 ** 3), which, because Python's ** is
right associative is actually just 3 ** 3 ** 3 or 7625597484987.
But, since the operator already knows how it works and the idea of reduction is already built
in to Python, why not make this automatic:
\** [3,3,3] # 7625597484987

What's the advantage? Well, other than it being much simpler (as any expression is when you
don't have to wrap parens all the way around it) it's also almost all syntax we already know.
The associativity of ** is well documented and any beginning user of Python should know
that ** is exponentiation. What's more, this isn't just for math:
\+ [1, 2, 3] # 6
\+ ['a', 'b', 'c'] # 'abc'
\+ ['a', 2, 3] # TypeError

Notice that there's nothing going on here that's all that magical. This is just expanding to
the elements of the input sequence "joined" in a sematic sense by the + operator. It's
extremely intuitive and easy to use.
But, I hear you say, we already have a sum builtin function. That's true... and when this is deployed
it would still be true, but just as there's no plus builtin function, I think sum would
become deprecated eventually to avoid the duplication. Is this traumatic for long-term Python users?
Maybe. But that's not a good reason to enshrine duplication in a langauge that says there's
one way to do it.
Some other examples:
\|   [...] # bitwise or of ...
\//  [...] # chained integer division of ...
\==  [...] # chained equality of ...
\and [...] # all(...)
\or  [...] # any(...)

This is even more powerful in a language with user-defined operators, but why not skip that step and
just allow functions to be reduced?
def moo($a, $b):
    return $a ~ 'moo' ~ $b
\moo ['a', 'b', 'c'] # 'amoobmooc'

Sadly, there's no way to define precedence on functions, but that's perhaps something for a future
Pytyhon...
Nestable quotes

Perl 5 and Perl 6 share a rather unusual feature: there are nestable versions of the quotation
mark operators. Here is an example:
my $string = q{Now is the time for all good men...};

Why is this valuable? Well, try quoting this string in Python:
q{"''"}

There's nothing shocking about this. It's just a string that contains some quotes. The correct
way to quote it in Python is to use tripple-single quotes:
'''"''"'''

And to be honest, I had to go back and count those carefully while typing because it's horrific.
Backslashes aren't any cleaner, really, and it's so obvious to just use a balanced operator. Python
even has the idea already of special quoting constructs that involve a prefix character. Perl, of
course, being Perl, allows anything after the q, and balances it with itself if it's not defined in
the Unicode spec as balanced (e.g. q/My string/) or with its mate if it is (e.g. `q«My string »).
But that's not a Python-friendly thing
to do, and there's really no reason for it. Oh, and by the way, this works:
q{"'{}'"}

The little things

Perl 6 has many little things that I find to be a tremendous improvement, but which might or might
not fit in Python. Some are fundamental to the language and would require, perhaps, a Python 4
to do right.


Loop controls like break take a label and labels can be placed on loops, so you can break out
to an enclosing loop


Every block is a closure. In Python, this would mean that this:
for i in range(10):
    yield lambda x: x + i

would actually create a new i closure every time, rather than returning a lambda with the
same i each time, thus making this do something it really doesn't look like it is to
anyone coming from other langauges.


Automatic accessors and semi-automatic class parameterization i.e.:
class Point {
    has $.x = 0;
    has $.y = 0;
}

This is very punctuation-heavy in a Perlish way, but it's specifying that attributes
x and y exist on class Point, have accessors named the same and can be
initialized via the new method like so: $point = Point.new(:x(10), :y(20)) and
accessed like so: say $point.x and $point.x = 20.


One place where Perl 6 used less punctuation than Python is in accessing metaclasses.
The Perl 6 for metaclass access is just to put a ^ after the . in method/attribute
access. So Python's foo.__class__.__name__ in Perl 6 becomes foo.^name, which, for
my money, is quite a bit easier to manage in my code and cleaner looking.


The primitive pair type. Perl 6 has a datatype for the associative pair that acts as
the basis for hashes (dicts in Python terms). This means that, rather than returning
two-tuples, Perl 6's equivalent of items returns objects with a named .key and
.value. It also means that everything from named parameter passing to hash construction
use the same mechanism. A Pair is constructed using the => operator, familiar to
Perl 5 programmers.


... to note undefined blocks. This is something many people who see Perl 6 code
online miss. They see something like sub foo {...} and assume that this is
pseudocode, but it's not. It's a perfectly valid bit of Perl 6, which gives a
failure exception if the function is called. It's nearly equivalent to Python's
pass,
but because it can't be successfully executed, makes it much clearer that this is
"to be done" rather than intentially empty.


Word quoting. Perl 6 uses <...> to surround lists of strings that are automatically
quoted by breaking on whitespace. For example: <a b c d e> which is certainly much
easier to type than ['a', 'b', 'c', 'd', 'e'] and thus tends to be slung around
more casually in expressions.


Named local variable passing. A Perl 6 local variable can be passed to a function
which takes a named parameter where the two names are the same like so: foo(:$bar).
In this case, the variable $bar is passed as the bar named parameter to foo.
This might seem inconsequential, but how often do you see Python code littered with
foo(a=a, b=b, c=c, ...)?


Infinity. Having ∞ or its ascii equivalent Inf in Perl 6 makes a great deal of
otherwise obtuse code quite plain.


Block interpolation. Interpolation in general is a touchy issue in Python, but
I think Perl 6 shows a path to a rational solution to the issue. "foo: {foo.perl}."
is equivalent to the Python, "foo: {}.".format(repr(foo)) but with a significant
differnence: the block inside the string is not a runtime construct. Therefore
the syntax of the string includes the syntax of its subordinate block. This means
you detect errors earlier and the flow of reading the code is smoother.