Gankra/gist:93b2de2505ef438f1a0e

## gistfile1.md

      
    Raw
  

              gistfile1.md
            
          
    This is written as a companion to the Rust collections library documentation.
It provides an intuitive explanation of performance analysis in the context
of data structures, with Rust's collections in mind.
Measuring Performance

The performance of a collection is a difficult thing to precisely capture. One
cannot simply perform an operation and measure how long it takes or how much
space is used, as the results will depend on details such as how it was
compiled, the hardware it's running on, the software managing its execution, and
the current state of the program. These precise details are independent of the
collection's implementation itself, and are far too diverse to exhaustively test
against.
To avoid this issue, we use asymptotic analysis to measure how performance
scales with the size of the input (commonly denoted by n), or other more
exotic properties. This is an excellent first-order approximation of
performance, but has some drawbacks that we discuss below.
Big-Oh Notation

The most common tool in performing asymptotic analysis is Big-Oh notation.
Big-Oh notation is a way of expressing the relation between the growth rate of
two functions. Given two functions f and g, when we say f(x) is
O(g(x)), you can generally read this as "f(x) is on the order of g(x)".
Informally, we say f(x) is O(g(x)) if g(x) grows at least as fast as
f(x).  In effect, g(x) is an upper-bound on how f(x) scales with x. This
scaling ignores constant factors, so 2x is O(x), even though 2x grows
faster.
This ignoring of constants is exactly what we want when discussing the
performance of collections, because the precise compilation and execution
details will generally only provide constant-factor speed ups. In practice,
these constant factors can be large and important, and should be part of the
collection selection process. However, Big-Oh notation provides a useful way to
quickly identify what a collection does well, and what a collection does poorly,
particularly in comparison to other collections. Note also that Big-Oh notation
is only interested in the asymptotic performance of the functions. For small
values of x the relationship between these two functions may be arbitrary.
While the functions in Big-Oh notation can have arbitrary complexity, by
convention the function g(x) in O(g(x)) should be written as simply as
possible, and is expected to be as tight as possible. For instance, 2x is
O(3x² + 5x - 2),
but we would generally simplify the expression to only the
dominant factor, with constants stripped away. In this case,
<code>x<sup>2</sup></code> grows the fastest, and so we would simply say 2xis <code>O(x<sup>2</sup>)</code>. Similarly, although2x*is* <code>O(x<sup>2</sup>)</code>, this is needlessly weak. We would instead prefer to provide the stronger boundO(x)`.
Several functions occur very often in Big-Oh notation, and so we note them here
for convenience:


O(1) - Constant: The performance of the operation is effectively
independent of context. This is usually very cheap.


O(log n) - Logarithmic: Performance scales with the logarithm of n.
This is usually cheap.


O(n) - Linear: Performance scales proportionally to n.
This is considered expensive, but tractable.


O(n log n): Performance scales a bit worse than linear.
Not to be done frequently if possible.


O(n²) - Quadratic: Performance scales with the square of n.
This is considered very expensive, and is potentially catastrophic for large inputs.


O(2ⁿ) - Exponential: Performance scales exponentially with n.
This is considered intractable for anything but very small inputs.


Time Complexity

The most common measure of performance is how long something takes. However,
even at the abstraction level of Big-Oh notation, this is not necessarily
straight forward. Time complexity is separated into several different
categories, to capture important distinctions. In the simplest case, an
operation always takes O(g(x)) time to execute. However, we may also be
interested in the following measures of time:
Worst-Case Time

The amount of time an operation may take can vary greatly from input to input.
However, it is often possible to determine how much time is taken in the worst-
case. For some operations, the worst-case may be rare and very large. For other
operations, it may be the most common, with rare "fast" events.
For instance, if an operation sometimes takes O(1), O(log n), or O(n) time,
we simply say it takes O(n) worst-case time, since O(n) is the largest.
Worst-case analysis is often the easiest to perform, and is always applicable to
any operation. As such, it is the standard default measure of time complexity.
If time complexity is not qualified, it can be assumed to be worst-case. Having
a good worst-case time complexity is the most desirable, as it provides a strong
guarantee of reliable performance. However, sometimes the most efficient
operations in practice have poor worst-case times, due to rare degenerate
behaviors.
Vec's push operation usually takes O(1) time, but occasionally takes O(n) time,
and so takes O(n) worst-case time.
Expected Time

The running time of some operations may depend on an internal random
process. In this case, expected time is used to capture how long the operation
takes on average. The operation may take much more or less time on any given
input, or even on different calls on the same input.
For instance, if an operation takes O(n log n) time with high probability, but
very rarely takes O(n²) time,
then the operation takes O(n log n) expected time, even though it has a worst-case time
of O(n²). QuickSort is the
canonical randomized operation, with exactly this performance analysis.
Amortized Time

Some operations can have a very high worst-case cost, but over a sequence of
m operations the total cost can sometimes be guaranteed to not exceed some
smaller bound than m times the worst-case.
For instance, Vec's push operation almost always takes O(1) time, but after
(approximately) noperations, a single push may take O(n) time. By worst-case
analysis, all we can say of this situation is that a sequence of m pushes will
take O(mn) time. However, in reality we know that the sequence will only take
O(m) time, since the expensive O(n) operation can be amortized across the
many cheap operations that are guaranteed to occur before an expensive
operation. Therefore, we say that Vec.push() takes O(1) amortized time.
Space Complexity

Space complexity is less commonly discussed, but still an important
consideration. It can be used to measure either how much space a structure
consumes with respect to its contents, or how much additional space an operation
temporarily uses. Generally, a fast operation cannot use much space, because
time complexity is bounded below by space complexity. That is, it takes O(n)
time to even allocate O(n) memory, let alone use it productively.
However, space consumption can be important in resource constrained
environments, or just when working on large datasets. An operation that takes
O(n²) time on a large dataset might be unfortunate,
but consuming O(n²)
extra space to do it, even if only temporary, might prove catastrophic. If the
extra space consumed is greater than O(1), it is also likely allocated on the
heap, which is generally an expensive operation. Knowing this can help give
context to otherwise abstract time complexities.
Like time complexity, space complexity can be expressed in worst-case, expected,
or amortized terms. Unless otherwise stated, an operation can be assumed to use
only O(1) additional worst-case space, and a structure containing n items
can be assumed to have worst-case size O(n).
Problems with Big-Oh Notation

Big-Oh notation is great for broad-strokes analysis of collections and
operations, but it can sometimes be misleading in practice.
For instance, from a pure asymptotic analysis perspective, RingBuf appears to
be a strictly superior collection to Vec. RingBuf supports every operation
that Vec does in the "same" amount of time, while improving the performance of
some operations. However, in practice Vec will outperform RingBuf on many of
the operations they appear to be equally good at. This is because RingBuf
takes a small constant performance penalty to speed up its other operations.
This penalty is not reflected in asymptotic analysis, precisely because it is
a constant.
Similarly, DList appears to be better than Vec
at many operations, and even
provides strong worst-case guarantees on operations like push, where Vec
only provides strong amortized guarantees. However, in practice Vec is
expected to substantially outperform DList over any large sequence of
pushes.
Worse yet, it can sometimes be the case that for all practically sized inputs,
an operation that appears to be asymptotically slower than another may be faster
in practice, because the "hidden" constant of the theoretically fast operation
can be catastrophically large.
For these reasons, we will generally strive to discuss practical performance
considerations in addition to providing the much more convenient and simple
asymptotic bounds for high level comparisons. If an operation on a collection
does not provide any asymptotic performance information, it should be considered
a bug.