mjwillson /
Last active Aug 29, 2015
Decorate a generator function (or other iterator-returning function) as a multi-shot iterable. A fix for many Python gotchas relating to use of one-shot iterators
class iterable(object):
"""Decorates a generator function (or any other iterator-returning
function) as something which implements the iterable protocol and
can be safely passed to other code which may iterate over it
multiple times.
def foo():
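The gist body is cut off in this listing, so what follows is only a minimal sketch of the idea in the description, not the author's implementation. The names `multishot` and `_Reiterable` are hypothetical. The sketch assumes the wrapped function is cheap and safe to call again on every pass.

```python
import functools


class _Reiterable:
    """Hypothetical helper: remembers a call so it can be replayed."""

    def __init__(self, func, args, kwargs):
        self._func = func
        self._args = args
        self._kwargs = kwargs

    def __iter__(self):
        # Call the underlying function afresh, so every loop over this
        # object gets its own brand-new iterator.
        return self._func(*self._args, **self._kwargs)


def multishot(func):
    """Decorate an iterator-returning function so calls return an iterable."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        return _Reiterable(func, args, kwargs)

    return wrapper


@multishot
def countdown(n):
    while n > 0:
        yield n
        n -= 1


numbers = countdown(3)
first_pass = list(numbers)
second_pass = list(numbers)  # a bare generator would be exhausted here
```

Both passes see the full sequence because `__iter__` restarts the generator each time rather than returning `self`.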
mjwillson / equivalence_relation.rb
Created Nov 13, 2008
Simple Ruby class for an extensionally-specified Equivalence Relation with fast lookup
require 'set'
class EquivalenceRelation
def initialize
@equivalence_class_by_member = {}
def declare_equivalent(*items)
combined_equivalence_class = items.map {|item| equivalence_class_of(item)}.inject(Set.new) {|accum,set| accum.union(set)}
combined_equivalence_class.each {|member| @equivalence_class_by_member[member] = combined_equivalence_class}
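The same merge-and-repoint idea, sketched in Python under some assumptions: the Ruby preview never shows `equivalence_class_of`, so the version here (an undeclared item is equivalent only to itself) is a guess, and the `equivalent` query method is hypothetical.

```python
class EquivalenceRelation:
    """Sketch of an extensionally specified equivalence relation."""

    def __init__(self):
        # Every member of a class points at the same shared set object.
        self._class_by_member = {}

    def equivalence_class_of(self, item):
        # Assumption: an item never declared is alone in its class.
        return self._class_by_member.get(item, {item})

    def declare_equivalent(self, *items):
        combined = set()
        for item in items:
            combined |= self.equivalence_class_of(item)
        # Repoint every member of the merged class at the combined set.
        for member in combined:
            self._class_by_member[member] = combined

    def equivalent(self, a, b):
        # Hypothetical convenience query: one dict lookup, one set lookup.
        return b in self.equivalence_class_of(a)


relation = EquivalenceRelation()
relation.declare_equivalent(1, 2)
relation.declare_equivalent(2, 3)
```

Lookups stay fast because each declaration pays the merge cost up front; the trade-off is that declaring is linear in the size of the merged class.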
mjwillson / yaml_fixtures.rb
Created Nov 20, 2008
Some general-purpose fixture-loading code which supports references and respects dependencies. Quite short as it hooks into yaml.rb to do most of the work
# This is a mixin for Test::Unit::TestCase, for providing tests with a simple facility to load fixtures from
# YAML files. It aims to do as much as possible in a generic way with the YAML library rather than couple
# tightly to a database library.
# Fixtures can use arbitrary YAML to represent ruby objects, using !ruby/object:Some::ClassName where needed.
# You will typically need to implement yaml_initialize on the relevant class, in order for this to
# work in the way you desire (we do so here for Sequel::Model; others including ActiveRecord shouldn't be hard)
# Data within one fixture can refer to other fixtures by file and name, using the syntax: !fixture file/name.
# This is achieved by adding a special YAML domain type.
mjwillson / merb_restful_rangeable_collections.rb
Created Mar 6, 2009
RESTful way of exposing a collection resource in merb in a pageable / sub-range-fetchable way. Supports HTTP Content-Range
# Drop me a line if you wanna see this as a proper merb plugin.
class Merb::Controller
ITEM_RANGE = /^items=(\d+)-(\d+)$/
RANGE = /^(\d+)-(\d+)$/
# Displays a collection resource (using Merb's display method) while supporting requests for sub-ranges of items in a RESTful fashion.
# This supports a subset of the HTTP/1.1 spec for content ranges, using a custom range unit 'items'. e.g.:
# GET /collection HTTP/1.1
# Range: items=10-20
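A rough Python sketch of how a `Range` header using the `items` unit could be parsed and applied. Only the `ITEM_RANGE` pattern comes from the code above. The helper names are hypothetical, and the inclusive end index and the `Content-Range` reply format are assumptions modelled on HTTP byte ranges rather than taken from the gist.

```python
import re

# Same pattern as ITEM_RANGE in the Ruby code above.
ITEM_RANGE = re.compile(r"^items=(\d+)-(\d+)$")


def parse_item_range(header_value):
    """Return (first, last) for a valid header value, else None."""
    match = ITEM_RANGE.match(header_value)
    if match is None:
        return None
    first, last = int(match.group(1)), int(match.group(2))
    if first > last:
        return None
    return first, last


def slice_items(items, header_value):
    """Return (selected_items, content_range_header_or_None)."""
    parsed = parse_item_range(header_value)
    if parsed is None:
        # No usable range: serve the whole collection.
        return items, None
    first, last = parsed
    # Assumption: both ends are inclusive, as with HTTP byte ranges.
    selected = items[first:last + 1]
    total = len(items)
    if not selected:
        # Nothing in range: mirror the "unit */length" unsatisfied form.
        return [], "items */{}".format(total)
    actual_last = first + len(selected) - 1
    return selected, "items {}-{}/{}".format(first, actual_last, total)


collection = list(range(100))
page, content_range = slice_items(collection, "items=10-20")
```

A range that runs past the end of the collection is clipped, and the `Content-Range` value reports the items actually returned.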
mjwillson / tax_bands.hs
Created Oct 29, 2009
Pure functional tax bands
infinity = 1.0/0
savingsIncomeTax200910 = [(0, 6475), (0.1, 8915), (0.2, 43875), (0.4, infinity)]
incomeTax200910 = [(0, 6475), (0.2, 43875), (0.4, infinity)]
taxWithBands bands income
  = if income <= threshold then
      rate * income
    else
      rate * threshold + taxWithBands loweredRemainingBands remainingIncome
mjwillson / tax_bands.rb
Created Oct 29, 2009
Same thing in ruby
Infinity = 1.0/0
SAVINGS_INCOME_TAX_200910 = [[0, 6475], [0.1, 8915], [0.2, 43875], [0.4, Infinity]]
INCOME_TAX_200910 = [[0, 6475], [0.2, 43875], [0.4, Infinity]]
def tax(income, bands=INCOME_TAX_200910)
  rate, threshold = bands.first
  if income <= threshold
    rate * income
  else
    lowered_remaining_bands = bands[1..-1].map {|r, t| [r, t - threshold]}
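Both previews are cut off before the recursive step finishes, so here is a sketch of the whole calculation in Python. The band table is copied from above. The recursive call is pieced together from the visible Haskell and Ruby lines and is an assumption about the hidden remainder.

```python
INFINITY = float("inf")

# (rate, upper threshold) pairs, as in INCOME_TAX_200910 above.
INCOME_TAX_200910 = [(0.0, 6475), (0.2, 43875), (0.4, INFINITY)]


def tax(income, bands=INCOME_TAX_200910):
    """Tax due on `income` under cumulative (rate, threshold) bands."""
    rate, threshold = bands[0]
    if income <= threshold:
        # All remaining income falls inside the current band.
        return rate * income
    # Tax the full width of this band, then shift the remaining bands
    # down by the threshold and recurse on the income above it.
    lowered_remaining_bands = [(r, t - threshold) for r, t in bands[1:]]
    remaining_income = income - threshold
    return rate * threshold + tax(remaining_income, lowered_remaining_bands)


example = tax(50000)
```

For an income of 50000 this taxes 37400 at 20% and 6125 at 40%, about 9930 in total, allowing for floating-point rounding.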
mjwillson / gist:675cc0259e4291d97104
Created Sep 7, 2015 — forked from benanne/gist:1759022
Theano AdvancedSubtensor memory leak
import theano.tensor as T
import theano
import numpy as np
import gc
def freemem():
return theano.sandbox.cuda.cuda_ndarray.cuda_ndarray.mem_info()[0] / 1024**2
mjwillson / multimethod-matrix-multiplication.clj
Last active Dec 10, 2015
An example of how multi-method-based dispatch might work for a binary operation like matrix multiplication. Illustrates how a variety of coercion-based defaults can be specified to make life easy on the implementer, while still easily allowing for dispatch to optimal implementation-specific routines whenever it's desired.
;; First some dummy matrix types and operations for us to play with later:
(deftype GenericDense [])
(deftype FooMatrix [])
(deftype BarMatrix [])
(defn generic-multiply [x y] "generic-multiply")
(defn foo-multiply [x y] "foo-multiply")
(defn bar-multiply [x y] "bar-multiply")
(defn foo-generic-multiply [x y] "foo-generic-multiply")
(defn generic-foo-multiply [x y] "generic-foo-multiply")
(defn bar-generic-multiply [x y] "bar-generic-multiply")
mjwillson / mapreduce.clj
Last active Dec 16, 2015
Clojure-like pseudo-code for something closer to the actual structure of a Hadoop MapReduce computation
;; I'm sure I'm wrong here -- please correct me!
;; (Although note I'm not trying to capture the exact algorithms and dataflow complexities of Hadoop here, just the logical structure of MapReduce computations and a rough sketch of how they're distributed.)
;; not just
(->> data
(pmap mapper)
(reduce reducer))
;; but something more like this.
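The rest of the pseudo-code is cut off in this listing. As a stand-in, here is a single-process Python sketch of the commonly described logical structure: mappers emit key/value pairs, the pairs are grouped by key, and the reducer runs once per key. This is not the author's code. `map_reduce`, `word_mapper` and `count_reducer` are hypothetical names, and nothing here models how the work is distributed.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run mapper over records, group output by key, reduce each group."""
    grouped = defaultdict(list)
    for record in records:
        # Map phase: one record may emit any number of (key, value) pairs.
        for key, value in mapper(record):
            # Shuffle phase: values are collected under their key.
            grouped[key].append(value)
    # Reduce phase: the reducer sees one key and all of its values.
    return {key: reducer(key, values) for key, values in grouped.items()}


def word_mapper(line):
    for word in line.split():
        yield word.lower(), 1


def count_reducer(word, counts):
    return sum(counts)


word_counts = map_reduce(["a b a", "B c"], word_mapper, count_reducer)
```

The difference from `(reduce reducer (pmap mapper data))` is the grouping step: the reducer is not folded over the whole output, it is applied separately to each key's values.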
mjwillson /
Created Apr 30, 2013
convert between float arrays and byte arrays
package randomindexing;
/* Helpers for encoding various primitive arrays as byte arrays.
(Why this isn't in the stdlib I have no idea! feel free to replace
with some existing library implementation...)
All encodings use little-endian byte order.
I benchmarked using
but it's around 3 times slower even on big input.
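The Java helpers themselves are not shown in this listing. For comparison, a small Python sketch of the same little-endian encoding using the standard `struct` module. The function names are hypothetical, and 32-bit floats are assumed.

```python
import struct


def floats_to_bytes(values):
    """Encode a sequence of floats as little-endian 32-bit IEEE 754."""
    return struct.pack("<{}f".format(len(values)), *values)


def bytes_to_floats(data):
    """Decode little-endian 32-bit floats; length must be a multiple of 4."""
    if len(data) % 4 != 0:
        raise ValueError("byte length must be a multiple of 4")
    return list(struct.unpack("<{}f".format(len(data) // 4), data))


encoded = floats_to_bytes([1.0, -2.5, 0.0])
decoded = bytes_to_floats(encoded)
```

The values in the example are exactly representable in 32 bits, so the round trip is lossless; arbitrary Python floats (64-bit) would be rounded on encoding.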