Skip to content

Instantly share code, notes, and snippets.

@gilbert
Last active August 29, 2015 14:09
Show Gist options
  • Save gilbert/fa990a9955c8b7050285 to your computer and use it in GitHub Desktop.
Save gilbert/fa990a9955c8b7050285 to your computer and use it in GitHub Desktop.
Hamming Distance Exercise
# Hamming
Write a program that can calculate the Hamming difference between two DNA strands.
A mutation is simply a mistake that occurs during the creation or
copying of a nucleic acid, in particular DNA. Because nucleic acids are
vital to cellular functions, mutations tend to cause a ripple effect
throughout the cell. Although mutations are technically mistakes, a very
rare mutation may equip the cell with a beneficial attribute. In fact,
the macro effects of evolution are attributable by the accumulated
result of beneficial microscopic mutations over many generations.
The simplest and most common type of nucleic acid mutation is a point
mutation, which replaces one base with another at a single nucleotide.
By counting the number of differences between two homologous DNA strands
taken from different genomes with a common ancestor, we get a measure of
the minimum number of point mutations that could have occurred on the
evolutionary path between the two strands.
This is called the 'Hamming distance'
GAGCCTACTAACGGGAT
CATCGTAATGACGGCCT
^ ^ ^ ^ ^ ^^
The Hamming distance between these two DNA strands is 7.
# Implementation notes
The Hamming distance is only defined for sequences of equal length. This means that based on the definition, each language could deal with getting sequences of equal length differently.
def hamming_distance(a, b)
# TODO
end
# # # # #
# Tests #
# # # # #
def assert(actual, expected)
if actual != expected
raise "Expected `#{expected}` (got `#{actual}` instead)"
end
end
distance = hamming_distance("GAGCCTACTAACGGGAT", "CATCGTAATGACGGCCT")
assert distance, 7
distance = hamming_distance("TTCGAGTTAGTACGTAG", "CGTGGTCGCACAAGCAC")
assert distance, 13
distance = hamming_distance("AGTAGATCCTCCCCGCG", "CATCCTATTTATTAAGT")
assert distance, 15
puts "All good."
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment